A simple Python module for parsing human names into their individual components

Derek Gulbranson

Last update: Dec 20, 2022

Related tags

Overview

Name Parser

A simple Python (3.2+ & 2.6+) module for parsing human names into their individual components.

hn.title
hn.first
hn.middle
hn.last
hn.suffix
hn.nickname
hn.surnames (middle + last)

Supported Name Structures

The supported name structure is generally "Title First Middle Last Suffix", where all pieces are optional. Comma-separated format like "Last, First" is also supported.

Title Firstname "Nickname" Middle Middle Lastname Suffix
Lastname [Suffix], Title Firstname (Nickname) Middle Middle[,] Suffix [, Suffix]
Title Firstname M Lastname [Suffix], Suffix [Suffix] [, Suffix]

Instantiating the HumanName class with a string splits on commas and then spaces, classifying name parts based on placement in the string and matches against known name pieces like titles and suffixes.

It correctly handles some common conjunctions and special prefixes to last names like "del". Titles and conjunctions can be chained together to handle complex titles like "Asst Secretary of State". It can also try to correct capitalization of names that are all upper- or lowercase names.

It attempts the best guess that can be made with a simple, rule-based approach. Its main use case is English and it is not likely to be useful for languages that do not conform to the supported name structure. It's not perfect, but it gets you pretty far.

Installation

pip install nameparser

If you want to try out the latest code from GitHub you can install with pip using the command below.

pip install -e git+git://github.com/derek73/python-nameparser.git#egg=nameparser

If you need to handle lists of names, check out namesparser, a compliment to this module that handles multiple names in a string.

Quick Start Example

>>> from nameparser import HumanName
>>> name = HumanName("Dr. Juan Q. Xavier de la Vega III (Doc Vega)")
>>> name
<HumanName : [
    title: 'Dr.'
    first: 'Juan'
    middle: 'Q. Xavier'
    last: 'de la Vega'
    suffix: 'III'
    nickname: 'Doc Vega'
]>
>>> name.last
'de la Vega'
>>> name.as_dict()
{'last': 'de la Vega', 'suffix': 'III', 'title': 'Dr.', 'middle': 'Q. Xavier', 'nickname': 'Doc Vega', 'first': 'Juan'}
>>> str(name)
'Dr. Juan Q. Xavier de la Vega III (Doc Vega)'
>>> name.string_format = "{first} {last}"
>>> str(name)
'Juan de la Vega'

The parser does not attempt to correct mistakes in the input. It mostly just splits on white space and puts things in buckets based on their position in the string. This also means the difference between 'title' and 'suffix' is positional, not semantic. "Dr" is a title when it comes before the name and a suffix when it comes after. ("Pre-nominal" and "post-nominal" would probably be better names.)

>>> name = HumanName("1 & 2, 3 4 5, Mr.")
>>> name
<HumanName : [
    title: ''
    first: '3'
    middle: '4 5'
    last: '1 & 2'
    suffix: 'Mr.'
    nickname: ''
]>

Customization

Your project may need some adjustment for your dataset. You can do this in your own pre- or post-processing, by customizing the configured pre-defined sets of titles, prefixes, etc., or by subclassing the HumanName class. See the full documentation for more information.

Full documentation

Contributing

If you come across name piece that you think should be in the default config, you're probably right. Start a New Issue and we can get them added.

Please let me know if there are ways this library could be structured to make it easier for you to use in your projects. Read CONTRIBUTING.md for more info on running the tests and contributing to the project.

GitHub Project

https://github.com/derek73/python-nameparser

Comments

Feature: Add first and middle name(s) initials
Initials can be quite important when comparing two names in order to determine whether they are the same or not. I have added a property to HumanName called initials which holds the first letters of the first name and the middle names. The list-version of the property is a list of single characters. The string-version is build from the list and has a dot and space after each character.

Some examples:

>>> HumanName("John Doe") <HumanName : [ title: '' initials: 'J.' first: 'John' middle: '' last: 'Doe' suffix: '' nickname: '' ]> >>> HumanName("Dr. Juan Q. Xavier Velasquez y Garcia") <HumanName : [ title: 'Dr.' initials: 'J. Q. X.' first: 'Juan' middle: 'Q. Xavier' last: 'Velasquez y Garcia' suffix: '' nickname: '' ]> >>> HumanName("Doe, John Boris D.") <HumanName : [ title: '' initials: 'J. B. D.' first: 'John' middle: 'Boris D.' last: 'Doe' suffix: '' nickname: '' ]> >>> HumanName("Doe, John Boris D.").initials 'J. B. D.' >>> HumanName("Doe, John Boris D.").initials_list ['J', 'B', 'D']

Since the property is derived from the first and middle names, it will not be counted in the len function nor will it be displayed in the str function.

Each time the first or middle names are updated using the setter, the initials are updated as well. The initial creation of the initials is executed in the post_process phase.

I have added tests and updated the documentation where needed.

I hope this pull request is in line with the quality requirements and vision of the library. The changes should be backwards compatible, but please let me know if I have missed anything!
enhancement
opened by rinkstiekema 20
Parsing title and last name, e.g. "Mr XXX" should be last name, not first name

Love this library - very useful.

However, I've noticed that parsing names in the format Prefix Lastname (e.g. Mr Magoo) parse with a blank name.last and the last name in the first name position (e.g. name.first == Magoo, name.last == ''). I think this should be the other way round!

I may have time to fix this in your code later, but for now i'm using the following kludge to make it work in my code...

if name.title != '': if name.last == '': name.last = name.first name.first = ''
enhancement

opened by danhartropp 16
Wrong capitalized letter in Portuguese names
First of all, congrats for the great project.

I have found a small issue related to Portuguese names. By running the following code:

from nameparser import HumanName name = HumanName('joao da silva do amaral de souza') name.capitalize() str(name)

I get the following result:

'Joao da Silva Do Amaral de Souza'

when it should be:

'Joao da Silva do Amaral de Souza'

The d from do should be lowercase.
enhancement
opened by kelvins 10
Option to get None instead of empty string
Hey,

is there an option to get None instead of an empty string for the components? E.g.

>>> name = HumanName("1 & 2, 3 4 5, Mr.") >>> name.title None >>>name.first '3'
enhancement
opened by Xennis 9
error on two joiner words

If a name contains two joiner words one after another, s.a. "John of the Doe", get: HumanName:ERROR:parser.py:Couldn't find 'The' in pieces. error.
bug

opened by daryanypl 7
possibly incorrect parsing of "grand"
Just noticed that if I parse "Grand Rounds", it returns:

>>> name <HumanName : [ title: 'Grand' first: '' middle: '' last: 'Rounds' suffix: '' nickname: '' ]>

Kinda odd no?
enhancement wontfix probabilistic
opened by thehesiod 6

a few common special cases

Sister Souljah -- "Sister" is more of a title than last name. His Holiness the Dalai Lama -- "His Holiness" -- the two words together is the title. Bob Jones, composer
Bob Jones, author Bob Jones, compositeur -- (French for composer)

Here's the code I used:

        name = HumanName('Sister Souljah')
        library_sort_name = u' '.join([name.first, name.middle, name.suffix, name.nickname, name.title])
        if name.last:
            library_sort_name = u''.join([name.last, ", ", library_sort_name])
        print "library_sort_name=%s" % library_sort_name
        
        name = HumanName('His Holiness the Dalai Lama')
        library_sort_name = u' '.join([name.first, name.middle, name.suffix, name.nickname, name.title])
        if name.last:
            library_sort_name = u''.join([name.last, ", ", library_sort_name])
        print "library_sort_name=%s" % library_sort_name
        
        name = HumanName('Bob Jones, author')
        library_sort_name = u' '.join([name.first, name.middle, name.suffix, name.nickname, name.title])
        if name.last:
            library_sort_name = u''.join([name.last, ", ", library_sort_name])
        print "library_sort_name=%s" % library_sort_name
        
        name = HumanName('Bob Jones, compositeur')
        library_sort_name = u' '.join([name.first, name.middle, name.suffix, name.nickname, name.title])
        if name.last:
            library_sort_name = u''.join([name.last, ", ", library_sort_name])
        print "library_sort_name=%s" % library_sort_name

enhancement

opened by daryanypl 6

First name Van (which is also sometimes a prefix) not handled correctly

>>> from nameparser import HumanName
... HumanName('Van Nguyen')
0: <HumanName : [
    title: '' 
    first: 'Van Nguyen' 
    middle: '' 
    last: '' 
    suffix: ''
    nickname: ''
]>
>>> import nameparser
>>> nameparser.VERSION
1: (0, 3, 3)
>>>

bug

opened by htoothrot 6

Can't handle Japanese names

name = nameparser.HumanName("鈴木太郎")

name Traceback (most recent call last): File "", line 1, in UnicodeEncodeError: 'ascii' codec can't encode characters in position 36-39: ordinal not in range(128)

Also: the concept of "last name" and "first name" isn't valid for Chinese, Japanese, Korean (CJK) names.
bug

opened by pludemann 5
Judge-related titles not parsing

Hey -

First off, awesome package. I've been working with a dataset of ~3000 judges and associated titles, and noticed nameparser doesn't pick most (well, any) of them up. Below is the filtered list with at least a few examples/variations on each. I'm happy to do the changes if you'd like. Let me know.

common

Magistrate Judge John F. Forster, Jr Magistrate Judge Joaquin V.E. Manibusan, Jr Magistrate-Judge Elizabeth Todd Campbell Mag-Judge Harwell G Davis, III Mag. Judge Byron G. Cudmore Chief Judge J. Leon Holmes Chief Judge Sharon Lovelace Blackburn Judge James M. Moody Judge G. Thomas Eisele Judge Callie V. S. Granade Judge C Lynwood Smith, Jr Senior Judge Charles R. Butler, Jr Senior Judge Harold D. Vietor Senior Judge Virgil Pittman
Honorable Terry F. Moorer Honorable W. Harold Albritton, III Honorable Judge W. Harold Albritton, III Honorable Judge Terry F. Moorer Honorable Judge Susan Russ Walker Hon. Marian W. Payson Hon. Charles J. Siragusa

rare

US Magistrate Judge T Michael Putnam Designated Judge David A. Ezra Sr US District Judge Richard G Kopf
enhancement

opened by end0 5
Ph.D., Esq., M.D., C.F.P., etc. are titles, not suffixes
Moving the previous issue from the Google Code because I like the idea and would like to implement it, someday. Originally posted by jayqhacker, Feb 7, 2012.

Ph.D., Esq., M.D. and other titles are classified as suffixes. This is perhaps convenient for parsing, since they appear at the end of a name, but they are in fact titles. A suffix distinguishes people and is part of your legal name; a title does not and (in most countries) is not. "J. Smith Jr." and "J. Smith Sr." are certainly different people, whereas "J. Smith", "J. Smith, PhD" and "J. Smith, MD" may or may not be.

I propose titles end up in the .title field, and suffices end up in the .suffix field.

Name parsing is a hard problem; ultimately I think you'd want a statistical, machine learning approach, but you can probably get pretty far with rules.

The two issues are: 1) some suffixes are part of your name, some aren't; and 2) some titles come before your name, some after.

You could solve both by splitting titles into pre- and post-titles, and making suffixes just ('jr','sr','2','i','ii','iii','iv','v').

Project Member #3 derek73

I played with adding a new list to keep track of titles that were added at the end. If we treat the suffixes as a definitive and complete list, then we can assume anything else is a title. The initials "i" and "v" are problematic, but we could probably assume that they are initials in the case of "John V".

I like the idea of separating out the parts of the name that definitely signify another person, and your definition of suffix. Thinking about it, I guess a suffix always comes directly after the name? Like you wouldn't have "John Doe, Phd, Jr". Also the case of having 2 suffixes seems somewhat remote, e.g. "'Smith, John E, III, Jr'"? So I guess that would make the patterns look something like this.

# no commas: title first middle middle middle last suffix|title_suffix title_suffix # suffix comma: title first middle last, suffix|title_suffix [, title_suffix] # lastname comma: last, title first middles[,] suffix|title_suffix [,title_suffix] SUFFIXES = set(( 'jr','sr','2','i','ii','iii','iv','v', )) TITLE_SUFFIXES = set(( 'phd','md','esquire','esq','clu','chfc','cfp', ))

I got as far as finding that equality test would need to be updated. It got me wondering if perhaps we should change the equality test, per your example, to test that ' '.join(first, middle, last, suffix) are the same. Perhaps its easy enough for someone to test if unicode() representations are equal on their own if they want titles too. Or maybe that's too smart.

That sounds like a reasonable approach. I don't personally use equality, but you might consider having it do the "dumb" least-surprise exact comparison, and adding a similarity method that returns a float in 0.0 - 1.0, eventually aiming for something like the probability that these two names reference the same person.

Also, watch out for "King John V." ;)
enhancement wontfix probabilistic
opened by derek73 5
Issue in names that contain comma(,) and has more than 3 words.

Hi Derek, I have a usecase where I need to parse names containing comma(,), but the library seems to work differently in this case. from nameparser import HumanName as hm #1) here first name should be-E ANNE and lastname should be-LEONARDO hm("E Anne D,Leonardo") <HumanName : [ title: '' first: 'Leonardo' middle: '' last: 'E Anne D' suffix: '' nickname: '' ]>

#2) here first name should be-Marry Ann and lastname should be-Luther hm("Mary Ann H,Luther") <HumanName : [ title: '' first: 'Luther' middle: '' last: 'Mary Ann H' suffix: '' nickname: '' ]>

Even if I removed the comma from the name, it has different output.

hm("Mary Ann H Luther") <HumanName : [ title: '' first: 'Mary' middle: 'Ann H' last: 'Luther' suffix: '' nickname: '' ]>

opened by da-sbarde 1

Capitalizing Suffixes

I believe acronym-based suffixes are being incorrectly capitalized.

> python3
Python 3.8.10 (default, Jun 22 2022, 20:18:18) 
[GCC 9.4.0] on linux
Type "help", "copyright", "credits" or "license" for more information.
>>> import nameparser
>>> nameparser.__version__
'1.1.1'
>>> n = nameparser.HumanName('GREGORY HOUSE M.D.')
>>> n
<HumanName : [
	title: '' 
	first: 'GREGORY' 
	middle: '' 
	last: 'HOUSE' 
	suffix: 'M.D.'
	nickname: ''
]>
>>> n.capitalize()
>>> n
<HumanName : [
	title: '' 
	first: 'Gregory' 
	middle: '' 
	last: 'House' 
	suffix: 'M.d.'
	nickname: ''
]>
>>>

I believe the suffix should be 'M.D.'

bug

opened by DLu 0

output string formatting space before suffix if suffix does not exist
I have this name: John Smith I'd like to reformat the name to look like this: Smith, John

I've set the formatting I'd like to use:

from nameparser.config import CONSTANTS CONSTANTS.string_format = "{last} {suffix}, {title} {first} ({nickname}) {middle}"

The result I get is: Smith , John due to the space that precedes {suffix} in my string_format.

However, I'd like the suffix to follow the last name if it ever occurs. Does your package allow for the trimming of space if no suffix exists, or should I implement this on my end?

Apologies if this is already addressed in the documentation! Thank you!
enhancement
opened by jocelynpender 0
Problem parsing name with , V
nameparser version 1.1.1

When I use HumanName with the string "John W. Ingram, V", it can't parse correctly but if I remove the comma, it works. Also, if I try using IV (4th) instead of V (5th), then it works with the comma so I think even though V seems to be recognized in the documentation, it isn't fully working.

from nameparser import HumanName >>> name = HumanName("John W. Ingram, V") >>> name <HumanName : [ title: '' first: 'V' middle: '' last: 'John W. Ingram' suffix: '' nickname: '' ]> >>> name = HumanName("John W. Ingram V") >>> name <HumanName : [ title: '' first: 'John' middle: 'W.' last: 'Ingram' suffix: 'V' nickname: '' ]>
bug
opened by pusateri 0
Weak Arabic Name handling

The library does not handle Arabic names well, even the most common patterns. I'm no expert on the topic, but I'm Arabic and know the common patterns.

Compound Names My first name is "Mohamad Ali", but the library identifies "Ali" as my middle name. Arabic full names of the form "Mohamad X Surname" are almost always meant to have "Mohamad X" as a first name (with exceptions such as when X is "El" or "Al", in which case the surname is compound with the first word being "El" or "Al"). Other exceptions are "Bin" (the library handles these correctly). Examples: Mohamad Khalil, Mohamad Amin, Mohamad Ali, Mohamad El Amin, Mohamad Bin Salman, etc...

Well-known Surname Suffixes Some names like "Mohamad Zeineddine" can be written as "Mohamad Zein El Dine". Here the first name is Mohamad and the surname is "Zein El Dine" which is equivalent to "Zeineddine". "El Dine"/"eddine" is an extremely common suffix to have in Arabic surnames (e.g. Zeineddine, Alameddine, Charafeddine, Safieddine, Saifeddine, etc...). Other suffixes like "-allah"/"-ullah"/"-ollah" are extremely common as well (e.g., Nasrallah). This is to say that "El Dine" and "Allah" are almost always the 2nd part of a surname (at least one more word is needed on the left to complete the surname)

Middle names hardly exist An Arabic-looking name is a good hint that there is no middle name. Arabic cultures adopt chaining of names instead of middle names (first name, followed by father's name, followed by father's father's name, etc..., and then the surname).

Edit: Honestly, the Wikipedia page discusses this really well https://en.wikipedia.org/wiki/Arabic_name
enhancement

opened by us88 1

Releases(v1.1.2.1)

v1.1.2.1(Nov 14, 2022)
Add support for attributes in constructor (#140)

Make HumanName instances hashable (#138)

Update repr for names with single quotes (#137)

Source code(tar.gz)
Source code(zip)
v1.1.2(Jan 29, 2022)
Fix bug in is_suffix() handling of lists (#129)

Source code(tar.gz)
Source code(zip)
v1.1.0(Jan 4, 2022)
Add initials support (#128)

Add more titles, suffixes and and prefixes (#120, #127, #128, #119, #116, #114, #117, #126, #102, #123)

Source code(tar.gz)
Source code(zip)
v1.0.6(Feb 8, 2020)
Fix Python 3.8 syntax error warning (#104)

Source code(tar.gz)
Source code(zip)
v1.0.5(Dec 12, 2019)
Fix suffix parsing bug in comma parts (#98)

Fix deprecation warning on Python 3.7 (#94)

Improved capitalization support of mixed case names (#90)

Remove "elder" from titles (#96)

Add post-nominal list from Wikipedia to suffixes (#93)

Source code(tar.gz)
Source code(zip)
v1.0.4(Jun 27, 2019)
Better nickname handling of multiple single quotes (#86)

full_name attribute now returns formatted string output instead of original string (#87)

Source code(tar.gz)
Source code(zip)
v1.0.3(Apr 20, 2019)
1.0.3 - April 18, 2018

fix sys.stdin usage when stdin doesn't exist (#82)

support for escaping log entry arguments (#84)

1.0.2 - Oct 26, 2018

Fix handling of only nickname and last name (#78)

Source code(tar.gz)
Source code(zip)
v1.0.1(Sep 1, 2018)
Fix overzealous regex for "Ph. D." (#43)

Add surnames attribute as aggregate of middle and last names

Source code(tar.gz)
Source code(zip)
v1.0(Aug 31, 2018)
Refactor prefix handling based on learnings from issues #72, #23, #70, and #60. New algorithm joins prefixes to the following pieces but stops at other non-contiguous prefixes or suffixes

Fix support for nicknames in single quotes (#74)

Change prefix handling to support prefixes on first names (#60)

Fix prefix capitalization when not part of lastname (#70)

Handle erroneous space in "Ph. D." (#43)

Source code(tar.gz)
Source code(zip)
v0.5.7(Jun 16, 2018)
Fix doc link (#73)

Fix handling of "do" and "dos" Portuguese prefixes (#71, #72)

Source code(tar.gz)
Source code(zip)
v0.5.6(Jun 14, 2018)

Fix python version check (#64)
Source code(tar.gz)
Source code(zip)
v0.5.4(Dec 7, 2017)
Add Dr to suffixes (#62)

Add the full set of Italian derivatives from "di" (#59)

Add parameter to specify the encoding of strings added to constants, use 'UTF-8' as fallback (#67)

Fix handling of names composed entirely of conjunctions (#66)

Source code(tar.gz)
Source code(zip)
v0.5.3(Jun 28, 2017)
Remove emojis from initial string by default with option to include emojis (#58)

Source code(tar.gz)
Source code(zip)
v0.5.2(Mar 20, 2017)
Added names scrapped from VIAF data, thanks daryanypl (#57)

Source code(tar.gz)
Source code(zip)
v0.5.1(Aug 12, 2016)

Fix error for names that end with conjunction (#54)
Source code(tar.gz)
Source code(zip)
v0.5.0(Aug 5, 2016)

Refactor join_on_conjunctions(), fix #53
Source code(tar.gz)
Source code(zip)
v0.4.1(Aug 3, 2016)
Remove "bishop" from titles because it also could be a first name

Fix handling of lastname prefixes with periods, e.g. "Jane St. John" (#50)

Source code(tar.gz)
Source code(zip)
v0.4.0(Jun 2, 2016)
Remove "CONSTANTS.suffixes", replaced by "suffix_acronyms" and "suffix_not_acronyms" (#49)

Fix broken support for multiple suffixes separated by periods instead of commas, e.g. "John Doe Msc.Ed."

Add "du" to prefixes

Add "sheikh" variations to titles

Add parameter to force capitalization of mixed-case strings

Source code(tar.gz)
Source code(zip)
v0.3.16(Mar 24, 2016)
Clarify LGPL licence version (#47)

Skip pickle tests if pickle not installed (#48)

Source code(tar.gz)
Source code(zip)
v0.3.15(Mar 21, 2016)
Fix string format when empty_attribute_default = None (#45)

Include tests in release source tarball (#46)

Source code(tar.gz)
Source code(zip)
v0.3.14(Mar 19, 2016)
Add CONSTANTS.empty_attribute_default to customize value returned for empty attributes (#44)

Source code(tar.gz)
Source code(zip)
v0.3.13(Mar 15, 2016)
Improve string format handling

Improve customization documentation

Source code(tar.gz)
Source code(zip)
v0.3.12(Mar 14, 2016)
Fix first name clash with suffixes (#42)

Fix encoding of constants added via the python shell

Add "MSC" to suffixes, fix #41

Source code(tar.gz)
Source code(zip)
v0.3.11(Oct 18, 2015)
Fix #39, capitalization bug with initials that are also conjunctions ("e" and "y")

Source code(tar.gz)
Source code(zip)
v0.3.10(Sep 20, 2015)
fix bytestring handling on python 2.x (#38)

Source code(tar.gz)
Source code(zip)
v0.3.9(Sep 5, 2015)
Separate suffixes that are acronyms to handle periods differently, fixes #29, #21

Don't find titles after first name is filled, fixes (#27)

Add "chair" titles (#37)

Source code(tar.gz)
Source code(zip)
v0.3.8(Sep 3, 2015)
Fixes for #36, better handling of roman numerals at the end of a name

Source code(tar.gz)
Source code(zip)
v0.3.7(Aug 31, 2015)
Speed improvement, 3x faster

Make HumanName instances pickleable

Source code(tar.gz)
Source code(zip)
v0.3.6(Aug 6, 2015)
Fix strings that start with conjunctions (#20)

handle assigning lists of names to a name attribute

support dictionary-like assignment of name attributes

Source code(tar.gz)
Source code(zip)
v0.3.5(Aug 4, 2015)
Fix handling of string encoding in python 2.x (#34)

Add support for dictionary key access, e.g. name['first']

add 'santa' to prefixes, add 'cpa', 'csm', 'phr', 'pmp' to suffixes (#35)

Fix prefixes before multi-part last names (#23)

Fix capitalization bug (#30)

Source code(tar.gz)
Source code(zip)

Owner

Derek Gulbranson

GitHub http://nameparser.readthedocs.org/en/latest/

A username generator made from French Canadian most common names.

This script is used to generate a username list using the most common first and last names in Quebec in different formats. It can generate some passwords using specific patterns such as Tremblay2020.

5 Nov 26, 2022

Search for terms(word / table / field name or any) under Snowflake schema names

snowflake-search-terms-in-ddl-views Search for terms(word / table / field name or any) under Snowflake schema names Version : 1.0v How to use ? Run th

1 Dec 15, 2021

Find a Doc is a free online resource aimed at helping connect the foreign community in Japan with health services in their native language.

Find a Doc - Localization Find a Doc is a free online resource aimed at helping connect the foreign community in Japan with health services in their n

18 Dec 19, 2022

A Python3 script that simulates the user typing a text on their keyboard.

A Python3 script that simulates the user typing a text on their keyboard. (control the speed, randomness, rate of typos and more!)

3 Feb 22, 2022

The project is investigating methods to extract human-marked data from document forms such as surveys and tests.

The project is investigating methods to extract human-marked data from document forms such as surveys and tests. They can read questions, multiple-choice exam papers, and grade.

5 Mar 27, 2022

The Levenshtein Python C extension module contains functions for fast computation of Levenshtein distance and string similarity

Contents Maintainer wanted Introduction Installation Documentation License History Source code Authors Maintainer wanted I am looking for a new mainta

1.2k Dec 16, 2022

A non-validating SQL parser module for Python

python-sqlparse - Parse SQL statements sqlparse is a non-validating SQL parser for Python. It provides support for parsing, splitting and formatting S

3.1k Jan 4, 2023

PyMultiDictionary is a Dictionary Module for Python 3+ to get meanings, translations, synonyms and antonyms of words in 20 different languages

PyMultiDictionary PyMultiDictionary is a Dictionary Module for Python 3+ to get meanings, translations, synonyms and antonyms of words in 20 different

19 Dec 26, 2022

py-trans is a Free Python library for translate text into different languages.

Free Python library to translate text into different languages.

13 Aug 27, 2022

A python Tk GUI that creates, writes text and attaches images into a custom spreadsheet file

13 Dec 9, 2022

Markup is an online annotation tool that can be used to transform unstructured documents into structured formats for NLP and ML tasks, such as named-entity recognition. Markup learns as you annotate in order to predict and suggest complex annotations. Markup also provides integrated access to existing and custom ontologies, enabling the prediction and suggestion of ontology mappings based on the text you're annotating.

Markup is an online annotation tool that can be used to transform unstructured documents into structured formats for NLP and ML tasks, such as named-entity recognition. Markup learns as you annotate in order to predict and suggest complex annotations. Markup also provides integrated access to existing and custom ontologies, enabling the prediction and suggestion of ontology mappings based on the text you're annotating.

146 Dec 18, 2022

Split large XML files into smaller ones for easy upload

Split large XML files into smaller ones for easy upload. Works for WordPress Posts Import and other XML files.

1 Jan 30, 2022

PyNews 📰 Simple newsletter made with python 🐍🗞️

PyNews ?? Simple newsletter made with python Install dependencies This project has some dependencies (see requirements.txt) that are not included in t

4 Aug 21, 2022

Simple python program to auto credit your code, text, book, whatever!

Credit Simple python program to auto credit your code, text, book, whatever! Setup First change credit_text to whatever text you would like to credit

1 Jan 29, 2022

🚩 A simple and clean python banner generator - Banners

?? A simple and clean python banner generator - Banners

12 Oct 9, 2022

Fuzzy string matching like a boss. It uses Levenshtein Distance to calculate the differences between sequences in a simple-to-use package.

1.2k Jan 1, 2023

A simple Python module for parsing human names into their individual components

Related tags

Overview

Name Parser

Supported Name Structures

Installation

Quick Start Example

Customization

Contributing

Comments

common

rare

Releases(v1.1.2.1)

v1.1.2.1(Nov 14, 2022)

v1.1.2(Jan 29, 2022)

v1.1.0(Jan 4, 2022)

v1.0.6(Feb 8, 2020)

v1.0.5(Dec 12, 2019)

v1.0.4(Jun 27, 2019)

v1.0.3(Apr 20, 2019)

v1.0.1(Sep 1, 2018)

v1.0(Aug 31, 2018)

v0.5.7(Jun 16, 2018)

v0.5.6(Jun 14, 2018)

v0.5.4(Dec 7, 2017)

v0.5.3(Jun 28, 2017)

v0.5.2(Mar 20, 2017)

v0.5.1(Aug 12, 2016)

v0.5.0(Aug 5, 2016)

v0.4.1(Aug 3, 2016)

v0.4.0(Jun 2, 2016)

v0.3.16(Mar 24, 2016)

v0.3.15(Mar 21, 2016)

v0.3.14(Mar 19, 2016)

v0.3.13(Mar 15, 2016)

v0.3.12(Mar 14, 2016)

v0.3.11(Oct 18, 2015)

v0.3.10(Sep 20, 2015)

v0.3.9(Sep 5, 2015)

v0.3.8(Sep 3, 2015)

v0.3.7(Aug 31, 2015)

v0.3.6(Aug 6, 2015)

v0.3.5(Aug 4, 2015)

Owner

Derek Gulbranson

A username generator made from French Canadian most common names.

Search for terms(word / table / field name or any) under Snowflake schema names

Find a Doc is a free online resource aimed at helping connect the foreign community in Japan with health services in their native language.

A Python3 script that simulates the user typing a text on their keyboard.

The project is investigating methods to extract human-marked data from document forms such as surveys and tests.

The Levenshtein Python C extension module contains functions for fast computation of Levenshtein distance and string similarity

A non-validating SQL parser module for Python

PyMultiDictionary is a Dictionary Module for Python 3+ to get meanings, translations, synonyms and antonyms of words in 20 different languages

py-trans is a Free Python library for translate text into different languages.

A python Tk GUI that creates, writes text and attaches images into a custom spreadsheet file

Split large XML files into smaller ones for easy upload

PyNews 📰 Simple newsletter made with python 🐍🗞️

Simple python program to auto credit your code, text, book, whatever!

🚩 A simple and clean python banner generator - Banners

Fuzzy string matching like a boss. It uses Levenshtein Distance to calculate the differences between sequences in a simple-to-use package.

Making simplex testing clean and simple

A simple text editor for linux

Free & simple way to encipher text