Skip to content

find any kind of occupation or job title in a text or file

License

Notifications You must be signed in to change notification settings

fluquid/find_job_titles

Folders and files

NameName
Last commit message
Last commit date

Latest commit

a8a64e8 · Mar 12, 2020

History

29 Commits
Aug 17, 2017
Mar 10, 2020
Mar 10, 2020
Aug 22, 2017
Aug 22, 2017
Aug 17, 2017
Aug 17, 2017
Mar 10, 2020
Sep 6, 2019
Aug 17, 2017
Aug 17, 2017
Mar 10, 2020
Aug 17, 2017
Aug 17, 2017
Aug 17, 2017
Oct 24, 2017
Aug 17, 2017
Mar 10, 2020
Aug 17, 2017
Aug 17, 2017
Aug 22, 2017
Aug 17, 2017
Aug 17, 2017
Aug 17, 2017
Aug 17, 2017
Sep 6, 2019

Repository files navigation

find_job_titles

Coverage Status

Find Job Titles in Strings

  • Free software: MIT license
  • Python versions: 2.7, 3.4+

Features

  • Find any of 77k job titles in a given string
  • Text processing is extremely fast using "acora" library
  • Dictionary generation takes about 20 seconds upfront

Quickstart

Instantiate "Finder" and start extracting job titles:

>>> from find_job_titles import FinderAcora
>>> finder=FinderAcora()
>>> finder.findall(u'I am the Senior Vice President')
[('Senior Vice President', 9),
 ('Vice President', 16),
 ('President', 21)]

All possible, overlapping matches are returned. Matches contain positional information of where the match was found.

Alternatively use "finditer" for lazy consumption of matches:

>>> finder.finditer('I am the Senior Vice President')]
<generator object ...>

Credits

This package was created with Cookiecutter and the fluquid/cookiecutter-pypackage project template.