Résumé parsing

Last updated

Resume parsing, also known as CV parsing, resume extraction, or CV extraction, allows for the automated storage and analysis of resume data. The resume is imported into parsing software and the information is extracted so that it can be sorted and searched.

Contents

Principle

Resume parsers analyze a resume, extract the desired information, and insert the information into a database with a unique entry for each candidate. [1] Once the resume has been analyzed, a recruiter can search the database for keywords and phrases and get a list of relevant candidates. Many parsers support semantic search, which adds context to the search terms and tries to understand intent in order to make the results more reliable and comprehensive. [2]

Machine learning

Machine learning is extremely important for resume parsing. Each block of information needs to be given a label and sorted into the correct category, whether that's education, work history, or contact information. [3] Rule-based parsers use a predefined set of rules to parse the text. This method does not work for resumes because the parser needs to "understand the context in which words occur and the relationship between them." [4] For example, if the word "Harvey" appears on a resume, it could be the name of an applicant, refer to the college Harvey Mudd, or reference the company Harvey & Company LLC. The abbreviation MD could mean "Medical Doctor" or "Maryland". A rule-based parser would require incredibly complex rules to account for all the ambiguity and would provide limited coverage.

Natural language processing (NLP) is a branch of artificial intelligence which uses machine learning to make predictions and to understand content and context. [5] Acronym normalization and tagging accounts for the different possible formats of acronyms and normalizes them. Lemmatization reduces words to their root using a language dictionary and stemming removes “s”, “ing”, etc. Entity extraction uses regular expressions, dictionaries, statistical analysis and complex pattern-based extraction to identify people, places, companies, phone numbers, email addresses, important phrases and more. [4]

Effectiveness

Resume parsers have achieved up to 87% accuracy, [6] which refers to the accuracy of data entry and categorizing the data correctly. Human accuracy is typically not greater than 96%, so the resume parsers have achieved "near human accuracy." [7]

One executive recruiting company tested three resume parsers and humans to compare the accuracy in data entry. They ran 1000 resumes through the resume parsing software and had humans manually parse and enter the data. The company brought in a third party to evaluate how the humans did compared to the software. They found that the results from the resume parsers were more comprehensive and had fewer mistakes. The humans did not enter all the information on the resumes and occasionally misspelled words or wrote incorrect numbers. [8]

In a 2012 experiment, a resume for an ideal candidate was created based on the job description for a clinical scientist position. After going through the parser, one of the candidate's work experiences was completely lost due to the date being listed before the employer. The parser also didn't catch several educational degrees. The result was that the candidate received a relevance ranking of only 43%. If this had been a real candidate's resume, they wouldn't have moved on to the next step even though they were qualified for the position. [9] It would be helpful if a similar study was conducted on current resume parsers to see if there have been any improvements over the past few years.

Benefits

Challenges

The parsing software has to rely on complex rules and statistical algorithms to correctly capture the desired information in the resumes. There are many variations of writing style, word choice, syntax, etc. and the same word can have multiple meanings. The date alone can be written hundreds of different ways. [1] It is still a challenge for these resume parsers to account for all the ambiguity. Natural Language Processing and Artificial Intelligence still have a way to go in understanding context-based information and what humans mean to convey in written language.

Resume optimization

Resume parsers have become so omnipresent that it is now recommended that candidates focus on writing to the parsing system rather than to the recruiter. The following techniques have been proposed to increase the probability of success:

  1. Use keywords from the job description in relevant places on your resume. [12]
  2. Don't use headers or footers, since they may confuse the parsing algorithms. [15]
  3. Use a simple style for fonts, layouts and formatting. [15]
  4. Avoid graphics. [15]
  5. Use standard section names such as “Work Experience” and “Education”. [3]
  6. Avoid using acronyms unless they're included in the job description. [3]
  7. Don't start with dates in the "Work Experience" section. [3]
  8. Stay consistent with formatting past work experience. [9]
  9. Send resume in docx, doc and PDF file formats. [3]

Software and vendors

There are many stand-alone options for resume parsers including [16] RChilli, Skillate, CandidateZip, Sovren, Daxtra, Textkernel, Hireability and they are also typically bundled in with applicant tracking systems, which are used by companies to streamline the hiring process. [17]

With recent advancements in machine learning, the text mining and analysis processes, which ensure up to 95% accuracy [18] in data processing, many AI technologies [19] have sprung up to help the job seekers in the creation of application documents. These services focus on creating ATS-friendly resumes, execute resume check and screening, and help with all of the preparation and application processes. Some of the AI builders, such as Leap.ai and Skillroads, concentrate on the resume creation while others, like Stella, also offer help with the job hunt itself as they match candidates to appropriate vacancies. In 2017, Google launched Google for Jobs. This expansion to the search engine uses Cloud Talent Solution, [20] Google's own iteration of the AI resume builder and matching system.

Future

Resume parsers are already standard in most mid- to large-sized companies and this trend will continue as the parsers become even more affordable. [12]

A qualified candidate's resume can be ignored if it is not formatted the proper way or doesn't contain specific keywords or phrases. As Machine Learning and Natural Language Processing get better, so will the accuracy of resume parsers.

One of the areas resume parsing software is working on expanding into is performing contextual analysis on the information in the resume rather than purely extracting it. One employee at a parsing company said “a parser needs to classify data, enrich it with knowledge from other sources, normalize data so it can be used for analysis and allow for better searching.” [21]

Parsing companies are also being asked to expand beyond just resumes or even LinkedIn profiles. They are working on extracting information from industry-specific sites such as GitHub and social media profiles. [21]  

Related Research Articles

<span class="mw-page-title-main">Optical character recognition</span> Computer recognition of visual text

Optical character recognition or optical character reader (OCR) is the electronic or mechanical conversion of images of typed, handwritten or printed text into machine-encoded text, whether from a scanned document, a photo of a document, a scene photo or from subtitle text superimposed on an image.

<span class="mw-page-title-main">Interview</span> Structured series of questions and answers

An interview is a structured conversation where one participant asks questions, and the other provides answers. In common parlance, the word "interview" refers to a one-on-one conversation between an interviewer and an interviewee. The interviewer asks questions to which the interviewee responds, usually providing information. That information may be used or provided to other audiences immediately or later. This feature is common to many types of interviews – a job interview or interview with a witness to an event may have no other audience present at the time, but the answers will be later provided to others in the employment or investigative process. An interview may also transfer information in both directions.

<span class="mw-page-title-main">Résumé</span> Document describing a persons skills

A résumé, sometimes spelled resume, is a document created and used by a person to present their background, skills, and accomplishments. Résumés can be used for a variety of reasons, but most often they are used to secure new employment.

Staffing is the process of finding the right worker with appropriate qualifications or experience and recruiting them to fill a job position or role. Through this process, organizations acquire, deploy, and retain a workforce of sufficient quantity and quality to create positive impacts on the organization's effectiveness. In management, staffing is an operation of recruiting the employees by evaluating their skills and knowledge before offering them specific job roles accordingly.

<span class="mw-page-title-main">Recruitment</span> Process of attracting, selecting and appointing candidates to a job or other organization

Recruitment is the overall process of identifying, sourcing, screening, shortlisting, and interviewing candidates for jobs within an organization. Recruitment also is the process involved in choosing people for unpaid roles. Managers, human resource generalists, and recruitment specialists may be tasked with carrying out recruitment, but in some cases, public-sector employment, commercial recruitment agencies, or specialist search consultancies such as Executive search in the case of more senior roles, are used to undertake parts of the process. Internet-based recruitment is now widespread, including the use of artificial intelligence (AI).

An applicant tracking system (ATS) is a software application that enables the electronic handling of recruitment and hiring needs. An ATS can be implemented or accessed online at enterprise- or small-business levels, depending on the needs of the organization; free and open-source ATS software is also available. An ATS is very similar to customer relationship management (CRM) systems, but are designed for recruitment tracking purposes. In many cases they filter applications automatically based on given criteria such as keywords, skills, former employers, years of experience and schools attended. This practice of application filtering has caused many to adopt resume optimization techniques similar to those used in search engine optimization when creating and formatting their résumé.

<span class="mw-page-title-main">Job interview</span> Type of interview

A job interview is an interview consisting of a conversation between a job applicant and a representative of an employer which is conducted to assess whether the applicant should be hired. Interviews are one of the most common methods of employee selection. Interviews vary in the extent to which the questions are structured, from an unstructured and informal conversation to a structured interview in which an applicant is asked a predetermined list of questions in a specified order; structured interviews are usually more accurate predictors of which applicants will make suitable employees, according to research studies.

Yahoo HotJobs, formerly known as hotjobs.com, was an online job search engine. It provided tools and advice for job seekers, employers, and staffing firms. It was acquired by Yahoo in 2002, then acquired by Monster Worldwide, owner of its major competitor Monster.com in 2010—leading to its merger with Monster.com and eventual closure.

Sourcing is a talent acquisition discipline which is focused on the identification, assessment and engagement of skilled worker candidates through proactive recruiting techniques. Professionals specializing in sourcing are known primarily as Sourcers; but also Internet Recruiters, Recruiting Researchers or Talent Scouts.

Video résumé or video resume is a recording promoting a job seeker.

<span class="mw-page-title-main">Application for employment</span> Standard business document

An application for employment is a standard business document that is prepared with questions deemed relevant by employers. It is used to determine the best candidate to fill a specific role within the company. Most companies provide such forms to anyone upon request, at which point it becomes the responsibility of the applicant to complete the form and return it to the employer for consideration. The completed and returned document notifies the company of the applicant's availability and desire to be employed as well as their qualifications and background so that a determination can be made as to the candidate's suitability to the position.

A talent community is a mechanism that employers use to keep active pipelines of talent for future recruitment. Talent pipelines consist of potential employees who are interested in working at a given employer, but are not ready to apply. This group is engaged with on a frequent basis so that when the potential employees are ready to seek a new job, they have a relationship with the company.

One-way interview, also known as asynchronous interview, pre recorded interview, virtual interview or digital interview, enables prospective employers to conduct online video interviews in an automated fashion. The interviews are conducted via websites or internet-enabled devices which use digital interviewing applications.

Entelo, Inc. is a privately held technology company located in San Francisco, California. Entelo offers a SaaS platform used by recruiters and hiring managers to find potential employees.

Recruitment marketing refers to the inbound strategies and tactics an organization uses to find, attract, engage, and nurture talent before they apply for a job, also called the pre-applicant phase of talent acquisition. It is the practice of promoting the benefits and value of working for an employer in order to recruit talent. It is analogous in many ways to corporate marketing, and is extremely similar to employer branding except recruitment marketing relates to trackable initiatives that drive awareness, engagement and conversion of applicants versus someone's impression of working at a company. Of course others see employer branding as a subset of recruitment marketing, in addition to extending the reach and exposure of career opportunities through search engine optimization (SEO), building and nurturing candidate relationships through talent communities, and the management of messaging and advertising of talent acquisition efforts.

A human resources management system (HRMS) or Human Resources Information System (HRIS) or Human Capital Management (HCM) is a form of Human Resources (HR) software that combines a number of systems and processes to ensure the easy management of human resources, business processes and data. Human resources software is used by businesses to combine a number of necessary HR functions, such as storing employee data, managing payroll, recruitment, benefits administration, time and attendance, employee performance management, and tracking competency and training records.

<span class="mw-page-title-main">Algorithmic bias</span> Technological phenomenon with social implications

Algorithmic bias describes systematic and repeatable errors in a computer system that create "unfair" outcomes, such as "privileging" one category over another in ways different from the intended function of the algorithm.

Artificial intelligence (AI) in hiring involves the use of technology to automate aspects of the hiring process. Advances in artificial intelligence, such as the advent of machine learning and the growth of big data, enable AI to be utilized to recruit, screen, and predict the success of applicants. Proponents of artificial intelligence in hiring claim it reduces bias, assists with finding qualified candidates, and frees up human resource workers' time for other tasks, while opponents worry that AI perpetuates inequalities in the workplace and will eliminate jobs. Despite the potential benefits, the ethical implications of AI in hiring remain a subject of debate, with concerns about algorithmic transparency, accountability, and the need for ongoing oversight to ensure fair and unbiased decision-making throughout the recruitment process.

A social media background check is an investigative technique that involves scrutinizing the social media profiles and activities of individuals, primarily for pre-employment screening and other official verifications. These checks are performed to review people's online behavioral history on social media websites such as Facebook, Twitter, and LinkedIn. Social media background checks have become a common part of recruitment processes, among other verification procedures.

<span class="mw-page-title-main">HireVue</span> American human resources technology company

HireVue is an artificial intelligence (AI) and human resources management company headquartered in South Jordan, Utah. Founded in 2004, the company allows its clients to conduct digital interviews during the hiring process, where the job candidate interacts with a computer instead of a human interviewer.

References

  1. 1 2 “What Is CV/Resume Parsing?” Daxtra, Daxtra Technologies Ltd, 18 Oct. 2016, https://info.daxtra.com/blog/2016/10/18/what-is-cvresume-parsing.
  2. Ratcliff, Christopher. “Search Engine Watch.” What Is Semantic Search and Why Does It Matter?, ClickZ Group Limited, 21 Oct. 2015, searchenginewatch.com/sew/opinion/2431292/what-is-semantic-search-and-why-does-it-matter.
  3. 1 2 3 4 5 “Is Your Resume Ready for Automated Screening?” Resume Hacking, Resume Hacking, 2 Jan. 2016, www.resumehacking.com/ready-for-automated-resume-screening.
  4. 1 2 Nelson, Paul. "Natural Language Processing (NLP) Techniques for Extracting Information." Search Technologies, Search Technologies, www.searchtechnologies.com/blog/natural-language-processing-techniques.
  5. Reynolds, Brandon. “The Terrible Trouble with Natural Language Processing (It's Us.).”Salesforce Blog, Salesforce.com, Inc., 17 Aug. 2016, www.salesforce.com/blog/2016/08/trouble-with-natural-language-processing.html.
  6. "HR software companies? Why structuring your data is crucial for your business?". 15 April 2019.
  7. “The Ultimate Guide to CV/Resume Parsing.” Daxtra, Daxtra Technologies Ltd, 30 Jun. 2022, https://info.daxtra.com/the-ultimate-guide-to-cv-resume-parsing.
  8. "A Top Executive Recruiter Puts Accuracy to the Ultimate Test." Resume Parsing: Putting Accuracy to the Ultimate Test, Sovren Group, Inc., www.sovren.com/resource-center/a-top-executive-recruiter-puts-accuracy-to-the-ultimate-test/.
  9. 1 2 Levinson, Meridith. “5 Insider Secrets for Beating Applicant Tracking Systems (ATS).”CIO, CIO, 1 Mar. 2012, www.cio.com/article/2398753/careers-staffing/careers-staffing-5-insider-secrets-for-beating-applicant-tracking-systems.html.
  10. Bertrand, Marianne; Mullainathan, Sendhil (July 2003). "Are Emily and Greg More Employable than Lakisha and Jamal? A Field Experiment on Labor Market Discrimination". National Bureau of Economic Research. 9873. doi: 10.3386/w9873 .
  11. “3 Ways Recruiters Can Use AI to Reduce Unconscious Bias.” Undercover Recruiter, 12 May 2017, theundercoverrecruiter.com/ai-reduce-unconscious-bias/.
  12. 1 2 3 4 “Baby Steps in HR Technology: What Is Resume Parsing?” Recruiterbox, Recruiterbox Inc, 12 Oct. 2017, recruiterbox.com/blog/baby-steps-in-hr-technology-what-is-resume-parsing-2/.
  13. Cain, Áine. “The Real Reason 60% of Job Seekers Can't Stand the Application Process.” Business Insider, Business Insider, 16 June 2016, www.businessinsider.com/why-most-ob-seekers-cant-stand-the-application-process-2016-6.
  14. Schultz, Carol. “Got a Minute? If So, Spend It Looking at Resumes.” ERE, ERE Media., 3 May 2012, www.ere.net/got-a-minute-if-so-spend-it-looking-at-resumes/.
  15. 1 2 3 Cappelli, Peter. “How to Get a Job? Beat the Machines.” Time, Time Inc., 11 June 2012, business.time.com/2012/06/11/how-to-get-a-job-beat-the-machines/.
  16. "What is the best resume parsing software?".
  17. Hu, James. “Your Top 7 Questions About Applicant Tracking Systems, Answered.”Recruiter, Recruiter.com, Inc., 16 Aug. 2017, www.recruiter.com/i/your-top-7-questions-about-applicant-tracking-systems-answered/.
  18. "up to 95% accuracy". Towards Data Science. 17 January 2018.
  19. "AI technologies that help you to get hired". Skillroads.
  20. "Cloud Talent Solution". Google.
  21. 1 2 Zielinkski, Dave. “Does Your Resume Parser Stack Up? How to Evaluate Next-Generation Systems.” SHRM Society for Human Resource Management, SHRM, 10 May 2016, www.shrm.org/resourcesandtools/hr-topics/technology/pages/does-your-resume-parser-stack-up-how-to-evaluate-next-generation-systems.aspx?sthash.2dz2wgkl.mjjo.