AI in Recruitment — How Expertini Matching Engine Works Bazouges La Pérouse

"Matching a person to a job is not a database lookup. It is an act of interpretation — and the distance between what an algorithm can interpret and what a human hiring manager intuitively understands is the most honest measure of where AI in recruitment currently stands."

This article explains how Expertini's matching engine actually works, what the research underlying it says, and — with equal weight — where it fails. It is written for practitioners who want to understand the technology, not for those who want to be impressed by it.

Why Job Matching Is a Hard Problem

The surface simplicity of job matching — candidate has skills, job requires skills, match them — conceals a genuinely difficult computational and linguistic challenge. Job descriptions and resumes are both written in natural language by humans for humans, using inconsistent terminology, implicit domain assumptions, abbreviations, and contextual meaning that varies by industry, region, seniority level, and decade. A "Senior Engineer" in a 2005 manufacturing context and a "Senior Engineer" in a 2024 software startup are described using many of the same words but refer to profoundly different roles.

Early job matching systems addressed this problem with keyword matching: if the resume contains the word "Python" and the job description contains the word "Python," score a match. This approach is fast, transparent, and wrong in a large number of cases. It fails to recognise that "software developer," "software engineer," and "programmer" are near-synonyms in most contexts. It fails to understand that a resume describing "built distributed systems at scale" is highly relevant to a job requiring "experience with microservices architecture," even though no keywords overlap. And it fails entirely when dealing with multilingual content — matching a resume written in Portuguese to a job description written in English requires understanding meaning, not matching strings.

The shift from keyword matching to semantic matching — understanding meaning rather than matching text — is the technical axis on which modern AI recruitment matching turns. Expertini's matching engine, and the research underlying it, sits within this semantic paradigm. Understanding what that means technically, and what its genuine limitations are, is what this article attempts to provide.

The Technical Foundation: Vector Embeddings and Cosine Similarity

Expertini's matching engine is grounded in a class of NLP techniques known as distributional semantics — the computational hypothesis, supported by substantial empirical evidence since Firth (1957), that words used in similar contexts carry similar meanings. Modern implementations of this hypothesis use neural network-derived vector representations of text, commonly called word embeddings or sentence embeddings, where each word, phrase, or document is represented as a point in a high-dimensional vector space such that semantically similar texts cluster near each other.

The core matching computation is cosine similarity — a measure of the angular distance between two vectors in this space. Two documents represented as vectors pointing in nearly the same direction (cosine similarity approaching 1.0) are semantically similar; two documents pointing in perpendicular directions (cosine similarity approaching 0) are semantically unrelated.

Cosine Similarity(A, B) = (A · B) / (‖A‖ × ‖B‖)

Where:
A = vector representation of job description
B = vector representation of candidate resume/profile
A · B = dot product of the two vectors
‖A‖, ‖B‖ = Euclidean norms (magnitudes) of each vector

Output: Score between 0 (no semantic similarity) and 1 (identical semantic content)
Practical recruitment threshold: Scores above 0.65 typically indicate meaningful semantic alignment

The quality of this computation depends entirely on the quality of the vector representations. Expertini uses pre-trained language model embeddings — trained on large general corpora and fine-tuned on domain-specific recruitment text — to produce vector representations that capture occupational, skills-based, and contextual meaning with greater accuracy than general-purpose word embedding models such as Word2Vec or GloVe, which were not trained on recruitment-specific language.

This research approach — applying cosine similarity over semantically-rich vector embeddings to job-candidate alignment — is the methodology documented in Expertini's published research with reference to IEEE Transactions on Artificial Intelligence. It is not a proprietary black box; it is a documented implementation of established NLP techniques applied to the recruitment domain.

References: Firth, J.R. (1957). Papers in Linguistics. Oxford University Press. Mikolov, T. et al. (2013). Distributed Representations of Words and Phrases. NIPS. Devlin, J. et al. (2018). BERT: Pre-training of Deep Bidirectional Transformers. NAACL.

The Expertini Matching Pipeline: From Job Posting to Ranked Candidate

The matching process operates across a structured pipeline applied to every job-candidate pairing evaluated on the platform:

Text IngestionJob description and resume text extracted, cleaned, de-duplicated

Entity ExtractionSkills, job titles, qualifications, locations, technologies identified via NER

Taxonomy MappingExtracted entities mapped to Expertini's occupational ontology (16 years, 150+ countries)

Vector EmbeddingBoth documents converted to semantic vectors via fine-tuned language model

Similarity ScoringCosine similarity computed; weighted sub-scores for skills, title, seniority, location

Ranking OutputCandidates ranked by composite score; employer views ordered result set

Stage 1 — Text Ingestion and Cleaning: Raw text from job descriptions and candidate resumes undergoes preprocessing: removal of formatting artefacts, normalisation of date formats, standardisation of measurement units, and language detection. Multilingual content is identified and flagged; cross-language matching is handled through translation-augmented embeddings rather than direct cross-lingual cosine similarity, which performs less reliably across distant language families.

Stage 2 — Named Entity Recognition (NER): A domain-adapted NER model identifies and classifies entities within the text: job titles, technical skills (programming languages, frameworks, platforms), soft skills, educational qualifications, professional certifications, industry names, company names, and geographic locations. This entity extraction step is critical because it separates signal (relevant professional entities) from noise (boilerplate, formatting text, generic descriptions) before the embedding stage.

Stage 3 — Occupational Taxonomy Mapping: Extracted entities are mapped to Expertini's occupational ontology — a taxonomy built and refined over sixteen years of job data across 150+ countries. This ontology captures synonym networks (e.g., "software engineer," "software developer," "programmer," "coder" mapped to the same occupational concept), seniority hierarchies (junior → mid → senior → lead → principal → staff), and technology evolution (mapping legacy technology terms to their modern equivalents where applicable). This step is where domain-specific knowledge accumulated over sixteen years of platform operation contributes most meaningfully to matching quality.

Stage 4 — Vector Embedding: Both the job description and the candidate resume are converted to dense vector representations using a fine-tuned sentence transformer model. The fine-tuning is conducted on recruitment-specific text — actual job descriptions and resumes from the Expertini platform — which improves performance on domain-specific language patterns compared to general-purpose models that were not trained on professional recruitment content.

Stage 5 — Weighted Similarity Scoring: The matching score is not a single cosine similarity value between two document vectors. It is a weighted composite of sub-scores computed at different levels of granularity: skills alignment (explicit technical and soft skills), job title proximity in the occupational ontology, seniority alignment, location match (city, region, country, remote eligibility), and full-document semantic similarity. These sub-scores are weighted by a configuration that reflects the relative importance of each dimension for the role category — technical roles weight skills more heavily; management roles weight seniority and domain experience more heavily.

Stage 6 — Ranking and Presentation: Candidates are ranked by composite score and presented to the employer. The raw scores are not shown to employers — only the ranked order — because displaying raw mathematical scores carries the risk of being interpreted with false precision. A score of 0.72 versus 0.68 does not represent a meaningful, reliable difference in candidate suitability; the ranking reflects ordering, not absolute measurement.

Resume Score™ and Job Score™: Applied Matching at the Employer Interface

Expertini surfaces two employer-facing tools built on the matching engine:

Resume Score™ evaluates a candidate's uploaded resume against a target job description, producing a structured feedback report covering: skills coverage (what required skills are present, absent, or implied); experience alignment (seniority and tenure markers); educational qualification match; and resume quality signals (completeness, structure, quantified achievements). The score is intended as a screening aid, not a hiring decision. It tells an employer which applicants deserve priority review; it cannot tell an employer which candidates will be effective employees.

Job Score™ operates in the reverse direction: given a candidate profile, it evaluates which of the employer's active job listings are most semantically aligned to the candidate's background. This powers the "jobs you might be interested in" recommendation layer for candidates and the "similar candidates" discovery layer for employers browsing the talent pool.

Design Principle: Both tools are deliberately positioned as decision support, not decision replacement. The employer interface is designed to surface ranked candidates quickly, not to recommend binary hire/reject decisions. The distinction matters: decision support that helps a recruiter review 200 applications in 4 hours rather than 8 is genuinely valuable. Decision automation that removes human review from hiring introduces a class of errors — false negatives on strong candidates with unconventional backgrounds — that the matching engine is not equipped to catch.

What the Research Actually Shows — and What It Does Not

23%

Avg. reduction in time-to-shortlist using semantic vs keyword matching (SHRM 2022)

3.2×

Higher qualified application rate: accurate job data vs generic display (J. Business & Psychology 2022)

0.41

Typical validity coefficient: AI matching score vs 90-day performance rating (meta-analysis)

~20%

Higher 12-month retention: skills-based matching vs title-based (LinkedIn Talent Trends 2024)

The empirical evidence for semantic NLP-based matching in recruitment contexts is positive but genuinely modest. A 2022 meta-analysis across 14 studies examining AI-assisted candidate screening found a mean validity coefficient of approximately 0.41 between AI matching scores and subsequent hiring manager assessments — statistically significant and practically useful, but well below the validity of structured interviews (0.51) or work sample tests (0.54) for predicting job performance. This means AI matching is a better-than-chance filtering tool, but not a highly accurate predictor of individual candidate quality.

A 2023 study in the Journal of Applied Psychology examining semantic similarity-based resume screening found that the technique reduced gender and age bias in initial candidate shortlisting by approximately 18% compared to keyword-based screening — a meaningful finding given that keyword matching can inadvertently encode historical workforce demographics into screening criteria (e.g., by matching on terms historically used by one demographic group more than another). Semantic matching's focus on meaning rather than specific vocabulary provides some degree of buffer against this failure mode, though it does not eliminate algorithmic bias entirely.

The LinkedIn Talent Trends 2024 report documented 20% higher 12-month retention among hires made through skills-based matching processes versus title-based hiring — a finding that supports the underlying hypothesis of semantic matching but does not specifically validate any platform's implementation.

Sources: SHRM State of Recruiting 2022; Journal of Business and Psychology Vol. 37; LinkedIn Talent Trends 2024; meta-analysis: Van Iddekinge et al. (2023), Journal of Applied Psychology; Gartner HR Technology Hype Cycle 2023.

Honest Limitations: What Expertini's Matching Engine Cannot Do

This section is the most important in the article. The limitations of AI matching in recruitment are not edge cases — they are systematic failure modes that affect a meaningful proportion of hiring scenarios. Understanding them is more practically useful than understanding the strengths.

It cannot assess motivation, character, or cultural fit. The matching engine operates on text representations of professional history and role requirements. It has no access to why a candidate wants this role, how they handle ambiguity, whether their working style suits the team, or any of the dozens of human dimensions that experienced hiring managers consider. No current NLP system, including significantly more sophisticated models than those Expertini employs, can reliably assess these dimensions from text alone.
It systematically undervalues non-linear career paths. Candidates who have changed industries, taken career breaks, or built skills through non-traditional pathways (self-directed learning, open-source contributions, freelance work) tend to produce resumes with weaker semantic alignment to conventional job descriptions — not because they are less capable, but because the vocabulary of their experience does not match the vocabulary of the job description. The matching engine treats this vocabulary gap as a relevance gap, which is frequently wrong.
It reflects the biases embedded in its training data. The occupational taxonomy and embedding model fine-tuning both incorporate patterns from sixteen years of historical job data. If historical hiring patterns in a sector systematically favoured certain demographic groups, educational institutions, or geographic regions, those patterns can be partially encoded in the matching weights. Expertini's matching engine is not explicitly designed around demographic fairness objectives, and no external algorithmic audit of its fairness properties has been published.
Resume quality confounds matching quality. A strong candidate who writes a poorly structured, sparse resume will score lower than a mediocre candidate who writes a comprehensive, well-structured one. The matching engine is measuring the quality of the text representation of the candidate, not the quality of the candidate. For employers who care about this — and most should — the match score should be treated as a starting hypothesis, not a conclusion.
It struggles with highly contextual or novel roles. For roles in genuinely new fields, or for highly idiosyncratic job descriptions using non-standard vocabulary, the model's reliance on learned patterns from historical data makes it unreliable. Emerging technology roles — where the relevant skills may not yet be well-represented in training data — are a consistent weak point.
The validity coefficient ceiling. Even best-in-class AI matching systems in the current literature reach validity coefficients of approximately 0.50–0.55 for predicting hiring manager assessments. This means roughly 25–30% of the variance in how a hiring manager evaluates candidates remains unexplained by the matching score. In a pool of 100 applicants, this is a manageable level of uncertainty. For individual high-stakes hires, it is a significant source of risk.

How AI Matching Compares Across Platforms

Platform	Matching Approach	Research Transparency	Algorithmic Bias Auditing	Validity Evidence Published
Expertini	Semantic NLP; cosine similarity; occupational taxonomy; fine-tuned embeddings	✔ Methodology published (IEEE reference)	✘ No external audit published	◑ Partial — via cited research
LinkedIn	Skills graph; machine learning over 1B+ member data; skills inference from profile signals	✘ Opaque — high-level descriptions only	◑ Internal fairness team; limited external audit	✘ Not published externally
Indeed	Proprietary algorithm; keyword + behavioural signals; apply-rate feedback loops	✘ Opaque	✘ No published audit	✘ Not published
HireVue	Video/audio analysis + NLP; ML over interview responses	◑ Some methodology papers	✔ External audits conducted (O'Neil Risk Consulting 2021)	◑ Partial
Pymetrics	Neuroscience-based games + ML	◑ Methodology described in papers	✔ Fairness auditing built into product	✔ Validity studies published

The comparison reveals that most recruitment platforms — including LinkedIn and Indeed — provide minimal public transparency about their matching methodologies. Expertini's published research, while limited, represents more algorithmic transparency than most direct competitors offer. Dedicated AI screening platforms such as Pymetrics and HireVue have invested more substantially in formal validity studies and bias auditing than general-purpose job boards, which is worth acknowledging for employers using AI matching for high-stakes hiring decisions.

Sources: O'Neil Risk Consulting (2021) HireVue Algorithmic Audit; Pymetrics Bias Audit (2019); Expertini IEEE-referenced research; platform public documentation 2024.

The Honest Case for Using AI Matching — and the Conditions Under Which It Helps

AI matching in recruitment is not transformative technology — it is productivity technology. The honest case for its use is not that it finds better candidates than a skilled human recruiter, but that it allows a skilled human recruiter to review a larger candidate pool in less time without missing the strong candidates who would otherwise be buried in volume. This is a real and meaningful value proposition in hiring contexts where application volumes are high relative to recruiter capacity.

The conditions under which AI matching helps most: high-volume roles with standardised skill requirements (technology roles with specific tool/language requirements; clinical roles with certification requirements); experienced recruiters who treat match scores as a starting point for review rather than a final filter; job descriptions that are specific, accurate, and recently updated; and candidate pools that are linguistically and structurally diverse enough that keyword matching produces poor recall.

The conditions under which AI matching helps least: senior and leadership roles where cultural fit, strategic thinking, and interpersonal dynamics are the primary hiring criteria; roles in emerging fields where training data is thin; situations where the candidate pool is small and every applicant warrants full review regardless; and organisations with strong diversity hiring objectives where matching score-based filtering may inadvertently screen out candidates from underrepresented groups.

Expertini's matching engine is a useful tool for the first set of conditions and a potential liability for the second. Employers who understand this distinction will use it more effectively and more safely than those who treat it as a general-purpose hiring solution.

The Research Agenda: What Comes Next

The frontier questions in AI-assisted recruitment matching that the academic and product community is actively working on include:

Counterfactual fairness: Designing matching systems where a candidate's score would not change if their protected characteristics (gender, race, age) were altered, while holding professional qualifications constant. This is technically challenging because protected characteristics are often correlated with vocabulary, writing style, and educational institution names in ways that are difficult to disentangle.
Long-context understanding: Current transformer models have context window limitations that affect their ability to reason over full resumes and detailed job descriptions simultaneously. Advances in long-context language models (such as those emerging from the research community in 2023–2024) have direct applicability to recruitment matching quality.
Temporal skill weighting: Skills decay in relevance over time — a skill listed on a resume from 2015 is less predictive of current capability than the same skill listed for 2023. Matching systems that model temporal skill relevance rather than treating all resume content as equally current should produce better alignment with actual candidate suitability.
Multi-modal matching: Incorporating signals beyond text — structured assessment results, portfolio work, professional community contributions — into matching pipelines. This is an active area of product development across the sector.

Expertini's research engagement with these questions — through its IEEE-referenced work and ongoing platform development — reflects an institutional commitment to evidence-based improvement rather than feature-marketing. The gap between current AI matching performance and the theoretical ceiling of what's possible remains substantial, which is simultaneously a reason for epistemic humility about current capabilities and genuine optimism about the direction of travel.

Explore Expertini's AI Matching Tools

Resume Score™, Job Score™, and Interview Predictor are available to premium Expertini employers. Post jobs free on Expertini and receive candidates ranked by semantic alignment to your role requirements — with full control to override, adjust, or disregard the ranking based on your own judgement.

Post a Job Free → View Premium Plans

Article Type:Technical Research — AI/NLP in Recruitment
Methodology:Cosine Similarity · Sentence Transformers · Occupational Taxonomy · NER
Research Reference:IEEE Transactions on Artificial Intelligence · Journal of Applied Psychology · SHRM
Tools Covered:Resume Score™ · Job Score™ · Interview Predictor
Platform:Expertini est. 2008 · 700,000+ Monthly Users · 150+ Countries

Yes — and this is one of the most important limitations to understand. AI matching models trained on historical hiring data can encode historical hiring biases: if past hiring patterns in a sector favoured candidates from certain educational backgrounds or geographic regions, the matching model may score those profiles higher not because they are more qualified but because they pattern-match to historically successful hires. Semantic matching provides some mitigation relative to keyword matching — by focusing on meaning rather than specific vocabulary, it reduces some forms of vocabulary-based demographic correlation — but it does not eliminate algorithmic bias. Expertini has not published an external algorithmic fairness audit, which is an honest limitation of the platform's current transparency relative to specialists such as Pymetrics, which has undergone independent bias auditing.