How AI Catalogued 1.4 Million Automobile Parts and Gave Each One a Human-Readable Identity

The Problem

The automotive aftermarket parts industry operates on a fundamental information problem. Millions of parts and part numbers are distributed across manufacturer websites, distributor catalogues, and legacy database systems , but the data is inconsistent, incomplete, and poorly described. A mechanic searching for a specific part number frequently encounters a bare SKU with no description of what the part is, what vehicle fitments it covers, what it does, or how it relates to adjacent parts. Distributors carry the same component from multiple manufacturers under different part numbers with no standardisation across them.

The client , an automotive parts platform serving both trade (workshops) and retail (DIY mechanics) customers , had access to a large catalogue of parts and part numbers scraped from manufacturer websites and supplier feeds. The data existed. What it lacked was organisation, enrichment, and the kind of human-readable description that allows a mechanic or parts counter staff to identify and recommend the right component with confidence.

The scale of the problem: the client's working catalogue contained approximately 2.1 million parts across 23 manufacturer families. Of these, fewer than 180,000 had even basic human-readable descriptions. The rest were bare part numbers with a category code and an occasional one-line label , effectively invisible in search results and useless for customer decision-making.

The Challenge

Automotive parts cataloguing is a domain requiring deep technical specificity. A description that cites the wrong fitment range, describes the wrong function, or conflates similar parts from different model years is worse than no description , it results in wrong parts being ordered, which generates returns, erodes trust, and wastes workshop time. Specific challenges the solution had to address:

Data heterogeneity: Source data arrived in five formats , manufacturer PDFs, supplier EDI feeds, website scrapes, legacy CSVs, and OEM cross-reference spreadsheets , with no consistent schema. Part numbers for the same physical component varied by brand, region, and catalogue era.
Domain accuracy: LLM-generated descriptions needed to be technically accurate , specifying correct fitment ranges, OEM equivalent numbers, material specifications, and installation context. Generic descriptions were not acceptable to the client's trade customers.
Scale: The catalogue had 2.1 million parts. Processing each individually was not viable , the architecture needed to generate descriptions in batch at scale, with quality validation built into the pipeline rather than applied as a separate manual review step.
Deduplication: Multiple part numbers in the catalogue referred to identical or near-identical components. The system needed to detect these duplicates, group them, and generate cross-reference data rather than producing separate descriptions for what was effectively the same part.

Our Approach

Kovil AI embedded an AI Engineer into the client's team to design and build the cataloguing system end-to-end. The first two weeks were spent mapping the source data landscape , ingesting representative samples of each format, identifying schema variations, and building the normalisation rules that would allow all five source types to flow through a single pipeline cleanly.

For description generation, we evaluated several approaches before settling on a structured prompting strategy with GPT-4o. Rather than asking the model to describe a part from a bare part number, we enriched each input with all available structured data , manufacturer code, category, model year range, associated OEM numbers , before generating the description. This grounding dramatically reduced hallucination risk and produced descriptions accurate enough to pass spot-check review by the client's automotive specialists.

Quality was validated at scale using a separate classification model that scored each generated description on accuracy indicators: correct fitment language, consistent use of automotive terminology, absence of hedging markers that signalled the model was guessing. Descriptions below threshold were flagged for manual review rather than published automatically.

The Solution

Data Ingestion and Normalisation Pipeline

The ingestion pipeline processed all five source data formats. Each source was parsed, normalised to a common schema (part number, manufacturer code, category, fitment metadata, raw description if available), and deduplicated using both exact matching (same part number) and fuzzy matching (similar numbers from the same manufacturer family, matching fitment ranges, matching category codes). The pipeline produced a clean, deduplicated working catalogue as its primary output.

Deduplication yielded an immediate result: the 2.1 million parts in the raw catalogue reduced to 1.4 million unique components , a 33% reduction driven by cross-manufacturer duplicates and legacy data artefacts that had accumulated over years of catalogue management.

AI Description Generation Engine

For each unique part, the description engine assembled a structured context block , all available factual data about the component , and submitted it to GPT-4o with an engineered prompt specifying: the target audience (trade mechanics and parts counter staff), the required description format (fitment range, function, installation notes, OEM equivalents where available), and explicit constraints (no speculative language, no unsupported fitment claims, metric and imperial specifications where applicable).

Descriptions were generated in batches of 500, with parallel processing to manage throughput. The full catalogue of 1.4 million unique parts was described in 18 days of pipeline operation , a rate that would have taken a human cataloguing team years at equivalent quality.

Quality Validation Layer

Every generated description passed through a validation model before publication. The validator checked for: technically inconsistent fitment claims, terminology errors, generic language that didn't contain part-specific information, and descriptions shorter than a minimum useful length. The reject rate across the full catalogue was 4.2% , these 58,800 descriptions were routed to a human review queue where the client's parts specialists resolved them at their own pace.

Search and Cross-Reference Infrastructure

The final component was a search layer built on the enriched catalogue: a full-text search index combining structured catalogue data with the AI-generated descriptions, enabling trade and retail customers to find parts by function, fitment, OEM number, or competitor cross-reference. The cross-reference data built during deduplication meant a mechanic searching for a manufacturer-specific part number would surface all equivalent aftermarket options simultaneously.

Results

1.4 million unique parts catalogued , processed from a raw input of 2.1 million, with deduplication revealing 700,000 redundant records the client hadn't previously identified
95.8% of descriptions published without manual intervention , the 4.2% reject rate represented the portion requiring human specialist review, against the previous 100% manual effort baseline
Search-to-purchase conversion improved 38% on the trade platform within 60 days of the enriched catalogue going live , mechanics found the right part on first search rather than calling the parts counter for clarification
Returns from incorrect part orders dropped 27% in the first quarter after launch , directly linked to the accuracy of fitment information in the generated descriptions

The cataloguing pipeline now runs continuously, processing new parts arriving from supplier feeds within 24 hours of ingestion , keeping the catalogue current without ongoing manual effort.