Harnessing AI for Detection and Correction of Hallucinations in Large Search Systems

Author(s): Ratnesh Parihar, Ritesh Agarwal

This paper outlines our approach to tackling a core challenge in designing a robust e-commerce search system for over 10 million stock-keeping units (SKUs). The system employs AI models (OpenAI, Llama 3) for three major functions: categorization, tagging, and semantic searching using vector embeddings. Categorization refers to classifying incoming information from different third parties into a taxonomy essential for data accuracy and standardization. Tagging aids search by linking relevant tags for easier SKU filtering. Lastly, vector search, using Elastic Search and cosine similarity, enables efficient searches and retrieval of relevant information. To enhance tagging and categorization accuracy, mathematical models using embeddings and cosine similarity were applied to reduce hallucination effects.

Access this Lattice journal:

Picture of Association of Data Scientists

Association of Data Scientists

The Chartered Data Scientist Designation

Achieve the highest distinction in the data science profession.

Elevate Your Team's AI Skills with our Proven Training Programs

Strengthen Critical AI Skills with Trusted Generative AI Training by Association of Data Scientists.

Our Accreditations

Get global recognition for AI skills

Chartered Data Scientist (CDS™)

The highest distinction in the data science profession. Not just earn a charter, but use it as a designation.

Certified Data Scientist - Associate Level

Global recognition of data science skills at the beginner level.

Certified Generative AI Engineer

An upskilling-linked certification initiative designed to recognize talent in generative AI and large language models

Join thousands of members and receive all benefits.

Become Our Member

We offer both Individual & Institutional Membership.