Lattice | Volume 3 ISSUE-1

Blended Document Similarity based on Text & Image Features

Author(s): Anand Jha

Abstract:

Document similarity is a building block for many useful applications, including information retrieval, document clustering, and question-answering systems. In the modern digital world, informative documents are composed of text, images, and videos. In such a scenario, similarity based purely on text, images, or video may not be adequate, so a metric that blends similarity across these aspects should be used instead. In this paper, a weighted similarity measure based on text and images is developed using popular open-source Machine Learning (ML) libraries. This provides a flexible and easy method that does not require the large training data that ML tasks often need.
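The abstract does not give the formula or name the libraries used, so the following is only a minimal sketch of what a weighted text-and-image similarity could look like, not the author's implementation. It assumes that text similarity is the cosine of simple word-count vectors, that image features arrive as precomputed numeric vectors, and that the two scores are combined as a convex blend controlled by a hypothetical `text_weight` parameter. All function names here are illustrative.

```python
import math
from collections import Counter


def cosine(u, v):
    """Cosine similarity between two equal-length numeric vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    if norm_u == 0 or norm_v == 0:
        return 0.0  # treat an empty/zero vector as sharing nothing
    return dot / (norm_u * norm_v)


def text_similarity(text_a, text_b):
    """Cosine similarity of word-count vectors (an assumed text feature)."""
    counts_a = Counter(text_a.lower().split())
    counts_b = Counter(text_b.lower().split())
    vocab = sorted(set(counts_a) | set(counts_b))
    return cosine([counts_a[w] for w in vocab], [counts_b[w] for w in vocab])


def blended_similarity(text_a, text_b, image_vec_a, image_vec_b, text_weight=0.5):
    """Weighted blend of text and image similarity.

    text_weight is a hypothetical parameter in [0, 1]; the image score
    receives the remaining weight (1 - text_weight).
    """
    if not 0.0 <= text_weight <= 1.0:
        raise ValueError("text_weight must lie in [0, 1]")
    s_text = text_similarity(text_a, text_b)
    s_image = cosine(image_vec_a, image_vec_b)  # assumes precomputed image features
    return text_weight * s_text + (1.0 - text_weight) * s_image


if __name__ == "__main__":
    score = blended_similarity(
        "solar power trading report", "solar power market report",
        [0.9, 0.1, 0.3], [0.8, 0.2, 0.4],
        text_weight=0.6,
    )
    print(f"blended similarity: {score:.3f}")
```

Keeping the weights summing to one leaves the blended score on the same scale as its two components, and the weight can be tuned by hand for a given collection, which fits the abstract's claim that no large training set is needed.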
