A Novel Approach for Lookalikes with Multi-Level Sub-Category on Large-Scale Data

Authors(s): Thejaswini P, Kiranmayi KL Chandra

Abstract

Lookalike audience generation is an effective way to increase the audience base in online advertising. Segregating the lookalike audience into multiple priority levels gives greater flexibility to the advertiser in selecting their user reach. In this paper, a novel approach of lookalike audience generation in multiple priority levels on a large-scale data with millions of users and thousands of audience segments is explained. An automated system combining custom models to generate similar audience segments and group lookalike audience into priority levels using Spark Scala and Hadoop ecosystem is developed. The experimental results comparing different approaches show that our proposed model outperforms others in reach, scalability, and speed.

Picture of Association of Data Scientists

Association of Data Scientists

The Chartered Data Scientist Designation

Achieve the highest distinction in the data science profession.

Elevate Your Team's AI Skills with our Proven Training Programs

Strengthen Critical AI Skills with Trusted Generative AI Training by Association of Data Scientists.

Our Accreditations

Get global recognition for AI skills

Chartered Data Scientist (CDS™)

The highest distinction in the data science profession. Not just earn a charter, but use it as a designation.

Certified Data Scientist - Associate Level

Global recognition of data science skills at the beginner level.

Certified Generative AI Engineer

An upskilling-linked certification initiative designed to recognize talent in generative AI and large language models

Join thousands of members and receive all benefits.

Become Our Member

We offer both Individual & Institutional Membership.