Memberships

Individual Membership
Join the world’s leading Data Science professional community. You can access both General & Premium Memberships.

Learn More

Corporate Membership
Any corporate, organization or academic institution having common interests in the AI field can become a member of ADaSci.

Learn More
Accreditations

Institutional Accreditation
Our accreditation is a mark of excellence, validating the quality, relevance, and industry alignment of your programs, products, and services.

Learn More

Chartered Data Scientist™
The Chartered Data Scientist (CDS) credential gives a strong understanding of advanced data science profession and in-depth, applied analytics skills.

Learn More

Certified Generative AI Engineer
An upskilling-linked certification initiative designed to recognize talent in generative AI and large language models.

Learn More
Continuous Learning

Our Latest Courses

Advanced RAG with Pinecone

₹3,416.00
Add to cart

ADaSci Certified Vibe Coding Practitioner

₹21,268.00
Add to cart

ADaSci Certified Data Engineer

₹21,268.00
Add to cart

ADaSci Certified Agentic AI System Architect

₹21,268.00
Add to cart

Hi, Welcome back!

Keep me signed in
Forgot Password?

Don't have an account? Register Now

Access all Courses
Corporate Trainings
Contact

Lattice | Volume-4 Issue-3

PII Detection in Emails through QLoRA Fine-tuned LLMs: A comparative analysis with BERT and GPT3.5

Author(s): Chinmay Prakash, Rishit Tyagi, Prakash Selvakumar

Explore more from ADaSci

Choosing the Right Generative AI Training Providers for Your Team

Leveraging AI for unlocking cross- sell growth in B2B SAAS Industry

Imbalance Handling with Combination of Deep Variational Autoencoder and NEATER

Deep Reinforcement Learning for Next-gen Cruise Control

A Comprehensive Guide to Vector Databases and their Utilities

Implementing Rapid LLM Inferencing using Groq

Mastering AI Code Execution in Secure Sandboxes with E2B

Strategies for Scaling LLM Deployment

Advancing Communication with GPT-4 and MLflow

A Hands-on Guide to Airtrain AI: A No-code Compute Platform

Abstract

Personally Identifiable Information (PII) detection is critical due to the increasing exploitation of individual data, particularly in the text analytics domain. With the rise in the application of large language models (LLMs) for Natural Language Processing (NLP) solutions, data security concerns call for effective on-premises solutions and privacy-centric methods.

This paper explores the use of LLMs fine-tuned on limited domain-specific datasets for detecting and masking PII and benchmarking this solution against existing NLP methods such as BERT and GPT3.5. Our approach includes fine-tuning the Vicuna-7B LLM using the Quantized and Low Rank Adaptation (QLoRA) technique, enabling cost-effective fine-tuning and deployment on consumer GPUs; The proposed approach offers several advantages, including improved performance and reliability compared to GPT3.5, enhanced data security by keeping data within the company’s cloud, domain adaptability through model fine-tuning, and on-premise usage benefits such as reduced dependence on proprietary models, quota limitations, and flexible scaling of model hosting infrastructure.

Overall, this paper presents an efficient and secure solution for domain specific PII detection tasks using LLMs.

Access The Research Paper:

Lattice | Vol 4 Issue 3

₹1,708.00

Add to cart

Vaibhav Kumar

The Chartered Data Scientist Designation

Achieve the highest distinction in the data science profession.

Elevate Your Team's AI Skills with our Proven Training Programs

Strengthen Critical AI Skills with Trusted Generative AI Training by Association of Data Scientists.

Our AI Courses

Build AI Agents with Google ADK
₹1,709.00
Add to cart

Our Latest Courses

PII Detection in Emails through QLoRA Fine-tuned LLMs: A comparative analysis with BERT and GPT3.5

Explore more from ADaSci

Abstract

Access The Research Paper:

Vaibhav Kumar

The Chartered Data Scientist Designation

Elevate Your Team's AI Skills with our Proven Training Programs

Our AI Courses

Build AI Agents with Google ADK

Our Accreditations

Get global recognition for AI skills

Chartered Data Scientist (CDS™)

The highest distinction in the data science profession. Not just earn a charter, but use it as a designation.

Certified Data Scientist - Associate Level

Global recognition of data science skills at the beginner level.

Certified Generative AI Engineer

An upskilling-linked certification initiative designed to recognize talent in generative AI and large language models

Join thousands of members and receive all benefits.

Become Our Member

We offer both Individual & Institutional Membership.

The power of intelligence to propel humanity and make a difference

Our Accrediations

CDS Program

Membership

About

For Organizations

Journal