GitHub - tburc/rag-bias: Investigating social biases in RAG systems

This repository analyzes social biases in RAG (retrieval-augmented generation) systems and implements a technique to mitigate that bias. RAG retrieves documents relevant to a query, typically through text search over large language model embeddings, and passes them to a generative model as context.
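As a rough illustration of the embedding search step described above, the sketch below ranks documents by cosine similarity to a query vector. The vectors are toy values and the names `cosine_similarity` and `retrieve` are hypothetical; they are not taken from rag_bias.ipynb, and the generation step of RAG is omitted.

```python
from math import sqrt


def cosine_similarity(u, v):
    """Cosine of the angle between two equal-length vectors (0.0 if either is all zeros)."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = sqrt(sum(a * a for a in u)) * sqrt(sum(b * b for b in v))
    return dot / norm if norm else 0.0


def retrieve(query_vec, doc_vecs, top_k=2):
    """Return the ids of the top_k documents most similar to the query."""
    scored = [(cosine_similarity(query_vec, vec), doc_id)
              for doc_id, vec in doc_vecs.items()]
    scored.sort(reverse=True)  # highest similarity first
    return [doc_id for _, doc_id in scored[:top_k]]


# Toy 3-dimensional "embeddings"; real embeddings have hundreds or thousands of dimensions.
docs = {
    "doc_a": [1.0, 0.0, 0.0],
    "doc_b": [0.7, 0.7, 0.0],
    "doc_c": [0.0, 0.0, 1.0],
}
query = [0.9, 0.1, 0.0]
top_docs = retrieve(query, docs)
print(top_docs)
```

In a full RAG pipeline the retrieved documents would then be inserted into the prompt of a generative model, so any bias in this ranking step carries through to the generated answer.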

The code here uses the GrepBiasIR dataset ("Gender REPresentation Bias for Information Retrieval") to evaluate both gender bias and search quality.

The mitigation technique reduces bias by 30% when 45% of the embedding dimensions are removed. That is comparable to the industry-standard practice of dropping 50% of dimensions to save cost.
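The README does not say how the removed dimensions are chosen, so the following is only a minimal sketch under a stated assumption: dimensions are ranked by how much their mean value differs between two groups of documents, and the most divergent fraction is dropped. The selection rule, the function names, and the toy vectors are all hypothetical and may differ from what rag_bias.ipynb does.

```python
def mean_vector(vectors):
    """Component-wise mean of a list of equal-length vectors."""
    n = len(vectors)
    return [sum(column) / n for column in zip(*vectors)]


def dims_to_drop(group_a, group_b, fraction):
    """Assumed rule: pick the `fraction` of dimensions whose group means differ most."""
    mean_a = mean_vector(group_a)
    mean_b = mean_vector(group_b)
    gaps = [abs(a - b) for a, b in zip(mean_a, mean_b)]
    k = int(len(gaps) * fraction)  # number of dimensions to remove
    ranked = sorted(range(len(gaps)), key=lambda i: gaps[i], reverse=True)
    return set(ranked[:k])


def drop_dims(vector, dropped):
    """Return the vector with the dropped dimension indices removed."""
    return [x for i, x in enumerate(vector) if i not in dropped]


# Toy 4-dimensional embeddings for two groups of documents.
group_a = [[1.0, 0.2, 0.5, 0.1], [0.9, 0.1, 0.4, 0.2]]
group_b = [[0.1, 0.2, 0.5, 0.9], [0.2, 0.1, 0.4, 1.0]]

dropped = dims_to_drop(group_a, group_b, fraction=0.5)
reduced_a = drop_dims(group_a[0], dropped)
reduced_b = drop_dims(group_b[0], dropped)
print(sorted(dropped), reduced_a, reduced_b)
```

In this toy case the two dimensions that separate the groups are removed, and the first document of each group ends up with the same reduced vector. Queries and documents would need the same dimensions removed before similarity is computed.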

This repository contains a copy of the original GrepBiasIR dataset, as well as precomputed embeddings for all the documents in that dataset, generated with various embedding models. To use the precomputed embeddings, unzip embeddings.zip. To regenerate the embeddings yourself, you will need to provide your own API keys for the embedding providers (e.g. OpenAI and Google).
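One way to unpack the precomputed embeddings is with Python's standard `zipfile` module, as sketched below. The destination directory name `embeddings` and the helper name `extract_embeddings` are assumptions; the notebook may expect the files in a different location, so adjust the path as needed.

```python
import zipfile
from pathlib import Path


def extract_embeddings(zip_path="embeddings.zip", dest_dir="embeddings"):
    """Extract the archive into dest_dir and return the sorted archive member names."""
    dest = Path(dest_dir)
    dest.mkdir(parents=True, exist_ok=True)
    with zipfile.ZipFile(zip_path) as archive:
        archive.extractall(dest)
        return sorted(archive.namelist())


# Only runs when the archive is present in the current directory.
if Path("embeddings.zip").exists():
    for name in extract_embeddings():
        print(name)
```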

Folders and files (6 commits)

README.md
embeddings.zip
queries-documents_Appearance.csv
queries-documents_Career.csv
queries-documents_Child Care.csv
queries-documents_Cognitive Capabilities.csv
queries-documents_Domestic Work.csv
queries-documents_Physical Capabilities.csv
queries-documents_Sex & Relationship.csv
rag_bias.ipynb
