@argilla-io

Argilla

Building the open-source feedback layer for LLMs

Pinned

  1. argilla Public

    Argilla is a collaboration tool for AI engineers and domain experts to build high-quality datasets

    Python · 4.7k stars · 461 forks

  2. distilabel Public

    Distilabel is a framework for synthetic data and AI feedback for engineers who need fast, reliable and scalable pipelines based on verified research papers.

    Python · 2.9k stars · 221 forks

  3. notus Public

    Notus is a collection of fine-tuned LLMs using SFT, DPO, SFT+DPO, and/or any other RLHF techniques, while always keeping a data-first approach

    Python · 169 stars · 14 forks

  4. synthetic-data-generator Public

    Build datasets using natural language

    Python · 539 stars · 63 forks

Repositories

Showing 10 of 86 repositories
  • distilabel Public

    Distilabel is a framework for synthetic data and AI feedback for engineers who need fast, reliable and scalable pipelines based on verified research papers.

    Python · 2,911 stars · Apache-2.0 · 221 forks · 78 issues (1 needs help) · 16 pull requests · Updated Oct 28, 2025
  • argilla Public

    Argilla is a collaboration tool for AI engineers and domain experts to build high-quality datasets

    Python · 4,731 stars · Apache-2.0 · 461 forks · 15 issues · 17 pull requests · Updated Oct 27, 2025
  • synthetic-data-generator Public

    Build datasets using natural language

    Python · 539 stars · Apache-2.0 · 63 forks · 12 issues · 0 pull requests · Updated Sep 19, 2025
  • coset Public

    Code for Mediaflows Coset competition

    Jupyter Notebook · 1 star · 0 forks · 0 issues · 0 pull requests · Updated Aug 21, 2025
  • mlflow-inference Public

    A tooling set for automating MLflow model deployments

    Python · 0 stars · 0 forks · 0 issues · 0 pull requests · Updated Aug 21, 2025
  • python-elasticsearch-runner Public Forked from comperiosearch/python-elasticsearch-runner

    A standalone Python runner for Elasticsearch. Intended for transient and lightweight usage such as small integration tests.

    Python · 1 star · 2 forks · 0 issues · 0 pull requests · Updated Aug 21, 2025
  • spacy-wordnet Public

    spacy-wordnet creates annotations that easily allow the use of wordnet and wordnet domains by using the nltk wordnet interface

    Python · 260 stars · MIT · 20 forks · 4 issues · 2 pull requests · Updated Aug 21, 2025
  • selectra Public

    Repo to pre-train a Spanish language model and zero-shot classifier based on the ELECTRA model

    Python · 3 stars · 1 fork · 0 issues · 1 pull request · Updated Aug 21, 2025
  • deepspeech.pytorch Public Forked from SeanNaren/deepspeech.pytorch

    Speech Recognition using DeepSpeech2 and the CTC activation function.

    Python · 0 stars · MIT · 633 forks · 0 issues · 0 pull requests · Updated Aug 21, 2025
  • services Public
    Python · 0 stars · 0 forks · 0 issues · 0 pull requests · Updated Aug 21, 2025