Build data pipelines, the easy way 🛠️
-
Updated
Jun 6, 2023 - TypeScript
Build data pipelines, the easy way 🛠️
Enterprise-grade and API-first LLM workspace for unstructured documents, including data extraction, redaction, rights management, prompt playground, and more!
The Supabase of AI era. A modular, open-source backend for building AI-native software — designed for knowledge, not static data.
Jayvee is a domain-specific language and runtime for automated processing of data pipelines
Frontend & BFF (Backend for frontend) for Olake. This includes the UI code and backend code for storing the configuration of sync and orchestrating it.
OpenETL is a free, lightweight, and flexible ETL (Extract, Transform, Load) framework built in TypeScript, designed for developers who need a simple yet powerful tool to orchestrate data workflows in Node.js environments.
A CLI tool for transforming large RDF datasets using pure SPARQL
Collection of pkgs to build pipelines in JS/TS
Irish Property Price Register transformed into a data warehouse via an EtLT pipeline.
Anyparser Typescript SDK for RAG/ETL Pipelines - File Content Extraction. Supports extraction from various file formats including PDF, Microsoft Office documents, OCR/Image to Text, Audio to Text, and Website to Text.
Universal APIs for unstructured data. Connect to SaaS tools with turnkey auth and sync documents from N data sources with only one integration
An ETL pipeline (and REST API web server) implementation that ingests bulk data (such as CSV files from UK censuses) to produce a single stats lookup table with OA (output area) resolution; queryable by OA, LSOA (lower-layer super output area), MSOA (middle-layer super output area), LAD (local area district), or postal code.
🚀 JetShift is a powerful and lightweight ETL framework that simplifies the process of building data pipelines.
Serverless healthcare ETL reference, built with AWS CDK on an event-driven architecture
An extract-transform-load pipeline for stock transactions
This repository contains a data faker tool designed to seed e-commerce db for learning dbt.
SERFF Document Search A simple web application for searching through SERFF (System for Electronic Rate and Form Filing) insurance documents using Turbopuffer vector search technology.
smartive/proc-that forked to play with
Add a description, image, and links to the etl-pipeline topic page so that developers can more easily learn about it.
To associate your repository with the etl-pipeline topic, visit your repo's landing page and select "manage topics."