lakeFS - Data version control for your data lake | Git for data
Updated Aug 31, 2025 - Go
The Virtual Feature Store. Turn your existing data infrastructure into a feature store.
A demo of Bufstream, a drop-in replacement for Apache Kafka that's 8x less expensive to operate and brings broker-side schema awareness to Kafka
An ETL/ELT framework powered by DuckDB that integrates and processes data from diverse sources. Markdown serves as the configuration medium: YAML blocks define metadata for each data source, and embedded SQL blocks specify the extraction, transformation, and loading logic.
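The Markdown-as-configuration idea can be illustrated with a minimal Go sketch that pulls fenced blocks out of a Markdown document and tags each with its language. This is not the project's actual parser: the `Block` type, the `extractBlocks` function, and the sample `orders` source are hypothetical names chosen for illustration, and only plain triple-backtick fences are handled.

```go
package main

import (
	"fmt"
	"strings"
)

// Block is one fenced code block found in a Markdown document (hypothetical type).
type Block struct {
	Lang string // info string after the opening fence, e.g. "yaml" or "sql"
	Body string // block contents without the fence lines
}

// fence is built at runtime so this example never contains a literal fence.
var fence = strings.Repeat("`", 3)

// extractBlocks returns the fenced blocks of md in document order.
// Assumption: only triple-backtick fences, no nesting, no ~~~ fences.
func extractBlocks(md string) []Block {
	var blocks []Block
	var body []string
	lang, inBlock := "", false
	for _, line := range strings.Split(md, "\n") {
		trimmed := strings.TrimSpace(line)
		switch {
		case !inBlock && strings.HasPrefix(trimmed, fence):
			// Opening fence: remember the language tag and start collecting.
			inBlock = true
			lang = strings.TrimSpace(strings.TrimPrefix(trimmed, fence))
			body = nil
		case inBlock && trimmed == fence:
			// Closing fence: emit the collected block.
			blocks = append(blocks, Block{Lang: lang, Body: strings.Join(body, "\n")})
			inBlock = false
		case inBlock:
			body = append(body, line)
		}
	}
	return blocks
}

func main() {
	// A made-up source definition: YAML metadata followed by SQL logic.
	doc := strings.Join([]string{
		"# orders",
		fence + "yaml",
		"name: orders",
		fence,
		fence + "sql",
		"SELECT * FROM orders;",
		fence,
	}, "\n")
	for _, b := range extractBlocks(doc) {
		fmt.Printf("[%s]\n%s\n", b.Lang, b.Body)
	}
}
```

A real pipeline would then hand the `yaml` blocks to a YAML decoder and run the `sql` blocks against DuckDB; both steps are left out here to keep the sketch dependency-free.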
DataBridge Quality Control
Real-time Go monitor for ML feature pipeline quality & drift detection
Generates a match score from 0 to 100, where 100 is the closest match, indicating how closely two personal full names match. The scoring is based on a series of tests, algorithms, AI, and an ever-growing body of machine-learning-generated knowledge.
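To make the 0-100 scale concrete, here is a minimal Go sketch of a name match score built on plain Levenshtein edit distance. It is a simple baseline swapped in for illustration, not the project's scoring method (which the description says combines several tests and learned knowledge); `nameScore`, `normalize`, and `levenshtein` are hypothetical helper names.

```go
package main

import (
	"fmt"
	"strings"
)

// normalize lowercases a name and collapses runs of whitespace to one space.
func normalize(name string) []rune {
	return []rune(strings.Join(strings.Fields(strings.ToLower(name)), " "))
}

// min3 returns the smallest of three ints.
func min3(a, b, c int) int {
	if b < a {
		a = b
	}
	if c < a {
		a = c
	}
	return a
}

// levenshtein returns the edit distance (insert, delete, substitute) between a and b.
func levenshtein(a, b []rune) int {
	prev := make([]int, len(b)+1)
	for j := range prev {
		prev[j] = j
	}
	for i := 1; i <= len(a); i++ {
		cur := make([]int, len(b)+1)
		cur[0] = i
		for j := 1; j <= len(b); j++ {
			cost := 1
			if a[i-1] == b[j-1] {
				cost = 0
			}
			cur[j] = min3(prev[j]+1, cur[j-1]+1, prev[j-1]+cost)
		}
		prev = cur
	}
	return prev[len(b)]
}

// nameScore maps edit distance onto 0-100, where 100 means identical after
// normalization. Assumption: two empty names score 0 (nothing to compare).
func nameScore(a, b string) int {
	ra, rb := normalize(a), normalize(b)
	longest := len(ra)
	if len(rb) > longest {
		longest = len(rb)
	}
	if longest == 0 {
		return 0
	}
	return (longest - levenshtein(ra, rb)) * 100 / longest
}

func main() {
	fmt.Println(nameScore("Jon Smith", "John Smith"))
	fmt.Println(nameScore("Jane Doe", "jane   doe"))
}
```

Edit distance alone misses nicknames, reordered name parts, and transliteration variants, which is presumably why a production scorer layers additional tests on top.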
Open-source Delta Lake data quality and management tool. Go-first, dbt-compatible, CLI-friendly. Supports profiling, validation, lineage, and alerts.