Skip to content
View machadoluiz's full-sized avatar

Block or report machadoluiz

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

A lightweight data processing framework built on DuckDB and 3FS.

Python 3,786 305 Updated Mar 5, 2025

Full stack, modern web application template. Using FastAPI, React, SQLModel, PostgreSQL, Docker, GitHub Actions, automatic HTTPS and more.

TypeScript 30,770 5,639 Updated Feb 22, 2025

OpenMetadata is a unified metadata platform for data discovery, data observability, and data governance powered by a central metadata repository, in-depth column level lineage, and seamless team co…

TypeScript 6,208 1,166 Updated Mar 6, 2025

The Metadata Platform for your Data and AI Stack

Java 10,371 3,061 Updated Mar 6, 2025

The easy-to-use open source Business Intelligence and Embedded Analytics tool that lets everyone work with data 📊

Clojure 41,085 5,415 Updated Mar 6, 2025

dbt enables data analysts and engineers to transform their data using the same practices that software engineers use to build applications.

Python 10,467 1,689 Updated Mar 6, 2025

Efficient data transformation and modeling framework that is backwards compatible with dbt.

Python 2,119 193 Updated Mar 6, 2025

Limbo is a project to build the modern evolution of SQLite.

Rust 9,697 355 Updated Mar 6, 2025

Get up and running with Llama 3.3, DeepSeek-R1, Phi-4, Gemma 2, and other large language models.

Go 131,314 10,773 Updated Mar 6, 2025

Apache Superset is a Data Visualization and Data Exploration Platform

TypeScript 64,810 14,622 Updated Mar 6, 2025

The leading data integration platform for ETL / ELT data pipelines from APIs, databases & files to data warehouses, data lakes & data lakehouses. Both self-hosted and Cloud-hosted.

Python 17,419 4,351 Updated Mar 6, 2025

Static Type Checker for Python

Python 13,920 1,588 Updated Mar 6, 2025

Optional static typing for Python

Python 19,023 2,883 Updated Mar 2, 2025

The official home of the Presto distributed SQL query engine for big data

Java 16,235 5,424 Updated Mar 6, 2025

Official repository of Trino, the distributed SQL query engine for big data, formerly known as PrestoSQL (https://trino.io)

Java 10,945 3,118 Updated Mar 6, 2025

ClickHouse® is a real-time analytics database management system

C++ 39,369 7,154 Updated Mar 6, 2025

The world's fastest open query engine for sub-second analytics both on and off the data lakehouse. With the flexibility to support nearly any scenario, StarRocks provides best-in-class performance …

Java 9,633 1,931 Updated Mar 6, 2025

Blazing-fast query execution engine speaks Apache Spark language and has Arrow-DataFusion at its core.

Rust 1,417 143 Updated Mar 5, 2025

Gluten is a middle layer responsible for offloading JVM-based SQL engines' execution to native engines.

Scala 1,284 466 Updated Mar 6, 2025

A framework for managing and maintaining multi-language pre-commit hooks.

Python 13,455 849 Updated Feb 17, 2025

An orchestration platform for the development, production, and observation of data assets.

Python 12,649 1,608 Updated Mar 6, 2025

Prefect is a workflow orchestration framework for building resilient data pipelines in Python.

Python 18,522 1,731 Updated Mar 6, 2025

⚡ Workflow Automation Platform. Orchestrate & Schedule code in any language, run anywhere, 500+ plugins. Alternative to Zapier, Rundeck, Camunda, Airflow...

Java 16,366 1,374 Updated Mar 6, 2025

🧙 Build, run, and manage data pipelines for integrating and transforming data.

Python 8,179 824 Updated Mar 5, 2025

Apache Airflow - A platform to programmatically author, schedule, and monitor workflows

Python 39,039 14,762 Updated Mar 6, 2025

An extremely fast Python linter and code formatter, written in Rust.

Rust 36,403 1,233 Updated Mar 6, 2025

An extremely fast Python package and project manager, written in Rust.

Rust 42,292 1,192 Updated Mar 6, 2025

Apache DataFusion Comet Spark Accelerator

Rust 905 185 Updated Mar 6, 2025

The OS for your personal finances

Ruby 41,974 2,985 Updated Mar 5, 2025

An open-source storage framework that enables building a Lakehouse architecture with compute engines including Spark, PrestoDB, Flink, Trino, and Hive and APIs

Scala 7,849 1,783 Updated Mar 6, 2025
Next
Showing results