machadoluiz

Luiz Machado machadoluiz

Data Engineer

Rio de Janeiro, Brazil
in/machado-luiz

Stars

deepseek-ai / smallpond

A lightweight data processing framework built on DuckDB and 3FS.

Python 3,786 305 Updated Mar 5, 2025

fastapi / full-stack-fastapi-template

Full stack, modern web application template. Using FastAPI, React, SQLModel, PostgreSQL, Docker, GitHub Actions, automatic HTTPS and more.

TypeScript 30,770 5,639 Updated Feb 22, 2025

open-metadata / OpenMetadata

OpenMetadata is a unified metadata platform for data discovery, data observability, and data governance powered by a central metadata repository, in-depth column level lineage, and seamless team co…

TypeScript 6,208 1,166 Updated Mar 6, 2025

datahub-project / datahub

The Metadata Platform for your Data and AI Stack

Java 10,371 3,061 Updated Mar 6, 2025

metabase / metabase

The easy-to-use open source Business Intelligence and Embedded Analytics tool that lets everyone work with data 📊

Clojure 41,085 5,415 Updated Mar 6, 2025

dbt-labs / dbt-core

dbt enables data analysts and engineers to transform their data using the same practices that software engineers use to build applications.

Python 10,467 1,689 Updated Mar 6, 2025

TobikoData / sqlmesh

Efficient data transformation and modeling framework that is backwards compatible with dbt.

Python 2,119 193 Updated Mar 6, 2025

tursodatabase / limbo

Limbo is a project to build the modern evolution of SQLite.

Rust 9,697 355 Updated Mar 6, 2025

ollama / ollama

Get up and running with Llama 3.3, DeepSeek-R1, Phi-4, Gemma 2, and other large language models.

Go 131,314 10,773 Updated Mar 6, 2025

apache / superset

Apache Superset is a Data Visualization and Data Exploration Platform

TypeScript 64,810 14,622 Updated Mar 6, 2025

airbytehq / airbyte

The leading data integration platform for ETL / ELT data pipelines from APIs, databases & files to data warehouses, data lakes & data lakehouses. Both self-hosted and Cloud-hosted.

Python 17,419 4,351 Updated Mar 6, 2025

microsoft / pyright

Static Type Checker for Python

Python 13,920 1,588 Updated Mar 6, 2025

python / mypy

Optional static typing for Python

Python 19,023 2,883 Updated Mar 2, 2025

prestodb / presto

The official home of the Presto distributed SQL query engine for big data

Java 16,235 5,424 Updated Mar 6, 2025

trinodb / trino

Official repository of Trino, the distributed SQL query engine for big data, formerly known as PrestoSQL (https://trino.io)

Java 10,945 3,118 Updated Mar 6, 2025

ClickHouse / ClickHouse

ClickHouse® is a real-time analytics database management system

C++ 39,369 7,154 Updated Mar 6, 2025

StarRocks / starrocks

The world's fastest open query engine for sub-second analytics both on and off the data lakehouse. With the flexibility to support nearly any scenario, StarRocks provides best-in-class performance …

Java 9,633 1,931 Updated Mar 6, 2025

kwai / blaze

Blazing-fast query execution engine that speaks the Apache Spark language and has Arrow-DataFusion at its core.

Rust 1,417 143 Updated Mar 5, 2025

apache / incubator-gluten

Gluten is a middle layer responsible for offloading JVM-based SQL engines' execution to native engines.

Scala 1,284 466 Updated Mar 6, 2025

pre-commit / pre-commit

A framework for managing and maintaining multi-language pre-commit hooks.

Python 13,455 849 Updated Feb 17, 2025

dagster-io / dagster

An orchestration platform for the development, production, and observation of data assets.

Python 12,649 1,608 Updated Mar 6, 2025

PrefectHQ / prefect

Prefect is a workflow orchestration framework for building resilient data pipelines in Python.

Python 18,522 1,731 Updated Mar 6, 2025

kestra-io / kestra

⚡ Workflow Automation Platform. Orchestrate & Schedule code in any language, run anywhere, 500+ plugins. Alternative to Zapier, Rundeck, Camunda, Airflow...

Java 16,366 1,374 Updated Mar 6, 2025

mage-ai / mage-ai

🧙 Build, run, and manage data pipelines for integrating and transforming data.

Python 8,179 824 Updated Mar 5, 2025

apache / airflow

Apache Airflow - A platform to programmatically author, schedule, and monitor workflows

Python 39,039 14,762 Updated Mar 6, 2025

astral-sh / ruff

An extremely fast Python linter and code formatter, written in Rust.

Rust 36,403 1,233 Updated Mar 6, 2025

astral-sh / uv

An extremely fast Python package and project manager, written in Rust.

Rust 42,292 1,192 Updated Mar 6, 2025

apache / datafusion-comet

Apache DataFusion Comet Spark Accelerator

Rust 905 185 Updated Mar 6, 2025

maybe-finance / maybe

The OS for your personal finances

Ruby 41,974 2,985 Updated Mar 5, 2025

delta-io / delta

An open-source storage framework that enables building a Lakehouse architecture with compute engines including Spark, PrestoDB, Flink, Trino, and Hive and APIs

Scala 7,849 1,783 Updated Mar 6, 2025
