All Data Engineering Tools

517 tools ranked by quality score · Page 4 of 6

Showing 301–400 of 517
Rank · Tool · Score · Tier
301 Indexical-Metrics-Measure-Advisory/watchmen

Watchmen Platform is a low code data platform for data pipeline, meta data...

40
Emerging
302 turbot/steampipe-plugin-openai

Use SQL to instantly query OpenAI for completions, models & more. Open...

40
Emerging
303 apache/incubator-devlake-playground

Apache DevLake is an open-source dev data platform to ingest, analyze, and...

40
Emerging
304 Canner/vulcan-sql

Data API Framework for AI Agents and Data Apps

40
Emerging
305 rannd1nt/phaethon

Dimensional Data Pipeline & Semantic Data Engineering Framework

40
Emerging
306 turbot/steampipe-plugin-bitbucket

Use SQL to instantly query Bitbucket. Open source CLI. No DB required.

40
Emerging
307 colliery-io/cloacina

Embedded workflow orchestration library for Rust and Python. Build...

40
Emerging
308 DataKitchen/dataops-observability-agents

DataOps Observability Integration Agents are part of DataKitchen's Open...

40
Emerging
309 terrylica/exness-data-preprocess

Professional forex tick data preprocessing with unified DuckDB storage,...

39
Emerging
310 turbot/steampipe-plugin-crowdstrike

Use SQL to instantly query CrowdStrike resources. Open source CLI. No DB required.

39
Emerging
311 DevDizzle/gammarips-engine

An end-to-end, serverless AI platform built on Google Cloud that...

39
Emerging
312 PeopleForBikes/brokenspoke

A collection of tools for the BNA.

39
Emerging
313 Data-Research-Analysis/data-research-analysis-platform

Stop Guessing. Start Dominating Your Market. The only data platform built...

39
Emerging
314 AbsaOSS/pramen

Resilient data pipeline framework running on Apache Spark

39
Emerging
315 turbot/steampipe-plugin-wiz

Use SQL to instantly query Wiz resources. Open source CLI. No DB required.

39
Emerging
316 turbot/steampipe-plugin-exec

Use SQL to instantly query & run shell commands on local & remote servers....

39
Emerging
317 wherobots/airflow-providers-wherobots

Airflow extensions for communicating with Wherobots Cloud

39
Emerging
318 turbot/steampipe-plugin-pagerduty

Use SQL to instantly query resources from PagerDuty. Open source CLI. No DB required.

39
Emerging
319 turbot/steampipe-plugin-shopify

Use SQL to instantly query Shopify products, orders and more. Open source...

39
Emerging
320 ivszhuravlev/spark-tuning-handbook

Hands-on Spark internals and performance engineering.

39
Emerging
321 turbot/steampipe-plugin-ipstack

Use SQL to instantly query IP geolocation and more from ipstack. Open source...

39
Emerging
322 B1AAB/EBA

An ML-first temporal graph of Bitcoin's on-chain fund flows.

39
Emerging
323 stitchfix/hamilton

A scalable general purpose micro-framework for defining dataflows. THIS...

39
Emerging
324 wilson-mok/demo

In this repository, you will find various demos and presentations I have...

39
Emerging
325 turbot/steampipe-plugin-twilio

Use SQL to instantly query Twilio resources across accounts. Open source...

38
Emerging
326 turbot/steampipe-plugin-googledirectory

Use SQL to instantly query users, groups, domains and more from Google...

38
Emerging
327 aasouzaconsult/portfolio-dados

Repository of Data Analysis Projects (seeking value in data!!!)

38
Emerging
328 turbot/steampipe-plugin-steampipe

Use SQL to instantly query plugin metadata from the Steampipe Hub. Open...

38
Emerging
329 turbot/steampipe-plugin-vanta

Use SQL to instantly query Vanta resources. Open source CLI. No DB required.

38
Emerging
330 turbot/steampipe-plugin-finance

Use SQL to instantly query financial data including quotes (equities,...

38
Emerging
331 turbot/steampipe-plugin-nomad

Use SQL to instantly query Nomad ACLs, deployments, namespaces & more. Open...

38
Emerging
332 rush-db/rushdb

RushDB is an Instant Database for Modern Apps & AI. Built on top of Neo4j.

38
Emerging
333 turbot/steampipe-plugin-auth0

Use SQL to instantly query Auth0 resources. Open source CLI. No DB required.

38
Emerging
334 vedanthv/data-engineering-portfolio

Cool DE Projects

38
Emerging
335 turbot/steampipe-plugin-servicenow

Use SQL to instantly query ServiceNow CMDB CI services, servers, incidents,...

38
Emerging
336 turbot/steampipe-plugin-hcloud

Use SQL to instantly query servers, networks and more from Hetzner Cloud....

38
Emerging
337 logjuicer/logjuicer

LogJuicer extracts anomalies from logs

38
Emerging
338 turbot/steampipe-plugin-snowflake

Use SQL to instantly query Snowflake resources. Open source CLI. No DB required.

38
Emerging
339 turbot/steampipe-plugin-tailscale

Use SQL to instantly query Tailscale resources. Open source CLI. No DB required.

38
Emerging
340 turbot/steampipe-plugin-scaleway

Use SQL to instantly query instances, networks, databases, and more from...

38
Emerging
341 turbot/steampipe-plugin-equinix

Use SQL to instantly query infrastructure resources (e.g. servers, networks)...

38
Emerging
342 MTSWebServices/syncmaster-ui

Frontend for Syncmaster, no-code ETL tool. WIP

37
Emerging
343 kevin-hanselman/dud

A lightweight CLI tool for versioning data alongside source code and...

37
Emerging
344 turbot/steampipe-plugin-code

Use SQL to instantly query secrets and more from source code. Open source...

37
Emerging
345 polakowo/datadocs

Documentation for data enthusiasts

37
Emerging
346 turbot/steampipe-plugin-whois

Use SQL to instantly query WHOIS. Open source CLI. No DB required.

37
Emerging
347 turbot/steampipe-plugin-ansible

Use SQL to instantly query Ansible resources. Open source CLI. No DB required.

37
Emerging
348 ineelhere/forex-connect

Streamlit Connection to Explore Foreign Currency Exchange rates 💰 in real-time

37
Emerging
349 exasol/exasol-personal

The High-Performance Analytics Engine — Free for Personal Use

37
Emerging
350 turbot/steampipe-plugin-hackernews

Use SQL to instantly query stories, users and other items from Hacker News....

37
Emerging
351 turbot/steampipe-plugin-sentry

Use SQL to instantly query Sentry organizations, projects, teams and more....

37
Emerging
352 nationalarchives/ds-caselaw-ingester

Parse judgements from the Transformation Engine and load them into MarkLogic...

37
Emerging
353 turbot/steampipe-plugin-workos

Use SQL to instantly query resources from WorkOS. Open source CLI. No DB required.

37
Emerging
354 turbot/steampipe-plugin-zoom

Use SQL to instantly query meetings, users & more from Zoom. Open source...

37
Emerging
355 yanghaiji/JsonCleanseETL

JSONCleanseETL is a professional data cleansing and transformation tool designed to give users an efficient solution for processing JSON-formatted data....

37
Emerging
356 moj-analytical-services/etl_manager

A Python package to create a database on the platform using our moj data...

37
Emerging
357 turbot/steampipe-plugin-dockerhub

Use SQL to instantly query Docker Hub repositories, tags, tokens and more....

37
Emerging
358 turbot/steampipe-plugin-linear

Use SQL to instantly query Linear organizations, projects, teams, users &...

37
Emerging
359 netxs2000/devops

DevOps Data Application Platform...

37
Emerging
360 turbot/steampipe-plugin-mongodbatlas

Use SQL to instantly query MongoDB Atlas resources. Open source CLI. No DB required.

37
Emerging
361 BEKO2210/World_report

A self-updating global dashboard that aggregates 40+ open data sources...

36
Emerging
362 gopidesupavan/qualink

Data quality validation, profiling, anomaly detection, and YAML-driven...

36
Emerging
363 Zipstack/visitran

Modern, AI-native and agentic Pythonic data transformation platform.

36
Emerging
364 mindsdb/dbt-mindsdb

dbt adapter for connecting to MindsDB

36
Emerging
365 Amber-Williams/hackernews-whos-hiring

Real-time SQL database from Hacker News "hiring" thread

36
Emerging
366 turbot/steampipe-plugin-chaos

Chaos plugin for testing Steampipe with the craziest edge cases we can think...

36
Emerging
367 turbot/steampipe-plugin-ipinfo

Use SQL to instantly query ipinfo.io for IP address information. Open source...

36
Emerging
368 tshu-w/DBCopilot

Code and data for the paper "DBCᴏᴘɪʟᴏᴛ: Natural Language Querying over...

36
Emerging
369 joryeugene/dadbod-grip.nvim

Edit database tables like Vim buffers. Staged mutations + live SQL preview,...

36
Emerging
370 viadee/camunda-kafka-polling-client

Stream your process history to Kafka

36
Emerging
371 Smart-Shaped/chaM3Leon

By Smart Shaped s.r.l. (https://www.smartshaped.com/)

35
Emerging
372 turbot/steampipe-plugin-tfe

Use SQL to instantly query workspaces, runs and more from Terraform...

35
Emerging
373 sicara/sicarator

Instant Setup & Best Quality for Data Projects!

35
Emerging
374 turbot/steampipe-plugin-hibp

Use SQL to instantly query breaches, passwords, pastes and more from HIBP....

35
Emerging
375 turbot/steampipe-plugin-databricks

Use SQL to instantly query Databricks resources. Open source CLI. No DB required.

35
Emerging
376 turbot/steampipe-plugin-crtsh

Use SQL to instantly query crt.sh for certificates, log entries and more....

35
Emerging
377 Beyond-Finance/dataeng-de-technical-assessment

Public repo of Beyond Finance's technical assessment for Data Engineering candidates

35
Emerging
378 turbot/steampipe-plugin-trivy

Use SQL to instantly query advisories, vulnerabilities, packages, findings...

35
Emerging
379 DawnbrandBots/yaml-yugipedia

An automatically-updated collection of wikitexts from Yugipedia. Part of YAML Yugi.

35
Emerging
380 neokd/DataStorehouse

DataStoreHouse is an open-source project that aims to create a collaborative...

34
Emerging
381 turbot/steampipe-plugin-linkedin

Use SQL to instantly query LinkedIn for profiles, companies, connections &...

34
Emerging
382 turbot/steampipe-plugin-grafana

Use SQL to instantly query dashboards, data sources, users and more from...

34
Emerging
383 chalk-ai/chalk-go

Go client for Chalk

34
Emerging
384 ankiano/etl

Extract-transform-load (ETL) CLI tool for extracting small and medium data volume...

34
Emerging
385 turbot/steampipe-plugin-abuseipdb

Use SQL to instantly query IP abuse scores and more from AbuseIPDB. Open...

34
Emerging
386 MTSWebServices/syncmaster

No-code ETL tool, based on onETL + PySpark

34
Emerging
387 ccao-data/data-architecture

Codebase for CCAO data infrastructure construction and management

34
Emerging
388 SwellDB/SwellDB

The data system that answers anything.

34
Emerging
389 turbot/steampipe-plugin-ldap

Use SQL to instantly query users, groups, OUs and more from LDAP. Open...

34
Emerging
390 RustedBytes/audios-to-dataset

Convert your audio files into DuckDB or Parquet files

34
Emerging
391 Hardork/DataLoom

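DataLoom aims to provide complex data transformation and analysis services. Users upload data sources (MySQL, API, Excel, etc. are supported) and can create datasets from multiple data sources without having to worry about the data...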

34
Emerging
392 jroakes/SEODP

The SEO Data Platform automates SEO analysis, aggregating data from Google...

34
Emerging
393 bitroot/coflux

Open-source workflow engine. Orchestrate and observe computational workflows...

33
Emerging
394 frectonz/pg-when

Just say when.

33
Emerging
395 betoalien/PardoX

PardoX: The Hyper-Fast Data Engine

33
Emerging
396 equitusai/arcxa

Mapping intelligence for enterprise data migrations: schema mapping,...

33
Emerging
397 mahmoudparsian/data-warehousing

This repository is a place for the Data Warehousing course at the...

33
Emerging
398 crackcell/hpipe

Workflow engine for various computing systems.

33
Emerging
399 turbot/steampipe-plugin-twitter

Use SQL to instantly query tweets, users and followers from Twitter. Open...

33
Emerging
400 bogwi/sarpro

Blazing-fast Sentinel‑1 Synthetic Aperture Radar (SAR) GRD to GeoTIFF/JPEG...

32
Emerging