Dinky is a real-time data development platform based on Apache Flink, enabling agile data development, deployment and operation.
-
Updated
Nov 7, 2025 - Java
Dinky is a real-time data development platform based on Apache Flink, enabling agile data development, deployment and operation.
An open-source columnar data format designed for fast & realtime analytic with big data.
Hydra九头龙,面向PB级别知识库取数、情报系统、数据平台、大规模控制调度系统。面向大规模数据采集、分析、智能取数。——以实现大规模分布式爬虫搜索引擎为例。
Roadmap for Data Engineering
Distributed, Column-oriented storage, Realtime analysis, High performance Database
Implement a complete data warehouse etl using spark SQL
Data Vault data model and ETL generator for Oracle Databases
Business Intelligence for support staff - Desktop Application
Hi Guys. I'm Biagio, teacher of Computer Science. This repository is where I share code co-developed during our lessons, providing interesting solutions to programming problems. Share your favorite one(s) with friends and colleagues, and if you have any suggestions or edits, I'll be happy to consider them.
Data Warehouse, Migration Tool. SQL and scripting execution engine and report builder
A data warehouse for a clothing shop using Oracle (XE) and talend as the ETL process. MySQL, Excel and document databases (JSON) are used as the main data sources. Diagrams related to the data warehouse are also included.
This project focuses on 400K movies, providing a comprehensive overview of the movie industry across various dimensions like genres, cast, directors, ratings, and more.
Add a description, image, and links to the datawarehouse topic page so that developers can more easily learn about it.
To associate your repository with the datawarehouse topic, visit your repo's landing page and select "manage topics."