sqoop
Here are 28 public repositories matching this topic...
This repository gathers a curated list of resources for learning Hadoop for free.
Updated Jun 10, 2018 - Python
IBIS is a workflow creation-engine that abstracts the Hadoop internals of ingesting RDBMS data.
Updated Apr 13, 2022 - Python
This project aims to move data from a relational database management system (RDBMS) to the Hadoop Distributed File System (HDFS).
Updated Apr 29, 2022 - Python
Learn Big Data tools and frameworks through examples, POCs, and pet projects.
Updated Jul 29, 2022 - Python
Solving the Restaurant User Review Data Pipeline scenarios using shell scripts, Python, Sqoop, and HDFS.
Updated Dec 31, 2021 - Python
Performed business operations using Big Data technologies: AWS EMR, AWS RDS (MySQL), Hadoop, Apache Sqoop, Apache HBase, and MapReduce.
Updated Sep 20, 2023 - Python
Large-Scale Data Pipeline Migration from Mainframe to Hadoop (Hadoop, Spark, Hive, Sqoop, Oozie, MySQL). Migrated a legacy mainframe data warehouse to a modern Hadoop-based big data ecosystem, enabling scalable storage, faster analytics, and automated workflows.
Updated Nov 12, 2025 - Python
A data pipeline on GCP Dataproc using Sqoop, HDFS, Hive, and PySpark to implement SCD Type 2 for an e-commerce use case. Tracks customer and product changes (e.g., address, price) and their impact on sales, demonstrating scalable data warehousing and processing.
Updated Mar 2, 2025 - Python