Skip to content
#

cloud-data-warehouse

Here are 19 public repositories matching this topic...

Building a next-generation hybrid data pipeline architecture that combines the power of Microsoft Fabric, Azure Cloud, and Power BI. This pipeline is engineered to tackle the challenges of real-time data ingestion, multi-layered processing, and analytics, delivering business-critical insights.

  • Updated Dec 29, 2024
  • Python

This project builds a cloud-based ETL pipeline for Sparkify to move data to a cloud data warehouse. It extracts song and user activity data from AWS S3, stages it in Redshift, and transforms it into a star-schema data model with fact and dimension tables, enabling efficient querying to answer business questions.

  • Updated Sep 9, 2024
  • Jupyter Notebook

This project demonstrates Snowflake Streams for change data capture. It covers creating streams to track INSERT, UPDATE, and DELETE operations on tables, loading data from S3, querying captured changes, and managing stream objects for real-time data monitoring.

  • Updated Aug 17, 2025

Hands-on project covering Snowflake data loading with custom file formats, validation modes, error handling, string length limits, TRUNCATECOLUMNS, and analyzing load history using account_usage.load_history.

  • Updated Aug 17, 2025

This project demonstrates data sampling techniques in Snowflake. It covers loading datasets from S3, performing RANDOM and SYSTEM sampling methods to extract subsets, validating sampled data, and optimizing analysis on datasets.

  • Updated Aug 17, 2025

This project explores Snowflake’s table types, including Permanent, Temporary, Transient, and External tables. It demonstrates creating tables, loading data from S3 stages, querying and validating data, and understanding differences in persistence, retention, and Time Travel support.

  • Updated Aug 17, 2025

This project explores Snowflake’s Time Travel feature, including querying historical data using offsets, retention periods, and query IDs. It demonstrates restoring previous table states after updates, managing retention settings, and recovering data efficiently.

  • Updated Aug 17, 2025

This project demonstrates Snowflake table cloning and swapping techniques. It covers creating original and cloned tables, loading data from S3, verifying cloned data, and performing table swaps to efficiently exchange data between staging and production tables.

  • Updated Aug 17, 2025

Improve this page

Add a description, image, and links to the cloud-data-warehouse topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the cloud-data-warehouse topic, visit your repo's landing page and select "manage topics."

Learn more