Data engineering practice, including building data pipelines (ELT) from a variety of sources.
Building a hybrid data pipeline architecture that combines Microsoft Fabric, Azure, and Power BI, covering real-time data ingestion, multi-layered processing, and analytics for business-critical insights.
Building an ETL pipeline that extracts data from S3 and stages it in Redshift.
Summary notes on the Snowflake cloud data warehouse. (Complete ✅)
This project builds a cloud-based ETL pipeline for Sparkify to move data to a cloud data warehouse. It extracts song and user activity data from AWS S3, stages it in Redshift, and transforms it into a star-schema data model with fact and dimension tables, enabling efficient querying to answer business questions.
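A minimal sketch of the stage-then-transform pattern this describes; the bucket, IAM role, and table/column names are placeholder assumptions, not the project's actual schema:

-- Stage raw JSON event data from S3 into a Redshift staging table
COPY staging_events
FROM 's3://sparkify-bucket/log_data/'
IAM_ROLE 'arn:aws:iam::123456789012:role/redshift-copy-role'
REGION 'us-west-2'
FORMAT AS JSON 'auto';

-- Transform staged rows into a star-schema fact table
INSERT INTO songplays (start_time, user_id, song_id, artist_id, session_id)
SELECT TIMESTAMP 'epoch' + e.ts / 1000 * INTERVAL '1 second',
       e.user_id, s.song_id, s.artist_id, e.session_id
FROM staging_events e
JOIN staging_songs s
  ON e.song = s.title AND e.artist = s.artist_name
WHERE e.page = 'NextSong';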
Automating Data Workflows in Snowflake with Task Scheduling & Management.
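For readers new to Snowflake tasks, a minimal sketch of a scheduled task; the warehouse, tables, and schedule are illustrative assumptions:

-- Hourly task that refreshes a summary table
CREATE OR REPLACE TASK refresh_daily_sales
  WAREHOUSE = compute_wh
  SCHEDULE = 'USING CRON 0 * * * * UTC'
AS
  INSERT INTO daily_sales_summary
  SELECT order_date, SUM(amount) FROM orders GROUP BY order_date;

-- Tasks are created suspended; resume to start the schedule
ALTER TASK refresh_daily_sales RESUME;

-- Inspect recent executions
SELECT *
FROM TABLE(INFORMATION_SCHEMA.TASK_HISTORY())
ORDER BY scheduled_time DESC;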
This project demonstrates Snowflake Streams for change data capture. It covers creating streams to track INSERT, UPDATE, and DELETE operations on tables, loading data from S3, querying captured changes, and managing stream objects for real-time data monitoring.
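A short sketch of that stream workflow, with table names assumed for illustration:

-- Create a stream to capture DML changes on a base table
CREATE OR REPLACE STREAM orders_stream ON TABLE orders;

-- Changes to the base table are recorded by the stream
INSERT INTO orders (order_id, amount) VALUES (1, 100.00);
UPDATE orders SET amount = 120.00 WHERE order_id = 1;

-- METADATA$ACTION and METADATA$ISUPDATE describe each captured change
SELECT order_id, amount, METADATA$ACTION, METADATA$ISUPDATE
FROM orders_stream;

-- Consuming the stream in a DML statement advances its offset
INSERT INTO orders_audit
SELECT order_id, amount, METADATA$ACTION FROM orders_stream;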
The objective of this task is to create and configure a new virtual warehouse in Snowflake. Warehouses are crucial for query execution and data processing, as they provide the compute resources required to run SQL statements.
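A minimal example of such a warehouse definition; the name, size, and timeout are illustrative choices:

-- Create a small warehouse that suspends itself when idle
CREATE WAREHOUSE IF NOT EXISTS analytics_wh
  WAREHOUSE_SIZE = 'XSMALL'
  AUTO_SUSPEND = 300          -- seconds of inactivity before suspending
  AUTO_RESUME = TRUE
  INITIALLY_SUSPENDED = TRUE;

-- Point the current session at the new warehouse
USE WAREHOUSE analytics_wh;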
End-to-end pipeline analysing Yelp reviews using AWS S3, Snowflake, Python UDFs, and advanced SQL sentiment analysis.
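As a rough illustration of a Snowflake Python UDF for scoring review text (the function, word lists, and table/column names here are hypothetical, not the project's actual model):

-- Naive word-count sentiment score as a Python UDF
CREATE OR REPLACE FUNCTION simple_sentiment(review STRING)
RETURNS FLOAT
LANGUAGE PYTHON
RUNTIME_VERSION = '3.10'
HANDLER = 'score'
AS
$$
POSITIVE = {'good', 'great', 'excellent', 'love'}
NEGATIVE = {'bad', 'poor', 'terrible', 'hate'}

def score(review):
    words = (review or '').lower().split()
    return float(sum((w in POSITIVE) - (w in NEGATIVE) for w in words))
$$;

-- Average sentiment per business
SELECT business_id, AVG(simple_sentiment(text)) AS avg_sentiment
FROM yelp_reviews
GROUP BY business_id;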
Hands-on project covering Snowflake data loading with custom file formats, validation modes, error handling, string length limits, TRUNCATECOLUMNS, and analyzing load history using account_usage.load_history.
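For context, a sketch of the statements involved; the stage, table, and format names are assumptions:

-- Custom file format for CSV files with a header row
CREATE OR REPLACE FILE FORMAT csv_ff
  TYPE = 'CSV'
  FIELD_DELIMITER = ','
  SKIP_HEADER = 1
  FIELD_OPTIONALLY_ENCLOSED_BY = '"';

-- Dry run: report problems without loading any rows
COPY INTO customers FROM @my_s3_stage/customers/
  FILE_FORMAT = (FORMAT_NAME = 'csv_ff')
  VALIDATION_MODE = RETURN_ERRORS;

-- Real load; truncate strings that exceed the target column length
COPY INTO customers FROM @my_s3_stage/customers/
  FILE_FORMAT = (FORMAT_NAME = 'csv_ff')
  TRUNCATECOLUMNS = TRUE;

-- Review load outcomes across the account
SELECT table_name, status, row_count, error_count, last_load_time
FROM snowflake.account_usage.load_history
ORDER BY last_load_time DESC;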
This project demonstrates data sampling techniques in Snowflake. It covers loading datasets from S3, performing RANDOM and SYSTEM sampling to extract subsets, validating sampled data, and optimizing analysis of large datasets.
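A few illustrative sampling queries (table name assumed); BERNOULLI/ROW is Snowflake's row-level random method, while SYSTEM/BLOCK samples whole micro-partitions:

-- Row-level sampling: each row has a 10% chance of being selected
SELECT * FROM web_events SAMPLE BERNOULLI (10);

-- Block-level sampling: faster on large tables, less uniform
SELECT * FROM web_events SAMPLE SYSTEM (5);

-- Repeatable sample for validating downstream logic
SELECT * FROM web_events SAMPLE (10) SEED (42);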
Hands-on project showcasing error handling during Snowflake data loading, using VALIDATION_MODE, ON_ERROR = CONTINUE, ON_ERROR = SKIP_FILE, and the SKIP_FILE_<num> / SKIP_FILE_<num>% threshold variants while ingesting CSV files from AWS S3.
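A sketch of the three ON_ERROR behaviours, assuming a stage named @my_s3_stage and an orders table:

-- Skip bad rows but keep loading the rest of each file
COPY INTO orders FROM @my_s3_stage/orders/
  FILE_FORMAT = (TYPE = 'CSV' SKIP_HEADER = 1)
  ON_ERROR = 'CONTINUE';

-- Skip any file that contains at least one bad row
COPY INTO orders FROM @my_s3_stage/orders/
  FILE_FORMAT = (TYPE = 'CSV' SKIP_HEADER = 1)
  ON_ERROR = 'SKIP_FILE';

-- Skip a file only once it has 3 or more bad rows
COPY INTO orders FROM @my_s3_stage/orders/
  FILE_FORMAT = (TYPE = 'CSV' SKIP_HEADER = 1)
  ON_ERROR = 'SKIP_FILE_3';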
Moved, cleaned, and transformed JSON data stored in S3 into Redshift.
This project demonstrates how to use Snowflake stages for loading data from Amazon S3 into Snowflake tables. It also covers applying transformations during loading and selecting only specific columns from the source data.
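A compact sketch of that pattern; the bucket URL, stage name, and column positions are placeholders:

-- External stage pointing at an S3 bucket
CREATE OR REPLACE STAGE customer_stage
  URL = 's3://my-bucket/customer_data/'
  FILE_FORMAT = (TYPE = 'CSV' SKIP_HEADER = 1);

-- Load only selected source columns and transform them during the COPY
COPY INTO customers (customer_id, full_name, signup_date)
FROM (
  SELECT $1, UPPER($2), TO_DATE($4, 'YYYY-MM-DD')
  FROM @customer_stage
);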
This project explores Snowflake’s table types, including Permanent, Temporary, Transient, and External tables. It demonstrates creating tables, loading data from S3 stages, querying and validating data, and understanding differences in persistence, retention, and Time Travel support.
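For reference, the four table types side by side; the column definitions and the stage are illustrative:

-- Permanent: full Time Travel plus Fail-safe
CREATE OR REPLACE TABLE customers_perm (id INT, name STRING);

-- Transient: no Fail-safe, cheaper storage, limited Time Travel
CREATE OR REPLACE TRANSIENT TABLE customers_transient (id INT, name STRING);

-- Temporary: visible only to the current session
CREATE OR REPLACE TEMPORARY TABLE customers_temp (id INT, name STRING);

-- External: queries files in a stage without ingesting them
CREATE OR REPLACE EXTERNAL TABLE customers_ext (
  id INT AS (VALUE:c1::INT),
  name STRING AS (VALUE:c2::STRING)
)
LOCATION = @customer_stage
FILE_FORMAT = (TYPE = 'CSV' SKIP_HEADER = 1);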
This project explores Snowflake’s Time Travel feature, including querying historical data using offsets, retention periods, and query IDs. It demonstrates restoring previous table states after updates, managing retention settings, and recovering data efficiently.
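Illustrative Time Travel queries; the table name, offsets, and the statement ID are placeholders:

-- Query the table as it looked 5 minutes ago
SELECT * FROM orders AT (OFFSET => -60 * 5);

-- Query the state just before a specific statement ran
SELECT * FROM orders BEFORE (STATEMENT => '01b2c3d4-0000-1111-2222-333344445555');

-- Restore a previous state by cloning it under a new name
CREATE OR REPLACE TABLE orders_restored CLONE orders AT (OFFSET => -60 * 5);

-- Extend the retention window
ALTER TABLE orders SET DATA_RETENTION_TIME_IN_DAYS = 7;

-- Recover a dropped table within the retention window
DROP TABLE orders;
UNDROP TABLE orders;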
Building a cloud data warehouse with AWS Redshift.
This project demonstrates Snowflake table cloning and swapping techniques. It covers creating original and cloned tables, loading data from S3, verifying cloned data, and performing table swaps to efficiently exchange data between staging and production tables.
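The core statements behind that workflow, with staging and production table names assumed:

-- Zero-copy clone of the production table into a staging copy
CREATE OR REPLACE TABLE orders_staging CLONE orders;

-- ...load and validate new data in orders_staging, then swap...

-- Atomically exchange the staging and production tables
ALTER TABLE orders_staging SWAP WITH orders;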
🌨️ Load and transform data from Amazon S3 into Snowflake efficiently using stages, enhancing your data ingestion practices without altering source files.