Snowflake

Snowflake

A cloud-native data warehouse platform that separates storage and compute, enabling elastic scaling and pay-per-use pricing.

Snowflake is a cloud-native data warehouse platform built for the modern data stack. Unlike traditional data warehouses, Snowflake was designed from the ground up for the cloud, offering unique architecture that separates storage and compute resources.

Key Architecture Features

Snowflake uses a multi-cluster shared data architecture that consists of three layers:

1. Database Storage Layer: Data is stored in a compressed, columnar format in cloud object storage (AWS S3, Azure Blob, or Google Cloud Storage). This layer is fully managed and automatically optimized.

2. Query Processing Layer (Virtual Warehouses): Compute clusters that execute queries independent of storage. You can spin up multiple warehouses of different sizes without affecting each other.

3. Cloud Services Layer: Handles authentication, infrastructure management, metadata, query optimization, and access control.

Why Data Engineers Choose Snowflake

- Zero Management: No indexes to tune, no partitions to manage
- Instant Elasticity: Scale compute up/down in seconds
- Concurrency: Multiple workloads without resource contention
- Time Travel: Query historical data up to 90 days back
- Data Sharing: Share live data across organizations securely
- Semi-structured Data: Native support for JSON, Avro, Parquet

Snowflake vs Traditional Data Warehouses

Traditional on-premise solutions like Teradata or Oracle require significant hardware investment and maintenance. Snowflake eliminates this with its SaaS model, offering true pay-per-second pricing and automatic performance optimization.

Common Use Cases

- Data Lakes: Combine structured and semi-structured data
- Data Engineering: Build scalable ETL/ELT pipelines
- Data Science: Run ML workloads with Snowpark
- Business Intelligence: Power dashboards with fast queries

Key Points

Cloud-native architecture separating storage and compute
Pay-per-second pricing model
Zero-maintenance with automatic optimization
Time Travel feature for historical data access
Native support for semi-structured data (JSON, Parquet)

Frequently Asked Questions

What is Snowflake used for?

Snowflake is primarily used as a cloud data warehouse for storing, processing, and analyzing large volumes of structured and semi-structured data. It supports data engineering, analytics, data science, and data sharing use cases.

Is Snowflake a database or data warehouse?

Snowflake is a cloud data warehouse, not a traditional transactional database. It is optimized for analytical workloads (OLAP) rather than transactional operations (OLTP). However, it can store and query data like a database.

How does Snowflake pricing work?

Snowflake uses a consumption-based pricing model. You pay separately for storage (per TB/month) and compute (per credit consumed). Compute is charged per-second with a 60-second minimum, so you only pay when queries are running.

What is Snowflake Time Travel?

Time Travel is a Snowflake feature that lets you access historical data at any point within a defined retention period (up to 90 days). You can query, clone, or restore data as it existed at a specific timestamp.

← Back to Glossary

Last updated: 2026-01-21

Published by

Sainath Reddy

Data Engineer at Anblicks

🎯 4+ years experience

About Me → LinkedIn