SRIVATSAV009 (SAI SRIVATSAV DHARINI) · GitHub
Skip to content
View SRIVATSAV009's full-sized avatar

Block or report SRIVATSAV009

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
SRIVATSAV009/README.md

👋 Hi, I'm Sai Srivatsav Dharini

Profile Views

🚀 Senior Data Engineer | Cloud-Native | Distributed Systems | AI-Augmented Platforms

I design and build scalable, production-grade, cloud-native data platforms across AWS, Azure, and GCP.
Specialized in distributed processing, real-time ingestion, and modern lakehouse architectures.


🌍 About Me

  • 🔹 4+ years of experience in Data Engineering & Analytics
  • 🔹 Expertise in AWS, Azure & Snowflake ecosystems
  • 🔹 Strong in Spark, PySpark, Delta Lake & distributed systems
  • 🔹 Experience in Financial & Healthcare domains
  • 🔹 Passionate about scalable, fault-tolerant data architectures
  • 🔹 Focused on AI-integrated data engineering workflows

🛠️ Core Data Engineering Stack

☁️ Cloud Platforms

Commit changes

AWS Azure GCP


🔄 Data Processing & Lakehouse

Apache Spark PySpark Databricks Delta Lake AWS Glue


🗄️ Data Warehousing & Analytics

Snowflake Amazon Redshift BigQuery


🔁 Streaming & Orchestration

Kafka AWS Kinesis Apache Airflow AWS Step Functions


💻 Programming & Querying

Python SQL Scala


🧰 DevOps & Infrastructure

Terraform CI/CD Docker GitHub Actions


🏗️ Architecture Specialization

  • Serverless ETL & ELT pipelines
  • Lakehouse (Bronze / Silver / Gold) Architecture
  • Incremental & CDC ingestion patterns
  • Near real-time streaming pipelines
  • Distributed Spark optimization
  • Columnar storage (Parquet / Delta)
  • Data contracts & governance
  • Infrastructure as Code (IaC)
  • AI & LLM integration in data workflows

📊 GitHub Analytics

GitHub Stats

GitHub Streak

Activity Graph


🚀 Currently Building

  • 🔹 Real-time Retail Analytics Platform
  • 🔹 AI-Augmented Data Pipelines
  • 🔹 Snowflake Lakehouse Implementations
  • 🔹 End-to-End Cloud Data Engineering Projects

📫 Connect With Me

LinkedIn Email


⚡ Engineering Philosophy

Build scalable.
Automate everything.
Optimize performance.
Design for failure.
Think distributed.


⭐ If you like building scalable data systems, let’s connect.

Pinned Loading

  1. Serverless-ETL-Pipeline-using-AWS-Lambda-and-Snowflake Serverless-ETL-Pipeline-using-AWS-Lambda-and-Snowflake Public

    Serverless ETL pipeline using AWS Lambda, Glue (Spark), S3, Step Functions, and Snowflake with Snowpipe for automated, scalable job data ingestion.

    Python 1

  2. Adventure-Works-Sales-Data-Analysis-2019 Adventure-Works-Sales-Data-Analysis-2019 Public

    End-to-end sales data analysis using star schema modeling, SQL data warehouse design, and Power BI dashboards for KPI-driven insights.

    TSQL 1

  3. vehicle_insurance_fraud_detection vehicle_insurance_fraud_detection Public

    Vehicle Insurance Fraud Detection: An ML classification project to flag suspicious claims (staged accidents, phantom passengers, false injury) using historical claims data and feature engineering.

    Jupyter Notebook 1

  4. Ipl-Data-Analytics Ipl-Data-Analytics Public

    Data analysis of iPhone pricing, discounts, and customer ratings on Flipkart. Built with Python & Pandas. Explores pricing strategies across 62 products with actionable insights

    Jupyter Notebook

  5. data-engineering-portfolio data-engineering-portfolio Public

    Python