azmatsiddique (Azmat) · GitHub
Skip to content
View azmatsiddique's full-sized avatar
:octocat:
:octocat:

Block or report azmatsiddique

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
azmatsiddique/README.md

👨‍💻 About Me

🏢 Module Lead Software Engineer @ Impetus Technologies

📍 Gurugram, Haryana, India

💼 7+ years in Big Data, Data Engineering & AI

🔭 Working on LLMs, Generative AI, Intelligent Search & Agentic AI

💬 Ask me about Apache Spark, PySpark, AWS, GCP, Airflow, Databricks, LLMs, SageMaker, Bedrock

🎓 Certifications: Stanford ML Certified · Apache Airflow DAG Authoring · Intelligent Search Technical Expert · AWS Certified ML (SageMaker) · Professional Data Engineer

📝 Published researcher in Data Migration & Data Engineering Productivity

🏆 Recognitions: Apache Airflow Certified · Scalable Data Solutions Leader


🤝 Open For

Hiring Collaboration Speaking Mentoring


🔥 GitHub Analytics


🛠️ Tech Stack

📊 Big Data & Data Engineering

spark pyspark airflow databricks hadoop hive kafka hdfs etl sql hbase sqoop

🤖 AI, ML & Generative AI

llms genai bedrock sagemaker intelligent search sklearn tensorflow pandas numpy matplotlib ai agents prompt engineering

☁️ Cloud & DevOps

aws gcp s3 glue emr athena redshift jenkins git jira

💻 Languages & Databases

python sql sparksql mysql hbase linux vscode pycharm

🏢 Experience

🏢 Company 💼 Role 📅 Duration
Impetus Technologies Module Lead Software Engineer Dec 2025 – Present
Publicis Sapient Senior Associate Consultant Apr 2024 – Dec 2025
Moody's Ratings Software Engineer Mar 2023 – Mar 2024
Capgemini Associate Consultant Jul 2021 – Mar 2023
KVCH Data Engineer Jan 2019 – Jun 2021
TcsIon Hadoop Developer Jul 2018 – Dec 2018

🎓 Certifications & Badges

GCP Cloud Architect GCP Data Engineer AI/ML Pre-sales Data Analytics Pre-sales Intelligent Search Vertex AI Gemini App Modernization Database Engineer Networking Infrastructure Security Credly Top Earner Airflow AWS SageMaker Stanford ML



🏅 Certification / Badge 🏢 Issuer 📊 Status
☁️ Professional Cloud Architect (×2) Google Cloud ✅ Certified
📊 Professional Data Engineer (×2) Google Cloud ✅ Certified
🤖 AI/ML Pre-sales Technical Expert Google Cloud ✅ Certified
📈 Data Analytics Pre-sales Technical Expert Google Cloud ✅ Certified
🔍 Intelligent Search Technical Expert Google Cloud ✅ Certified
🧠 Build with Vertex Technical Expert Google Cloud ✅ Certified
💬 Gemini Enterprise for Customer Experience Google Cloud ✅ Certified
🚀 Application Modernization Pre-sales Expert Google Cloud ✅ Certified
🗄️ Database Engineer Pre-sales Expert Google Cloud ✅ Certified
🌐 Networking Pre-sales Technical Expert Google Cloud ✅ Certified
🏗️ Infrastructure Modernization Pre-sales Expert Google Cloud ✅ Certified
🔒 Security Pre-sales Technical Expert Google Cloud ✅ Certified
🏆 Credly Top Badge Earner of 2024 Credly ✅ Awarded
🎖️ DAG Authoring for Apache Airflow Astronomer ✅ Certified
🔬 ML Algorithms in SageMaker AWS ✅ Certified
🎓 Machine Learning Stanford University ✅ Completed

🔗 View all verified badges on Credly →


📚 Publications

📄 Publication
Design Environment of Live Data Migration Database System
Live Data Migration Approach From Relational Tables To Schema-free with RDD
Enhancing Productivity in Data Engineering: Proven Strategies

🚀 Profile Summary

BigData GenAI Cloud

🤝 Let's Connect!

I'm always open to exciting collaborations in Data Engineering & AI!

LinkedIn Email Gmail

📫 Reach out — I'd love to hear from you!


⭐ From Azmat with ❤️

Pinned Loading

  1. AI_ed AI_ed Public

    learning

    Python 2

  2. data-time-machine data-time-machine Public

    Python 1

  3. audio-transcript-cli audio-transcript-cli Public

    Python

  4. schema-sync-tool schema-sync-tool Public

    Python

  5. dag-cost-tracker dag-cost-tracker Public

    Python 1

  6. pipemedic pipemedic Public

    Python 16