π― Google Cloud Certified Professional Data Engineer with 4+ years of experience building robust, scalable data pipelines and cloud-native data solutions. I specialize in ETL/ELT workflows, cloud data warehousing, and API development using both GCP and AWS platforms.
Languages & Tools:
Python SQL Spark Airflow DBT FastAPI Pandas SQLAlchemy pytest
Cloud Platforms:
βοΈ Google Cloud β BigQuery, Dataproc, Cloud Run, Pub/Sub, Cloud Functions
βοΈ AWS β Glue, Lambda, S3, Athena, Kinesis
DevOps & Infra:
Terraform Docker GitHub Actions Cloud Monitoring CI/CD
- π Build data pipelines (batch & real-time) with tools like Airflow, Spark, and Dataproc
- β Apply data quality checks and testing using pytest for robust workflows
- π Design and implement secure APIs for internal/external data access
- π Migrate infrastructure using Infrastructure as Code (Terraform)
- β»οΈ Optimize cloud costs by managing GCP/AWS infrastructure efficiently
π Based in Bengaluru, India
π§ arvind2512patel@gmail.com


