Saketh Reddy Dodda - Data Engineer
Building scalable ETL pipelines and cloud-based data solutions that improve throughput by 40% and drive faster data-driven decisions.
Boulder, CO (Open to Relocate)
About Me
I'm a Data Engineer with 3+ years of experience building robust ETL pipelines, automating database migrations, and creating BI reporting solutions on AWS and GCP. I am currently pursuing my MS in Data Science at the University of Colorado Boulder, where I hold a 4.0 GPA.
My passion lies in designing scalable data architectures that transform raw data into actionable insights, helping organizations make faster, data-driven decisions.
🏆 Microsoft Certified: Power BI Data Analyst Associate
Key Achievements
Education
Master of Science in Data Science
Technical Arsenal
A comprehensive toolkit of technologies and frameworks that power modern data solutions
Programming & Scripting
Data Engineering
Databases
Cloud Platforms
Visualization
DevOps
Professional Journey
Transforming data into actionable insights across diverse industries
- Analyzed course data to identify performance patterns
- Improved student pass rates through data-driven insights
- Built comprehensive Tableau dashboards that reduced support turnaround time
- Led Thomson Reuters platform engineering initiatives
- Migrated years of Oracle data to PostgreSQL, improving query performance
- Designed scalable AWS Glue pipelines and PySpark workflows
- Automated ETL processes, reducing SLA breaches
- Delivered Power BI dashboards that enhanced transparency
- Built robust PySpark ETL pipelines for processing daily sensor data
- Implemented data quality checks that reduced system outages
- Optimized data processing workflows for real-time analytics
Featured Projects
Showcase of technical projects demonstrating practical application of skills and innovation
Multilingual Podcast Translation Service
Advanced GCP pipeline automating transcription and translation for 14+ languages using enterprise-grade cloud architecture.
- Reduced manual overhead by 80%
- Kubernetes orchestration & microservices
- Pub/Sub messaging & BigQuery analytics
- REST APIs for seamless integration
Real-time Stock Market Pipeline
High-performance data pipeline delivering sub-1-minute market insights through automated AWS infrastructure.
- Apache Kafka streaming on AWS EC2
- Python automation & data processing
- AWS Glue ETL & Athena querying
- Sub-1-minute insight delivery
Campus Tree Health Dashboard
Comprehensive tree health analysis for CU Boulder campus using advanced statistical methods and geospatial visualization.
- Python data analysis & processing
- Z-score & IQR outlier detection
- Power BI geospatial dashboard
- Interactive campus tree mapping
Certifications
Professional credentials that validate expertise and commitment to continuous learning
Let's Connect
Ready to discuss data engineering opportunities or collaboration? Reach out through any channel below.