Sai Pavan.

I am an ML Engineer

Building thoughtful, performant data experiences.

Projects.

🌙

Sleep Health Analysis

ML-powered sleep health analysis with predictive modeling and interactive visualizations (84% accuracy)

Python, Streamlit, Scikit-learn, Pandas
Data Science & ML, Live Demo
🚗

Real-Time Roadside Assistance

Event-driven serverless system for real-time vehicle breakdown detection with AI-powered emergency response

AWS, Python, Lambda, Kinesis, DynamoDB, Terraform
Cloud Architecture & IoT, Event-Driven
💬

Global Chat Platform

Real-time global chat application with WebSockets, multi-user support, and Redis message queue for horizontal scaling

Flask, WebSockets, Redis, Python, Docker
Real-time Web App, Live Chat

Redinai

AI-Powered Platform

Advanced AI-driven platform featuring intelligent automation, real-time analytics, and interactive data visualization with cutting-edge machine learning capabilities.

AI/ML, React, Next.js, TypeScript
🚀 Featured Project, Interactive Demo

Apple Clone

Pixel-perfect clone of Apple's website with modern animations

React, Three.js, GSAP
Web Development
📋

Diligent Project

Project management and tracking application for efficient workflows

Python, Flask, SQLAlchemy
Project Management
Find more on my GitHub

ML & AI Stack.

Apache Airflow
Skills & Expertise
  • Workflow Orchestration
  • DAG Management
  • Task Scheduling
  • Data Pipeline Automation
AWS Glue
Skills & Expertise
  • ETL Jobs
  • Data Cataloging
  • Schema Evolution
  • Serverless Processing
dbt
Skills & Expertise
  • Data Transformation
  • SQL Modeling
  • Testing & Documentation
  • Version Control
Prefect
Skills & Expertise
  • Flow Management
  • Task Dependencies
  • Error Handling
  • Monitoring & Alerts
Apache Kafka
Skills & Expertise
  • Stream Processing
  • Event Sourcing
  • Real-time Analytics
  • Message Queuing
Apache Spark
Skills & Expertise
  • Big Data Processing
  • In-memory Computing
  • MLlib Integration
  • Distributed Computing
AWS Kinesis
Skills & Expertise
  • Real-time Streaming
  • Data Ingestion
  • Analytics Processing
  • Auto Scaling
Apache Flink
Skills & Expertise
  • Stream Processing
  • Low Latency
  • Stateful Computations
  • Event Time Processing

Background.

Data Analyst

Hexaware

Remote

Jan 2020 – Mar 2022
  • Collected, cleaned, and validated large datasets from ERP, CRM, and internal databases to ensure data accuracy and consistency, improving analytics reliability across departments.
  • Designed and automated ETL pipelines using Python and SQL to streamline data integration and processing workflows, reducing data preparation time by 28%.
  • Built and maintained interactive Power BI dashboards to visualize KPIs, performance trends, and sales metrics, improving reporting efficiency by 35%.
  • Collaborated with cross-functional teams to identify data gaps and translate analytical findings into actionable recommendations, strengthening operational decision-making.
Python, SQL, Power BI, ETL, PostgreSQL

ML Engineer

Hexaware

Remote

Mar 2022 – Jul 2023
  • Transitioned into Machine Learning Engineering, leading development of predictive and analytical models to solve business problems.
  • Built and fine-tuned models using Random Forest, K-Means, Logistic Regression, and Gradient Boosting with Python, TensorFlow, and scikit-learn to deliver scalable, production-ready solutions.
  • Reduced preprocessing and training time by 25% through pipeline optimizations and efficient data handling on large-scale datasets.
  • Worked closely with engineering teams to productionize models and integrate ML workflows into existing data platforms.
Python, scikit-learn, TensorFlow, SQL, Spark, Docker

ML Engineer

UnitedHealthCare Group

Remote

Aug 2024 – Present
  • Developed predictive models for patient readmission risk and claims processing, increasing prediction accuracy from 78% to 92% and reducing high-risk cases by 12% through targeted interventions.
  • Built and enhanced classification, regression, and NLP models on large-scale claims and clinical datasets, reducing preprocessing and training time by 25% and accelerating analytics delivery for operations teams.
  • Applied transformer-based LLMs to automate clinical note summarization and patient feedback analysis, improving text classification accuracy by 18% and reducing manual review time for clinicians.
  • Converted raw claims and clinical data into structured formats, improving data reliability and reducing input errors by 10%, ensuring more accurate patient risk predictions.
  • Implemented explainable AI (SHAP, LIME) to interpret patient risk scores and NLP outputs, increasing stakeholder trust and ensuring compliance with HIPAA and internal audit standards.
  • Documented all models, code, and results for reproducibility and knowledge sharing, enabling efficient cross-team decision-making and supporting operational improvements.
Python, scikit-learn, TensorFlow, PyTorch, SQL, AWS, Spark, Transformers, SHAP, LIME

Graduate Teaching Assistant

Wichita State University

Wichita, KS

Aug 2024 – May 2025
  • Assisted in teaching Intro to Data Science under Professor Alden Wilner
  • Conducted lab sessions and provided one-on-one tutoring for students
  • Developed course materials and programming assignments for data science concepts
  • Graded assignments and provided constructive feedback to students on data analysis projects
Python, Data Science, Statistics, Pandas, NumPy

Achievements.

☁️

Azure Administrator

Microsoft Certified

Demonstrated expertise in implementing, managing, and monitoring Azure cloud environments.

☁️

GCP Associate

Google Cloud Certified

Proficient in Google Cloud Platform services, architecture, and best practices for cloud solutions.

📊

ML & AI Engineering Excellence

Performance Optimization

Reduced data processing time by 60% through optimized pipeline architectures.

🤝

Team Leadership

Mentorship & Collaboration

Mentored junior engineers and improved team productivity by 40%.

🔬

Research Excellence

Graduate Research Assistant

Published research on machine learning applications in data engineering.

☁️

AWS Cloud Fundamentals

Amazon Web Services

Strong foundation in AWS cloud concepts, services, and basic architectural principles.

Thoughts.

Building Scalable Data Pipelines with Apache Airflow

Learn how to design and implement robust data pipelines using Apache Airflow with best practices for production deployments.

Machine Learning, Apache Airflow, Python
Dec 2024, 8 min read

Optimizing Spark Jobs for Large-Scale Data Processing

Deep dive into Spark optimization techniques, memory management, and performance tuning for big data workloads.

Apache Spark, Big Data, Performance
Nov 2024, 12 min read

Data Governance in Modern Data Platforms

Implementing comprehensive data governance frameworks to ensure data quality, security, and compliance.

Data Governance, Security, Compliance
Oct 2024, 10 min read

About.

My Story

I'm Sai Pavan, a passionate ML/AI Engineer with over 5 years of experience building scalable data platforms and real-time analytics solutions. My journey in technology began with a curiosity about how data can drive meaningful insights and decisions.

I am currently pursuing my Master's in Computer Science at Wichita State University while working as a Senior ML Engineer. I specialize in modern ML/AI systems, model development, and productionizing scalable AI solutions.

When I'm not coding or architecting data systems, you'll find me exploring new technologies, contributing to open-source projects, or sharing knowledge through technical writing and mentoring.

What I Do

Machine Learning Engineering

Building robust ETL/ELT pipelines, real-time data streaming, and scalable data warehouses.

Cloud Architecture

Designing cloud-native solutions with AWS, implementing infrastructure as code, and optimizing costs.

Machine Learning

Developing ML models for predictive analytics, recommendation systems, and automated decision-making.

Leadership

Mentoring junior engineers, leading technical initiatives, and driving engineering excellence.

Beyond Code

Open Source, Technical Writing, Photography, Hiking, Chess, Cooking, Music, Travel

Contact.

Get In Touch

I'm always interested in discussing new opportunities and interesting projects, or just chatting about technology, ML/AI engineering, and model development. Feel free to reach out!

Ready to Work Together?

Whether you have a project in mind, need consulting, or just want to connect, I'd love to hear from you.