Hi, I'm Satwik

I'm a Data Scientist

Download My Resume
About Me

Who Am I?

Hi, I'm Satwik — a data enthusiast driven by curiosity and a passion for solving real-world problems with machine learning and AI . With around 1 year of industry experience in data science and AI roles and over 2.5 years of research experience exploring the interdisciplinary applications of AI across healthcare , energy , speech processing , and NLP , I thrive at the intersection of data , algorithms , and impact .

I'm deeply committed to continuous learning , pushing the boundaries of what's possible, and contributing to projects that create meaningful change . My journey in data science began during my undergraduate studies at Amrita University, where I discovered the transformative power of AI through hands-on research projects and collaborations with esteemed professors. This foundation has shaped my approach to problem-solving: methodical, innovative, and always focused on real-world applications.

Currently pursuing my Master's degree at UIUC, I'm expanding my expertise in advanced statistical methods and cutting-edge machine learning techniques. Whether it's building scalable data pipelines, developing predictive models, or exploring the latest in generative AI, I approach each challenge with enthusiasm and a commitment to excellence.

Contact me: satwikreddy987@gmail.com
Current Location: Champaign, IL

Organizations I've Worked With

My Expertise

Technical Skills

Frameworks & Libraries

NumPyPandasApache Spark (PySpark)Scikit-learnPyTorchHugging FaceTensorFlowXGBoostLangChainCrewAIChromaHadoopMatplotlibasyncioOpenCV

Specializations

Object-Oriented ProgrammingData Structures and AlgorithmsMachine LearningHigh Performance ComputingPerformance EngineeringGenerative AIData MiningDistributed SystemsData AnalysisLow Latency SystemsData VisualizationDatabase Management SystemsSystem DesignArtificial IntelligenceTestingObject-Oriented DesignData StructuresData CleaningNatural Language ProcessingData AnalyticsPredictive ModelingCI/CDAgile

Programming Languages

Python (Advanced)SQL (Advanced)Java (Intermediate)MATLAB (Intermediate)

Tools & Cloud

DockerGitJiraPowerBITableauAWS (S3, SageMaker, Kinesis, Glue, Athena, Lambda, QuickSight)

Platforms

LinuxMacOSWindows
Experience

Work Experience

Volkswagen Group
Data Science Intern
Sep 2024 – Feb 2025 | Pune, India

Python, SQL, PySpark, AWS, REST APIs, Postman

  • Developed an end-to-end HR analytics solution on AWS to support workforce decision-making for 70,000+ employees, integrating OAuth-secured data from SAP SuccessFactors and internal enterprise sources.
  • Engineered a robust ELT pipeline using API-based data ingestion, AWS Glue with PySpark scripts for scalable and efficient data transformations, and Amazon S3 for storage of raw and processed datasets.
  • Accelerated data querying by 30% with optimized Amazon Athena queries and partitioning strategies, and built a near real-time, interactive dashboard in Amazon QuickSight for HR analytics.

Amrita Hospital
AI Intern
Nov 2023 – Feb 2024 | Kochi, India

Python, PyTorch, NumPy, Pandas, Matplotlib, GANs

  • Built and integrated a PyTorch-based image classifier into a healthcare web app, enabling automated diagnosis support using 25,000+ clinical images.
  • Designed and implemented a complete data preprocessing pipeline—including cleaning, normalization, resizing, and artifact removal—to prepare high-quality training data.
  • Fine-tuned ResNet, EfficientNet, and Vision Transformer (ViT) on GAN-augmented datasets, boosting classification accuracy by 20%.

Amrita School of Artificial Intelligence
Undergraduate Research Assistant
Jan 2022 – Aug 2024 | Coimbatore, India

  • Worked under the guidance of four senior faculty members: Dr. K.P. Soman, Dr. Sowmya V, Dr. Vinayakumar R, and Dr. Sachin Kumar S on interdisciplinary research projects across healthcare, speech processing, energy systems, and e-commerce.
  • Handled diverse data types including images, audio, text, structured tabular (CSV), and time-series data, adapting preprocessing pipelines accordingly.
  • Gained hands-on experience in deep learning, machine learning, natural language processing, signal processing, time-series forecasting, generative AI, and statistical modeling.
  • Contributed to the full ML lifecycle—data preprocessing, model development, evaluation, and implementation—with multiple projects leading to peer-reviewed publications.
Education

Academic Background

University of Illinois Urbana-Champaign
Master of Science - MS, Statistics (Data Science)
August 2025 - May 2027

GPA: 4.0/4.0

I am currently pursuing my Master of Science in Statistics at the University of Illinois Urbana-Champaign, Illinois, USA. I am delving deeper into mathematical concepts to further strengthen my expertise in data science and artificial intelligence. Additionally, I am working on several exciting projects in data science and generative AI.

Relevant Coursework (In Progress):
Statistics and Probability II, Statistical Modeling I, Advanced Data Analysis

Amrita Vishwa Vidyapeetham, Coimbatore
Bachelor of Technology - BTech, Computer Science and Engineering (Artificial Intelligence)
August 2021 - May 2025

Grade: 3.75/4 (First Class with Distinction)

I completed my Bachelor of Technology in Computer Science and Engineering with a specialization in Artificial Intelligence from Amrita Vishwa Vidyapeetham, Coimbatore, India. During my undergraduate studies, I conducted research under the guidance of senior professors, applying AI to interdisciplinary domains such as healthcare, speech processing, energy systems, e-commerce, and natural language processing, resulting in several peer-reviewed, Scopus-indexed international publications. Beyond academics, I served as the lead of the Nature Club, organized multiple cultural fests, and conducted technical workshops for freshmen during college tech fests. I also pursued internships to broaden my technical skill set. My undergraduate journey was a challenging yet rewarding phase that taught me independence, adaptability, and self-sufficiency.

Relevant Coursework:
1. Machine Learning & AI: Reinforcement Learning, Deep Learning for Signal & Image Processing, AI in Natural Language Processing, AI in Speech Processing
2. Data Science Foundations: Statistics & Probability, Linear Algebra, Optimization, Calculus
3. Data Engineering & Systems: Big Data Analytics, Big Data and Database Management, Cloud Computing, Database Management Systems
4. Computer Science Fundamentals: Data Structures & Algorithms, Computer Networks

My Work

Projects

AI-Powered Trading Intelligence Platform

"A lightweight research cockpit that ingests market & macro data, computes signals, and orchestrates multi-agent LLM reasoning."

Tech Stack: Python, CrewAI, LangChain, ChromaDB, Streamlit, Joblib

4-Wheeled Autonomous Robot Navigation Simulation

"A ROS2/Gazebo simulation of an autonomous 4-wheeled robot with lidar/camera sensing, path planning, and OpenCV-based traffic sign classification for intelligent navigation."

Tech Stack: Python, Gazebo, ROS2, OpenCV, Yolov5

Depression Detection from Reddit Posts using NTK and KANs

"ML classification of depression from Reddit text using TF-IDF features with Neural Tangent Kernel and Kolmogorov-Arnold Networks."

Tech Stack: Python, PyTorch, Scikit-ntk, Pykan, NumPy, Matplotlib

Arduino-Based Dual-Axis Solar Tracking System

"A dual-axis solar tracking system with environmental monitoring capabilities for temperature, humidity, and rainfall detection to optimize solar energy harvesting."

Built With: Arduino, C/C++, Servo Motors, Stepper Motors, Environmental Sensors

Real-Time Driver Drowsiness Detection System

"A computer vision-based safety system that tracks driver's eye movements to detect drowsiness and triggers buzzer alerts for enhanced road safety."

Built With: Python, OpenCV, dlib, Raspberry Pi 3, Computer Vision

Research Work

Publications

Accurate Estimation of Cargo Power Using Machine Learning Algorithms

Venkata Siva Manoj, A., N. Sai Satwik Reddy, V. Venkata Alluri Rohith, V. Sowmya, and Vinayakumar Ravi
Analytics Modeling in Reliability and Machine Learning and Its Applications, pp. 213-236. Cham: Springer Nature Switzerland, 2025.
Published: 21 January 2025

Transfer Learning-Based Emotion Recognition Using Augmented Speech Data

Reddy, N. Sai Satwik, V. Venkata Alluri Rohith, V. Poorna Muni Sasidhar Reddy, and Y. Shashank Reddy
2024 15th International Conference on Computing Communication and Networking Technologies (ICCCNT), pp. 1-7. IEEE, 2024.
Published: 04 November 2024

Fast Iterative Filtering-Based Deep Belief Network for Accurate Short-term Electric Load Forecasting

Sai Satwik Reddy, N., A. Venkata Siva Manoj, Neethu Mohan, S. Sachin Kumar, and K. P. Soman
International Conference On Innovative Computing And Communication, pp. 521-530. Singapore: Springer Nature Singapore, 2024.
Published: 27 September 2024

Enhancing Product Categorization in E-commerce using NLP and Machine Learning

Reddy, N. Sai Satwik, V. Venkata Alluri Rohith, P. Sai Abhiram, M. Devi Siva Rama Saran, and Samya Rebecca
2024 International Conference on Inventive Computation Technologies (ICICT), pp. 1-6. IEEE, 2024.
Published: 07 June 2024

Transfer Learning Approach for Differentiating Parkinson's Syndromes Using Voice Recordings

Reddy, N. Sai Satwik, A. Venkata Siva Manoj, V. Poorna Muni Sasidhar Reddy, Aadharsh Aadhithya, and V. Sowmya
International Advanced Computing Conference, pp. 213-226. Cham: Springer Nature Switzerland, 2023.
Published: 26 March 2024

Classification of Colorectal Cancer Tissue Utilizing Machine Learning Algorithms

Reddy, N. Sai Satwik, A. Venkata Siva Manoj, and V. Sowmya
International Advanced Computing Conference, pp. 397-409. Cham: Springer Nature Switzerland, 2023.
Published: 26 March 2024

A Fast Iterative Filtering Method for Efficient Denoising of Phonocardiogram Signals

Reddy, N. Sai Satwik, V. Poorna Muni Saisdhar Reddy, Neethu Mohan, Sachin Kumar, and Soman KP
2023 3rd International Conference on Intelligent Technologies (CONIT), pp. 1-6. IEEE, 2023.
Published: 07 August 2023

Beyond Academics

Extra Curricular Activities

Workshop on LLMs
October 2024

Organized a workshop on Large Language Models for freshmen at Anokha Tech Fest, Amrita University

IEEE Idea-thon
April 2024

Conducted idea-thon as IEEE student member at Amrita University

Nature Club Lead
June 2022 – June 2024

Led sapling plantation event on World Environment Day for three consecutive years as the Lead of the Nature Club at Amrita University

Get in Touch

Contact

satwikreddy987@gmail.com

Champaign, IL

Drop me a message