experience
┌───────┐ │ │ │ │ │ │ └───────┘
alan turing institute
dec 2024 - april 2025 | research associate | collaboration: university of birmingham & u.s. army research institute
- developed a heterogeneous-agent reinforcement learning (harl) framework using human-proxy agents that simulate realistic human constraints and capabilities to improve ai-human collaboration in multi-agent systems.
- designed a cooperative grid-world capture environment based on stag hunt game theory, where machine agents had full observability but couldn't detect target health, while human-proxy agents had limited vision but unique disease detection abilities.
- conducted experiments across various environment configurations, varying disease probability and penalty severity to analyze cooperation patterns.
- demonstrated that rl agents trained with human-proxy teammates achieved superior cross-environment performance, with teams trained under moderate risk conditions showing 30-40% higher collaboration rates.
riskopsai
jun 2023 - aug 2024 | ai ml intern
- built a deep learning pipeline with tensorflow and pytorch using resnet-50 and transformer models for classification. used tensorrt to optimize inference, achieving 25% faster performance and 15% higher accuracy.
- created distributed ml infrastructure with apache airflow and mlflow on aws gpu clusters using horovod, reducing training time by 20%. set up automated data pipelines for feature engineering.
- developed a predictive analytics system using postgresql and bigquery with scikit-learn and xgboost on imbalanced data. optimized queries to improve decision-making efficiency by 30%.
srm institute of science & technology
oct 2022 - feb 2024 | researcher under dr. vaishnavi moorthy
- developed an autonomous navigation system using ros2 by fusing lidar, rgb-d, and imu data through an extended kalman filter. this improved localization accuracy by 15%.
- built a slam system using sac and trpo algorithms in pytorch to improve path planning with rrt* and a*. reduced navigation errors by 25% using dynamic obstacle avoidance.
- created a real-time perception pipeline using opencv and pcl, integrating yolov7 for object detection. achieved 20ms latency and 95% detection accuracy in changing environments.