Seasons of Code

Street Fighter II - Reinforcement Learning

Machine Learning
Deep Learning

Mentors :

Vijay Balsubramaniam

Mentees :

5-10

Machine Learning is the buzzword! New to the world of ML? Here's a perfect place to begin! Reinforcement learning (RL) is a powerful machine learning technique that has gained significant attention in recent years due to its ability to solve complex decision-making problems. In RL, the goal is to build Intelligent Autonomous Agents (with no prior knowledge of their environment except the possible set of actions they could take) which interact with their environment to figure out the best way to gain rewards from the environment, much like humans!
The theoretical concepts we will cover will more or less follow: https://www.deepmind.com/learning-resources/introduction-to-reinforcement-learning-with-david-silver I'd recommend watching the first lecture to get an idea; and yes, the endeavor is to focus on theory + programming.
In theory we will explore: Markov Decision Processes, The (holy) Bellman Equation, Simple Q-Learning and its improvisations, Monte Carlo Methods and Temporal Difference Learning.
Here's a python tutorial reference (must read before applying!): https://www.w3schools.com/python/default.asp
In coding we will explore: Python libraries which facilitate Reinforcement Learning: Open AI Gym, retro-gym, numpy, tensorflow and pytorch and of course, Street Fighter.
A similar project: https://pytorch.org/tutorials/intermediate/mario_rl_tutorial.html
The central goal of this SoC project is to teach an autonomous agent to play Street Fighter!

Who should TAKE this project?
-- Someone who has a background in python. You just need to be familiar with classes, functions and conditionals. Refer to the link in the general description of the project. If you are new to python, get yourself comfortable with python before we start the actual project. -- Someone who isn't afraid of math, matrices and all that magic! -- Someone who is willing to devote time to appreciate the beauty of Machine Learning algorithms and the power of python. -- Someone who is determined to finish what they start (Yes, at times you might feel fed up, but the sheer of joy of an agent that trained for 10 straight hours acing the game is the epitome of satisfaction!)
Who should NOT TAKE this project? -- Someone who is not familiar with python and has never coded before at all. -- Someone who has no enthu for ML and no energy for hours of debugging the merciless errors thrown by their programs. -- Someone who has no interest in going through texts and reading published papers.

Tentative Timeline :

Week	Work
Week 0	Introduction to RL and review of Python background
Week 1	Review of MDPs and introducing Q-Learning Theory, playing with OpenAIGym
Week 2-3	Review of RL MC and TD Methods. More OpenAIGym
Week 4-5	Implementing a published paper from scratch. You can choose the paper from a set of papers I will share.(This will give you a glimpse of research spaces in RL)
Week 6-7	Street Fighter II (This will give you a glimpse of the applied spaces in RL)

Topic: All ML DEVELOPMENT BLOCKCHAIN CP MISCELLANEOUS

TEXT SUMMARIZATION WEB APP

Competitive Programming

Write yourself a Git!

File Compression System

FAST-G

Developing Trading Strategy with Pine Script

Real time Driver Drowsiness detection System

The Image Cartoonifier

Speech to Speech Translation

Competitive Programming - Newbie to Master

Path-Planning of Swarm Robotics in 2/3D space

Image Super Resolution using Deep Neural Networks

Enhance Low Resolution Image using GANs

MyBox

Deep Carlsen

InstiExchange - A web marketplace for IITB

Homomorphic Encryption for k-NN on the Cloud

TRayCer

Social media website with MERN

Dive into Digital Image Processing

Neural Quest

To the Quantum Future

Street Fighter II - Reinforcement Learning

Combinatorial Computing

Navigating the Waters of AI

Autonomous Driving Vehicle

Author Identification through Stylometric Analysis

Breakout Genius - Using RL to Build an AI Game Master

Image Captioning

Cricbuzz

Competitive Programming (CP)

Economics meets Machine Learning

SynerG Lab, CSE Department - Webpage

Computer vision for driverless vehicles

AudioHive A Social Audio App

Learning the Latent structure in LLMs

EdConnect

NFTs Where Art and Tech Converge

Dive into the World of Quant

COLLIDE

Image Processing and Object Detection

Unreal IITB

Human Pose Estimation

Find me out

Sign Language Recogniser

E-Commerce Website for VibSpecLab, IIT Bombay

Image Caption Generator

Intro Full stack web development:Restaurant website

App for Credit rating of Retailers in Clothing Industry

Ray Tracing

YOLO-Cam-Object Detection based Analytics

Physically Based Rendering

A Secure Erasure Code-Based Cloud Storage System with Secure Data Forwarding

Graph Machine Learning

“The Watchdogs” - Solving a murder mystery using Computer Vision and Data Science

ArgueAI

Comic GPT

InstiNav

Face detection for attendance using AI

Hands on Reinforcement

RegExamaton

Institute OnChain Voting System with ZKPs

Using Deep RL and NLP to allocate stocks in portfolio

Blockchain Development- It's not that difficult!

Light field imaging and Dual Attention Networks

Stable Diffusion

FlappeRL

Hands-on Computational Physics

Image Colorization

PaperPal

JobFinderX

Light (Rendering) Engine

Street Fighter II - Reinforcement Learning

Machine Learning

Deep Learning

Mentors :

Mentees :

Tentative Timeline :