Welcome to my space! 👋
I am a data scientist with a passion for creating innovative solutions to complex problems. I completed my B.Tech in Computer Science and Engineering (with a minor in Economics) from Shiv Nadar University in 2022.
Currently, I am working as a Manager at American Express, leading a cross-functional team of analysts. I leverage data to develop innovative strategies for global operations. My experience spans designing predictive models, optimizing processes, and implementing data-driven solutions that enhance efficiency.
Previously, I was working as a researcher with the Inria-Modal Group to identify and isolate the causes of suicides in Nord-Pas de Calais region in France using statistical modelling and spatial econometrics. In 2021, I worked as a Data Science for Social Good fellow at the University of Warwick in collaboration with the German Federal Minsitry for Economic Affairs (BMWK). I also dabble in Machine Learning Engineering as an open-source collaborator at the AI4Bharat Lab at IIT-Madras.
When I am not in front of a computer screen, I enjoy hiking to remote lands and cycling. I am always open to a match of lawn tennis or badminton. You can often find me reading Jeffrey Archer or humming to The Lumineers. I love indie and rock music ♪. You can know me better from my music. In my last, Myers-Briggs personality test, I tested as an ENTJ.
Download my resumé .
Organizational Leadership, 2022-Present
Northwestern University
B.Tech in Computer Science and Engineering, 2018-2022
Shiv Nadar University
Summer Institute in Computational Social Science, 2022
International Institute of Information Technology, Hyderabad
My technical skills and core competencies
AI4Bharat is a Research Center at IIT Madras focused on developing ML-based solutions for Indian communities. I was working as a contributor for the Shoonya Project and the Chitralekha Project.
Shoonya is a data labeling and annotation tool built to enhance digital presence of under-represented languages in India. Chitralekha is an open source platform tool for video subtitling across various Indic languages, using ML model support. Both the platforms are currently being used by 100+ annotators across India who are skilled translation experts.
Developed a React and Django based web portal to annotate and store the text translations from English to Indian languages and vice-versa.
Wrote efficient code for the logic of the APIs and Webhooks used by the platform for both internal and external software functionalities.
Implemented multiple async functionalities in the project using celery and Django to speed up the application.
Worked on developing and automating financial trading strategies using seminal research papers. Implemented the strategies, like Piotroski F Score and Momentum Trading Strategy, and backtested them for over 15 years' data with positive returns.
I conducted various test to ascertain the results which included downside risk measures like Value at Risk, CVaR, Semideviation, Sharpe and Sortino Ratios. I also developed pipelines which automated the process of investing and testing on past data using Python.
Developed and deployed a machine learning tool for the Union Bank of India which allotted risk scores to customers based on past customer data. The bank used the model to decide which customers would receive a loan. My risk allocation model brought down the customer default rate by 11% on 2 years of testing data.
Assisted ongoing research in the field of financial machine learning to help write a paper which was accepted in The Financial Review.
Worked on a project by the Botnar Foundation to capture and understand what young people (age 10-24 yrs) today think about topics like their future, aspirations, concerns, and challenges they face, etc.
Collaborated with a team of 50 Machine Learning engineers to develop tools to analyze and understand the sentiments and aspirations of young people and performed a temporal analysis to understand how the sentiments have been changing over time, especially due to the Covid-19 Pandemic.
I was also the task manager of the Reddit team which worked on Crowdsourcing data from the popular social media website. My team collected and processed over 30000 posts along with comments in three languages.
The project host, Fondation Botnar, is a Swiss-based foundation that champions the use of digital and AI to improve the health and wellbeing of children and young people globally. They used our work to create counselling services and mental well-being indicators for young people in the European Union.
I like to write about data science, machine learning and finance. I document personal experiences and projects. I write for organizations like Towards Data Science, The Startup, Level up Coding and Analytics Vidhya. My main goal is to break down technical content for non-technical audiences.
Building and deploying a multipage Python webapp in a few simple steps.
Technical and Non-Technical books that will help you a become better data scientist.
Discussing a fix to create a Django form which can have a dynamic number of input fields.
Discussing the new features and updates in Python 3.11 and how to install the 3.11 Alpha version.
A summer fellowship for people looking to make a positive change in society through data science
Efficient and Quick Text Analysis Tool built using Streamlit including Text Summarization, POS Tagging and Named Entity Recognition.
Building your first Multipage Streamlit application and deploying it. The prerequisite is knowing the basics of Python and Streamlit.
Implementing Semideviation, VaR and CVaR risk estimation strategies in Python. Downside risk is when the returns go lower than the buy price and how to estimate them.
One of the key factors involved in asset and portfolio management is accurately assessing the risk involved in your investment. Implementing a stock market risk analysis strategy in Python.
Machine learning interpretability is a topic of growing importance in this field. Discussin an aspect of machine learning interpretability — feature importance and a novel approach called Shapley Additives.
An end-to-end machine learning project series. Text Analysis and Model Building.
An end-to-end machine learning project series. Problem Definition and Data Collection from Reddit.
An iterative approach to predicting which hand will win the match. An unconventional solution to a conventional problem.
The actvities I engage in to promote learning and skill development within my peers.
Part 1: Quantitative Trading Part 2: AI Algorithms in Trading