Resume
Experience

2022 - Present
Project Manager
Data Speciality: Real World Data; i.e. CRM Text, Medical Inquiries
- Spearhead the development of a GPT-4 RAG engine to generate relevant insights for KOLs, enhancing product and marketing strategies.
- Manage the product life cycle in a GxP-compliant, agile manner.
- Collaborate with cross-functional teams to define project scope, objectives, and deliverables.
Lead Data Scientist
Data Speciality: Real World Data; i.e. Raw Text, Electronic Health Records, Insurance Claims and Patient Data
- Built an analytics platform in Power BI to derive real-world evidence and hypotheses from the characteristics and treatment journeys of patients in a broad cohort using the LightGBM model
- Built an ML model using G-estimation and SHAP to compare the performance of biologics and determine the efficacy of a treatment
- Created a standard for the digital release process of data products, in alignment with the quality and compliance team
- Spearheaded the establishment of OpenShift deployment for our web-based products
- Led a team of junior data scientists and analysts, fostering a collaborative and innovative environment
Data Scientist
Data Speciality: Real World Data; i.e. CRM free text, Social Media Data
- Developed an automated compliance engine for CRM free text using natural language processing (NLP) models.
- Created a reusable machine learning component in Python for anonymization, summarization, language detection, and translation.
- Facilitated the development of a COVID-19 social listening dashboard to analyze public sentiment about the vaccine from tweets.

2021 - 2022
Data Scientist
Data Speciality: Text, Drug Trial Data
- Developed a custom Streamlit application that uses ensemble learning to identify target areas as potential investment opportunities
- Built an NLP text-analysis tool in Python to analyze research papers for better lead target identification

Aug 2020 - Dec 2020
Data Scientist (Contract Role)
Data Speciality: Customer Data
• Streamlined the Customer Care department by designing an ML model that predicts the most likely malfunctioning part of a boat from vectorised customer complaints

January 2020 - June 2020
Data Scientist (Co-op)
Data Speciality: Clinical Trial Data, Drug Data
- Developed a machine learning model in Python using XGBoost to predict the feasibility of various pharmaceutical compounds for preclinical trials
Education

2018 - 2020
Master of Science in Data Science
Northeastern University
Boston, MA
Coursework - Machine Learning, Big Data, Data Mining and Processing, Cloud Computing, Introduction to AI
Achievement - Awarded bonus credits in the Machine Learning coursework for the best project; recipient of a merit-based academic scholarship

2014 - 2018
Bachelor of Science in Computer Science
SRM University
Chennai, Tamil Nadu
Coursework - Big Data with Hadoop and R, Data Analysis and Algorithms, Java, Python, C++, Data Structures, Database Management System, Introduction to Machine Learning and Artificial Intelligence
Achievement - Recipient of a merit-based academic scholarship every year for consistently high scores; valedictorian of my batch
Professional Skillset
Strategic Planning
Machine Learning
Predictive Modeling
Statistical Analysis
LLM, Generative AI
Programming Languages
Python (advanced) - NumPy, pandas, scikit-learn, TensorFlow, Keras, Seaborn, Matplotlib, and more.
AWS - S3, Claude
SQL (proficient)
R (proficient)
GPT-4
Pinecone
Development Tools
Jupyter Notebook
VS Code
Git
DevOps - Argo CD and OpenShift
Docker
Atlassian Suite - Confluence and JIRA