About

I'm a graduate student at the University of Illinois Urbana-Champaign, pursuing an MS in Information Management on the Data Science track. Coming from a computer science background, I've spent years experimenting with data, algorithms and computational capabilities. While I've worked with Fortune 500 clientele solving real-world problems using data science, I'm also an avid research enthusiast, exploring computational creativity extensively in the domain of natural language processing.

Data Visualization

Visualizations and dashboards using Python, R, Tableau and Power BI.

Analytics

Data deduplication, root cause/trend analysis, customer segmentation, market basket analysis, recommendation systems.

Databases

Data wrangling to maintain and retrieve data from structured and unstructured databases using advanced queries and query automation.

Machine Learning

Statistics, probability, time series analysis, predictive modeling, model training and fine-tuning.

Natural Language Processing

Text classification, summarization, question answering, named entity recognition, automatic speech recognition.

Business Intelligence

Quantitative analysis, monitoring key performance indicators, dashboards, reports, business resource documents.

Academics

University of Illinois Urbana-Champaign

MS Information Management, Data Science & Analytics specialization

August 2022 - May 2024
CGPA - 4.0/4.0

Relevant coursework - Data, Statistical Models and Information, Data Warehousing and Business Intelligence, Database Administration and Scaling, NLP Research

Manipal University Jaipur

BS Computer Science

August 2017 - June 2021
CGPA - 3.7/4.0

Relevant coursework - Data Science, Database Management, Data Structures, Python Programming, Algorithms, Machine Learning, Hybrid Soft Computing Technologies.

Work Experience

1+ years of experience in data science, analytics and business intelligence.

National Center for Supercomputing Applications

Graduate Research Assistant | Aug 2022 - Present

- Deployed an AI-based teaching assistant chatbot via a multimodal question-answering dialogue system leveraging large language models such as GPT-3, OPT-175B and FLAN-T5 in Hugging Face.
- Fine-tuned models on a custom dataset generated using prompt engineering and synthetic data generation methods in GPT-3.
- Implemented an information retrieval pipeline using the Contriever model, with filtering and ranking by an MS MARCO model, to obtain the best answer to a user's question using complex API queries.

TransOrg Analytics

Data Analyst (Consulting) | Oct 2021 - July 2022

- Deployed an automatic speech recognition model using AWS services with multi-speaker detection for a robotic process automation software.
- Built analytics solutions for strategy management of 1K+ branches of Fortune 500 financial institutions, including risk segmentation, collections modeling, trend analysis and branch performance evaluation.
- Liaised between product, risk and development teams to establish a business rules engine for loan approvals; led requirements gathering, prepared business resource documents and conducted unit testing and user acceptance testing (UAT).
- Performed deduplication and customer segmentation on 6TB of point-of-sale data to increase a luxury hospitality chain’s loyalty campaign conversion rate by 15%.

Manipal University Jaipur

Research Intern | Jan 2021 - July 2021

- Conducted literature reviews and comparative studies of various core machine learning algorithms to evaluate performance via F1 scores, AUC plots and accuracy.
- Collaborated with professors and documented findings to present at 2 international conferences and journals.

Ezee Housing

Business Analyst Intern | July 2020 - Sept 2020

Enhanced sales analytics, performed lead segmentation in Python based on website traffic and enquiries, built a BI dashboard in Tableau to monitor various real estate properties and conducted a competitive analysis of the pricing module.

Ege University, Turkey

Data Science Intern | July 2019 - Sept 2019

Researched the prediction of drug-food interactions to improve drug absorption for patients, using linked open data and machine learning algorithms with scikit-learn and the deep-DDI framework.

Projects

Scroll right! View my data analytics, machine learning and natural language processing projects and publications.

Get in touch