Can we detect Retinal Disorders in OCT images using CNNs?
Challenges: Occasional poor image quality, differing image heights and widths, unbalanced classes
Methods Applied: Filters to reduce noise, fixed image pixel sizes, balancing classes, and training with blocks of 1000 images at once
Utilities: Python, Keras, Tensorflow 2.0, OpenCV, CNNs, Pretrained and custom models
Result: Achieved a 97% testing accuracy on average with 5-fold cross validation with pretrained models on a class-balanced dataset. Training time reduced by 80% compared to a traditional non-pretrained CNN modeling approach.
Given a user's movie preference profile, can we recommend movies?
Challenges: Distributed data, Messy data, Imbalanced categorical variables
Methods Applied: Database joins; Data cleaning and splitting into clean variables; One Hot encoding
Utilities: Python, Pandas, Seaborn, Content-based filtering, Collaborative-filtering
Result: Able to generate the top 'X' movies suggestions based on user profile and ratings, least 'X' recommended movies, Most popular 'X' movies lists
Can we track Tweets associated with tags/keywords to do real-time sentiment analysis?
Challenges: Problem specific data collection, continuous data-stream from tweets, user perception of product or people or brand
Methods Applied: Data collection from Twitter feed with keyword matching for the brand or products, real-time tweet cleaning, storing into an SQL database, real-time sentiment analysis, translation of non-english tweets to english, live interactive dashboard template with "Dash" to create a web applet to stream the data and the sentiments associated with each tweet
Utilities: Python, Tweepy, Textblob, sqlite3, json, twitterstream, NLTK, SentimentAnalysis, Polarity scoring, Subjectivity
Result: Demonstrated an ability to gather, parse, clean, and understand the real-time sentiments associated with League of Legends new champion launches
Can we track, analyze, and auto-invest on Robinhood?
Challenges: Accessing real-time stock information, automating purchase decisions to maximize profit
Methods Applied: Webcrawler to gather near real-time stock info from Yahoo.com, Financial analysis to generate candlestick visualizations, short-term, and long-term and golden-cross indicators for predicting profitability of each stock
Utilities: Python, Robin_stacks, numpy, ta, tradingstats, dash, plotly, pandas
Result: Automated the process to track, analyze, and automate Robinhood stock portfolio management with python that can make guided purchase decisions to reduce time investment and improve purchase decisions for deriving profits.
Publications (1 published, 1 in press, 2 in prep)
Data Science projects and counting (check my github repo)
Years of Work Experiences spanning healthcare, human behavior, and data science
About me
My Work Experiences
I work on a variety of data science projects that utilize machine learning, AI, and statistical modeling applied to learning and behavioral predictions, user segmentation, computer vision in healthcare, risk analysis, and sentiment analysis
Back to top
I conduct research and create end-to-end research solutions to understand learning and human behavior in a variety of contexts. I am motivated by the work geared towards optimization, inclusion, and personalization of user experiences across domains -- technology, entertainment, healthcare, and learning.
Back to top
I have 3+ years of clinical experience as an Optometrist in India. My current research and future endeavors are tied to my experiences as a clinician. I believe that every individual should recieve a personalized experience when interacting with technology, entertainment, and everyday utilities. I want to understand the needs of the next billion and promote equity, inclusion, and universal design through my work.
Back to top