Senior Data Scientist
Jan 2023 – Nov 2024
LASH Division, Advanced Data Science and AI Solutions Team | Conshohocken, PA
Worked in end-to-end development of AI projects, mentored team members, and built models for forecasting, recommendation, and adverse event detection using patient and drug data.
- AI Chatbot for on-demand KPI Access (Agentic AI Chatbot):
- Led end-to-end development of AI chatbot for Pfizer patient data enabling real-time KPI access.
- Setup MLOps using CI/CD workflows and monitored performances.
- Saved 400 human working hours, translating to $3M in freed working capital.
Tech: Data Security, OpenAI, Langchain, FastAPI, Azure Web Apps, Azure DevOps, LLMs, Agentic AI.
- Recommending Next Best Item (Recommendation System):
- Developed recommender engine to suggest next best drugs to sell for given pharmas.
- Scaled pipeline to process millions of data records.
- Enabled extra capital of $2M quarterly based on newly recommended products.
Tech: Recommendation Engine, Collaborative Filtering, PySpark, Databricks, Snowflake, Keras, PyTorch.
- Out-of-pocket Cost Prediction (Regression Modelling):
- Developed end-to-end ML system to predict patient out-of-pocket costs for therapy claims.
- Architected feature engineering pipeline processing 10M+ monthly claims.
- Engineered two-part models and quantile regression to handle zero-inflated cost distributions.
Tech: Python, PySpark, Gradient Boosting, Two-Part Models, Quantile Regression, Tweedie Regression, MLflow, Great Expectations.
- Sales Forecasting & Demand Planning (Timeseries Modelling):
- Developed advanced time series model incorporating historical sales, seasonality and promotional activities.
- Reduced stockouts by 15% and improved forecast accuracy by 20%, saving $3M.
Tech: ARIMA, Prophet, LSTM, Pycaret, Greykite, XGBoost, CatBoost, darts, statsmodels, nixtla, TimeGPT, bambi.
- Cross-Functional Dashboard (PowerBI Reports):
- Created executive dashboards for market share, revenue, and operational KPIs.
- Collaborated with internal teams and external vendors to unify data pipelines.