Kavitha Rajan
Profile
Data Scientist with 5 years building ML pipelines and NLP models in Python for fintech and retail domains. Deployed a credit-risk model that reduced bad-loan approvals 18% (₹6.4 Cr savings in first year) at HDFC Bank. Expert in scikit-learn, XGBoost, and PySpark; M.Tech from IIT Hyderabad.
Experience
- Developed an XGBoost-based credit-risk scoring model on 4.2 M applicant records; ROC-AUC improved from 0.71 to 0.84, cutting bad-loan approvals 18% and saving ₹6.4 Cr in Year 1.
- Built a real-time transaction fraud-detection pipeline on Kafka + PySpark processing 120,000 events/min; false-positive rate dropped 31%, reducing manual review load by 2,400 analyst-hours/quarter.
- Designed A/B testing framework for credit-offer personalisation; winning model variant lifted acceptance rate 14 percentage points across 900,000 customers.
- Mentored 2 junior data scientists through end-to-end model delivery; both shipped production models within 4 months.
- Built a fashion recommendation engine (collaborative filtering + content embeddings) that raised click-through rate 23% and contributed ₹1.8 Cr incremental GMV in the first post-launch quarter.
- Trained BERT-based size-recommendation NLP model on 500,000 customer reviews; size-related return rate dropped 11% within two months of deployment.
- Automated weekly demand-forecasting pipeline (ARIMA + LGBM ensemble) for 2,300 SKUs, reducing overstock by 8% and cutting manual analyst effort from 12 hours to 45 minutes per week.
Education
Skills
Certifications & Competitions
- Google Professional Machine Learning Engineer — certified 2023
- Kaggle — Competition Expert; top 4% in IEEE-CIS Fraud Detection (3,748 teams)
- AWS Certified Data Analytics – Specialty (2022)