Hello There! I'm
Dayita Chaudhuri

Hi there! Welcome to my page!

I am currently pursuing M.Tech. in Computation and Data Science at IISc Banglore. I completed my graduation in Computer Science and Engineering from NIT Agartala with an 9.43 CGPA. I have 8+ months of experience in software development and have worked with some of the top tech companies of the industry.

I am a highly dedicated and results-oriented individual with strong work ethics and a passion for learning and innovation. I am interested in the field of Natural Language Processing, Deep Learning and Data Science. My ultimate goal is to utilise my experience and skillset to make a positive impact in the world of technology. I am open to collaborations and all kinds of research opportunities.

If you feel that you have something mutually beneficial for us, please ping me through any of my socials or drop me a mail at dayitac@iisc.ac.in or dayitachaudhuri.tech@gmail.com.

Research Experience
Academic Research & Innovation

Currently pursuing MTech in Computational and Data Science from IISc Bengaluru, I am actively engaged in cutting-edge research in Natural Language Processing and Machine Learning.

Natural Language Processing Lab
MTech Research Student

December 2024 - Present

Research Focus: Advanced NLP techniques for multilingual text processing

Professional Experience
Industry Expertise & Achievements

With 8+ months of hands-on experience in software development and data science, I have worked with leading tech companies to deliver innovative solutions.
My experience spans full-stack development, machine learning implementation, and scalable system design.

CRED
Data Scientist Intern

May 2025 - August 2025

Key Responsibilities: Designed a SMS classifier and event extraction service using text embeddings, hierarchial XGBoost classification and NER pipeline, achievening 95% accuracy in classifying SMSs into 20+ categories and extracting relevant financial events from user data. Extracted user's credit lines and related financial events with 85% Recall and 90% Precision across defaults and creation of new credit lines for users with available data, ahead of Bureau report. lending-related decisions.

Technologies: Python, SQL, Spark, AWS (EC2, Lambda, S3, SQS), Sentence Transformers, Sentence Embeddings and Machine Learning Libraries.

Impact: Enabled the prediction of potential borrower risk and accelerated critical lending-related decisions.

Amazon
Software Development Intern

January 2024 - June 2024

Key Responsibilities: Developed a shipping cost allocation tracking tool for the TFS team that collects partial allocations across services and displays in a common interface. This reduced the manual effort in cost component querying by up to 80% and optimised the time required for on-call engineers to detect cost allocation anomalies and trace the root cause of errors by up to 90%. This tool streamlined the process by automating cost tracking across various upstream services and the final allocated cost.

Tech Stack: Java, Python, SQL, Spark, AWS (EC2, S3, Lambda, DynamDB, SQS, SNS), REST API and Frontend Frameworks

Impact: Streamlined the process of error debugging across owned services and reduced the time to identify and resolve issues.

Google
Application Engineering Intern

May 2024 - August 2024

Project Highlights: Architectured and implemented an end-to-end module that streamlined goal management for automated monitoring of supply chain data, ensuring real-time responsiveness to critical metrics.

Tech Stack: Java, Python, REST API, AngularJS, NodeJS.

Impact: Made goal management easier and faster for the team.

Microsoft Engage
Program Mentee

April 2022 - May 2022

Program Focus: Received intensive mentorship in software engineering best practices from Microsoft engineers. Developed a recommendation engine that implements Content-Based Filtering and Collaborative Filtering using using KNN Clustering and Similarity Searching to learn the fine interactions between users and movies. User Based Model achieved RMSE of 1.5 and demonstrated 50% more efficiency and better recommendations compared to Item Based Model which had RMSE of 2.5.

Skills Developed: Python, Streamlit, Similarity Search, KNN Clustering, Machine Learning Libraries.

Outcome: Successfully completed program with a return offer.

Technical Expertise Summary
Programming Languages

Python, Java, C++, SQL

Frameworks & Tools

TensorFlow, PyTorch, Spark

Cloud & DevOps

AWS, Google Cloud, Docker, Kubernetes, CI/CD

Experience
Course Work and Certifications

I have gone through a diverse spectrum of educational pursuits, from comprehensive coursework in the field of computer science and artficial intelligence to certified achievements validating proficiency in the latest technologies and methodologies. My aim has always been to stay ahead of market trends.

Coursework

  • MTech CDS
    • DS215 : Introduction to Data Science
    • DS288 : Numerical Methods
    • DS261 : Artificial Inteligence for Medical Image Analysis
    • DS284 : Numerical Linear Algebra
    • DS221 : Introduction to Scalable Systems
    • DS211 : Numerical Optimisation
    • DS207 : Natural Language Processing
    • E9205 : Machine Learning for Signal Processing
    • DS295 : Parallel Programming
  • BTech CSE
    • UCS04P15, UCS03B08: Data Structures and Algorithms
    • UCS07B92, UCS06B28: Machine Learning and Artificial Intelligence
    • UCS05B14: Computer Architecture and Organisation
    • UCS05B16: Database Management Systems
    • UCS06B28: Formal Language and Automata Theory
    • UCS05B15: Operating Systems
    • UCS04B10: Object Oriented Programming
    • UCS07E16: Cloud Computing
    • UCS06B29: Compiler Design
    • UCS06E07, UCS05E06, UCS06E07: Computer Networks, Cryptography and Network Security
    • UCS04B09: Microprocessors and Microcontrollers
    • UCS07E17: Information Retrieval

Certifications

  • Software Engineering Virtual Experience - JP Morgan and Chase (September 2022)
  • Cybersecurity Essentials - Cisco (September 2022)
  • Virtual Engineering Programme - Goldman Sachs (September 2022)
  • Amazon Machine Learning Summer School (July 2022)
  • 30 Days of Google Cloud (October 2021)
  • Web Development Bootcamp - Udemy (October 2021)
  • NPTEL: Programming, Data Structures and Algorithms in Python - IIT Madras (September 2021)
Portfolio
Check My Existing Works

I have had to fortune of working on a diverse array of projects, from crafting intelligent machine learning models to creating robust web applications. These ventures have been great learning experiences. To explore my work, please visit my GitHub profile.