Hello! I'm Sara
I'm passionate about all things Data Science! My journey has been fueled by a love for uncovering insights hidden in data. I enjoy working in Data Analysis, Machine Learning, and Natural Language Processing, finding joy in unraveling complex information puzzles.
Right now, I'm on the lookout for a challenging role that lets me use my tech skills to solve real-world problems in a team where collaboration is key. I love being in environments where different minds come together to create something truly innovative and impactful.
Phone:
EXPERIENCE
Oct. 2023 - Present
Data Analysis and Visualization Intern
TREVOR WELTMAN CONSULTING LTD.
- Developing correlation models to quantify the relationship between productivity and accounts receivable for a pediatric insurance billing organization.
- Designing Tableau visualization dashboards for pattern recognition and trend analysis in productivity datasets.
Jun. 2023 - Sept. 2023
Machine Learning Engineer Intern
KINETIX TRADING SOLUTIONS INC.
- Spearheaded the end-to-end development, debugging, and testing of a Large Language Model-based application on MS Azure that extracts complex text entities from unstructured financial documents.
- Evaluated performance metrics, reduced model time complexity by 75%, and obtained an F1-score of 84%.
- Built post-processing Python regex scripts for HTML manipulation in the main application pipeline.
- Used Agile and MLOps tools such as JIRA, Confluence, and Postman for model deployment and monitoring.
Dec. 2020 - Mar. 2021
Data Science Intern
RELIANCE INDUSTRIES LTD.
- Developed 23 REST APIs for the knowledge graph of an intelligent cloud-based voice assistant.
- Implemented gRPC and Protocol Buffers for connecting microservices and serializing structured data.
- Developed and optimized queries for knowledge extraction from ArangoDB and performed version control using Git.
EDUCATION
2021 - 2023
UNIVERSITY OF PENNSYLVANIA
Master of Science in Engineering
- Data Science
Relevant Courses - Machine Learning, Statistics for Data Science, Computational Linguistics and Natural Language Processing, Big Data Analytics, Deep Learning, Artificial Intelligence, Databases and Information Systems, Ethical Algorithm Design, Advanced Machine Learning
2016 - 2020
UNIVERSITY OF MUMBAI
Bachelor of Engineering
- Information Technology
Relevant Courses - Applied Mathematics, Structured Programming using C, Logic Design, Data Structures and Analysis, Database Management Systems, SQL, Java Programming, Operating Systems, Python Programming, Advanced Data Management Technology, Advanced Data Structures and Analysis of Algorithms, IOT, Data Mining and Business Intelligence, Cloud Computing and Services, Software Design, Artificial Intelligence, Big Data Analytics
PROJECTS
SKILLS
Machine Learning Algorithms
Classification, Regression, Clustering, Reinforcement Learning, Optimization
Cloud Computing
Microsoft Azure, AWS, Google Cloud Platform
Big Data Analytics & Visualization
Tableau, Power BI
Statistical Methods
A/B Testing, Hypothesis Testing, ANOVA
SQL and NoSQL Databases
MySQL, PostgreSQL, MongoDB, ArangoDB, Neo4j
Programming Languages
Python, R, SQL, JavaScript
ACHIEVEMENTS
Publications
- “Optimizing Network Intrusion Detection using Machine Learning” (ICDATA 2020 - Springer)
- “Smart Car Parking System using Wireless Sensor Networks” (ICISC 2020 - IEEE)
- “Detecting the Presence of Pneumonia using Machine Learning” (Medium)
- “From Data to Decisions: Ethical Reflections on the Adoption of AI in Modern Medicine”
Certifications
Awards
- Award for Academic Excellence - Nov 2020
- Gold Medal in Information Technology - Nov 2020