Hello! I'm Sara
As a Data Analysis and Visualization Intern at Clickable Impact Consulting Group., I performed data extraction and integration of time-series data from diverse sources. I also conducted correlation analysis to identify features and increase operational efficiency. It has given me an opportunity to apply my strong background in Data Science, gained during my Master's Degree at the University of Pennsylvania.
I am looking for full-time opportunities where I can build creative data solutions for real-world challenges. I am a detail-oriented individual with strong analytical skills and an innovative approach to problem-solving. I am skilled at Python, SQL, Statistical Analysis, Data Visualization and Software Development. I am a self-motivated worker with excellent communication and presentation skills, flexibility to adapt, and strong interest in increasing my technical expertise while contributing to the industry. Let's connect and use data to tell a story!
Phone:
EXPERIENCE
Oct. 2023 - Jun. 2024
Data Analysis and Visualization Intern
TREVOR WELTMAN CONSULTING LTD.
-
Orchestrated the extraction and integration of time-series employee data with financial data of 7 medical practices.
-
Developed a model for measuring employee productivity using over 15 quantified features of the preprocessed dataset.
-
Performed in-depth correlation analysis to evaluate the relationship between features, leading to a 15% increase in operational efficiency through strategic prescriptive modeling.
-
Produced dynamic visualizations and reports, quantifying the impact of productivity on business outcomes.
Jun. 2023 - Sept. 2023
KINETIX TRADING SOLUTIONS INC.
Machine Learning Engineer Intern
-
Led the development and testing of a Large Language Model (LLM)-powered application focused on entity extraction from financial documents by utilizing OpenAI API, BeautifulSoup, LangChain and Asynchronous I/O.
-
Strategically optimized model runtime, reducing it by 75% while securing an F1-score of 84%.
-
Built and debugged post-processing Python scripts for HTML manipulation, deployed in the main application release.
Dec. 2020 - Mar. 2021
Data Science Intern
RELIANCE INDUSTRIES LTD.
-
Architected 23+ core APIs for an AI voice assistant’s knowledge graph using gRPC and ProtoBuf, enhancing data interchange efficiency.
-
Engineered and optimized queries in Arango DB, leveraging its graph database capabilities for superior performance.
-
Implemented containerization through Docker for seamless deployment and scalability.
EDUCATION
2021 - 2023
UNIVERSITY OF PENNSYLVANIA
Master of Science in Enineering
- Data Science
Relevant Courses - Machine Learning, Statistics for Data Science, Computational Linguistics and Natural Language Processing, Big Data Analytics, Deep Learning, Artificial Intelligence, Databases and Information Systems, Ethical Algorithm Design, Advanced Machine Learning
2016 - 2020
Bachelor of Engineering
- Information Technology
UNIVERSITY OF MUMBAI
Relevant Courses - Applied Mathematics, Structured Programming using C, Logic Design, Data Structures and Analysis, Database Management Systems, SQL, Java Programming, Operating Systems, Python Programming, Advanced Data Management Technology, Advanced Data Structures and Analysis of Algorithms, IOT, Data Mining and Business Intelligence, Cloud Computing and Services, Software Design, Artificial Intelligence, Big Data Analytics
PROJECTS
SKILLS
Machine Learning Algorithms
Classification, Regression, Clustering, Reinforcement Learning, Optimization
Cloud Computing
Microsoft Azure (OpenAI, ML Studio), AWS (RDS, SageMaker, Amazon RedShift), GCP (Google Cloud Storage, Cloud Function, Composer, BigQuery, Looker), Snowflake, Databricks Lakehouse
Tableau, Power BI, Microsoft Excel
Big Data Analytics & Visualization
Statistical Methods
Hypothesis Testing, Regression Analysis, A/B Testing
MySQL, PostgreSQL, NoSQL (MongoDB, Neo4j), ArangoDB
SQL and NoSQL Databases
SQL, Python, R, JavaScript, Java (Spring Boot framework)
Programming Languages
ACHIEVEMENTS
Publications
-
“Optimizing Network Intrusion Detection using Machine Learning” (ICDATA 2020 - Springer)
-
“Smart Car Parking System using Wireless Sensor Networks” (ICISC 2020 – IEEE)
-
“Detecting the Presence of Pneumonia using Machine Learning” (Medium)
-
From Data to Decisions: Ethical Reflections on the Adoption of AI in Modern Medicine
Certifications
Awards
-
Award for Academic Excellence - Nov 2020
-
Gold Medal in Information Technology - Nov 2020