CV
Summary
Aspiring data scientist with strong technical skills in software engineering.
SKILLS
Programming Languages / Technologies:
Java; R, Python (scikit-learn, pandas, nltk)
Large-Scale Data Processing:
Apache Spark, Apache Hadoop
Theory:
Data Mining, Machine Learning; Probability and Statistics; Natural Language Processing
PROFESSIONAL EXPERIENCE
01.2017 — Present
EdLab Teachers College Columbia University, New York, United States
Position: Software Engineering
06.2016 — 08.2016
Argus Information & Advisory Services, New York, United States
Position: Data & Application Solution Intern
Project details:
- Process, analyze and validate credit card data to ensure data integrity by identifying anomalies and suggesting corrective actions.
- Evaluated correlations among statistical data, identifying trends, summarizing findings across clients and products.
- Spearheaded Global Studies Code Repository Application reducing validation time by 900 minutes per month.
- Developed proof of concept for In-house application - built to replace Data Transformation Services packages.
03.2013 — 06.2015
Tata Consultancy Services , Mumbai, India
Position: Oracle ERP Technical Developer
Project details:
- Designed and implemented complex integrations for General Electric (GE P&W) enterprise level architecture.
- Tuned the performance to ensure integrity and security of Oracle Application R12.2.4 in ERP domain.
-Developed parts fulfillment component for the first time in GE history to address major pain point of fulfillments.
- Initiated and led process improvement ideations generating savings of USD 50,000 annually + 75 minute wait time.
Technologies:
- Oracle E-Business Suite R12 / 11i, PL/SQL, Informatica, Oracle Application Framework (OAF)
EDUCATION
2015 — 2017, Stevens Institute of Technology
Master of Science in Computer Science.
Specialization: Data Mining, Predictive Analytics and Recommendations System.
Current GPA: 4.0 / 4.0
2008 — 2012, Gujarat Technological University
Department of Computer Engineering, C U Shah College of Engineering & Technology
Specialization: Computer Engineering
Cummulative GPA: 3.2 / 4.0
PERSONAL PROJECTS
Netflix For Education
- Building a recommender system on student’s desired area of interest, either gleaned from past registrations history or expressed preferences with an aim to empower students to reach maximum potential.
- Incorporating additional data sources such as MOOC course, YouTube video, Google Scholar papers to produce a broader recommendation set.
Web Server Log Analysis
- Built a log analyzer to analyze the HTTP requests in NASA Kennedy Space Center web server.
- Processed the source of the hosts and failed requests to analyze client behavior and visualize the results.
Real Time Data Clustering
- Developed a web application to demonstrate the real time clustering of data and prediction based on user input.
- Integrated features to download, share and visualize interactive plots of data.
- Implemented Machine Learning models: K-Means Clustering, Generalized Linear Model.
Speech Recognition
- Working on Idea to revolutionize Healthcare Industry. Please drop a note to discuss and colloborate.
Twitter Sentimental Analysis
- Developed scripts for data retrieval and data cleansing of tweets during football matches using twitter API.
- Integrated a model to perform sentiment analysis of tweets.
ACADEMIC PROJECTS
Restaurant Review Classificaiton
- Developed machine learning model to classify restaurant reviews and achieved accuracy of 86.8%.
- Machine Learning Concepts: K-Nearest Neighbor, Logistic Regression, Multinomial Naive Bayes, Voting Classifier.
Web Scraping
- Developed scripts to scrape data of product reviews from e-commerce websites for qualitative and quantitative analysis.
- Implemented scripts using Beautiful soup, Selenium, lxml and compared execution time.
Telling Stories with Tableau
- Developed visualization showing geographic dispersion of NYC Citi bikes throughout the day using Mapbox.
- Created dashboard to represent twitter user habits and engagements over time using Twitter API data and d3.js.
Simplified Search Engine
- Simplified Search Engine using all the words as index terms excluding stop words such as articles, prepositions, and pronouns.
Inventory Management System
- Software that provides complete Maintenance, Replacement and Operational support for ESKAY INDUSTRIES using C#.NET, SQL
ShopAll
- A website which lists products and show promotion scheme
INDEPENDENT COURSEWORK