Summary Education Certifications: Technical Skills Academic Projects ...

Report 10 Downloads 109 Views
Dallas, TEXAS-76010 (682)582-6400 [email protected]

Karthik Elangovan

http://www.linkedin.com/in/elangovankarthik https://github.com/elonProfile Portfolio

Summary Experience in Machine learning data analysis, data wrangling, database management, project planning, and implemented ETL in several projects. Excellence in statistic and data visualization with data science Nano degree from udacity and master’s degree in Industrial Engineering.

Education

Udacity- Data Analyst Nanodegree The University of Texas at Arlington - Masters of Science in Industrial Engineering Anna University, Tamil Nadu, India –Bachelor of Technology, Mechanical Engineering

Jun.2015- May. 2016 Jan.2014- Dec. 2015 May.2009- Mar. 2013

Certifications:

Excel for Data Analysis and Visualization certified from Microsoft. Quality Engineering and Management certified from Technische Universität München. Supply chain and logistic certified from Massachusetts Institute of Technology.

Technical Skills Languages: Python, R,SQL,No-SQL,Javascrip,D3. IDE: Anaconda, Spyder, IPython Notebook, R-Studio, Radio. Databases / Documentation: MongoDB, Infomatica, MS Word, MS Excel, MS Project Management. Python statistics library: Pandas, Numpy, Matplotlib, Scikit-learn, ggplot.

Academic Projects

 Investigated the Enron Employees using Machine Learning to find who have committed fraud based on public Enron financial and email data. Various algorithms such as Naïve Bayes, Decision tree, PCA, SVM was applied using Sklean Machin Learning package.  Optimized and validated Machine learning algorithms with a precision .60 and recall .30 and cross validated using kfold.  Prepared open street map for the city of Chicago and Chennai using Data Wrangling tools in Python to parsing data from different file formats like json, xml, csv, etc and assess the quality of the data for validation, accuracy, consistency and uniformity.  Stored the cleaned data into MongoDB and run MongoDB No-SQL query’s and aggregate data for future analysis.  Developed regression model and investigated inference statistics with 95% confidence interval using Python to answer "do more or few people ride the NYC subway when it is raining?”  Developed prediction model using ordinary least squares using Python’s Numpy, Pandas, Statsmodels to find various factors such as weather, time of travel etc. influencing NYC subway ridership to make subway more commuter friendly and help bring down operation cost.  Build interactive graphs to tell a story on how the tourist incoming boomed in India till 2014 using D3 and JavaScript with gulp build tools.  Designed an A/B test, including which metrics to measure and how long the test should be run. I also analyzed the result of an A/B test that was run by Udacity, recommended a decision, and proposed a follow-up experiment.  Investigate hypothesis and generated descriptive statistics outcomes for Stroop Effect using Python to describe qualities of sample.  Analyzed inferences from Stroop experiment samples and draw conclusion based on results with 95% confidence interval.  Prepared exploratory data analysis on the quality of red wine sample using R. Generated HTML doc. Via R markdown to explore the relationship using univariate, bi-variate and multi-variate plots.

Professional Experience Efftronics System Pvt, Ltd., Operation Analysis, India Jan. 2013- Dec. 2013  Developed vendor database network to increase 20% more potential suppliers based on company strategy and cost priority in Asian market.  Prepared purchase orders; generated reports on power plant projects using SAP (Warehouse Management System model); and worked closely with purchase department to implement LEAN technology.  Prepared spend data of various modes and trend analysis to support the annual reports using MS Excel.  Created Informatica Mapping, Session and workflow as per technical specification.  Tuning the ETL-Informatica code in Mapping level and session level.

Leadership and Involvement Kaggle – Ranked 1072 in Home depot product searched relevance.