Over 6 plus years of professional experience in Data Science, with sound knowledge in concepts related to Machine Learning, Data Mining and Statistical Analysis.
- Expertise in transforming business requirements into analytical models, designing algorithms, building models, developing data mining and reporting solutions that scale across a massive volume of structured and unstructured data. Machine learning algorithms. Python 3.5 (NumPy, Pandas, Matplotlib and Sci-kit-learn), Decision Tree, Random Forest, Naïve Bayes, Logistic Regression, Linear Regression, Multiple regression, Cluster Analysis, Neural Networks, KNN, SVM, k-means from Mathematical perspective.
- Experience in implementing data analysis with various analytic tools, such as Anaconda 4.0 Jupiter Notebook 4.X, R (ggplot2, Caret, dplyr) and Excel.
- Involved in the entire data science project life cycle and actively involved in all the phases including data extraction, data cleaning, statistical modelling, and data visualization with large data sets of structured and unstructured data.
- Skilled in Advanced Regression Modelling, Multivariate Analysis, Model Building, Business Intelligence tools and application of Statistical Concepts.
- Ability to write and optimize diverse SQL queries, working knowledge of RDBMS like SQL server 2008, NoSQL databases like MongoDB 3.2.
- Experience with ELK (Elasticsearch) stack technologies
- Extensive experience in Text Analytics, Statistical Machine learning and Data Mining, providing solutions to various business problems and generating data visualizations using R, Python.
- Performed Information Extraction using NLP algorithms coupled with Deep learning (ANN and CNN, RNN, LSTM, Encoders, Embedders), Keras and Tensor Flow.
- Trained convolutional neural network using Tenser flow, to collect image features from images of items provided by each seller.
- Good nowledge in Computer vision and ROS development.
- Good knowledge on use of Microservices FOR developing a service-oriented architecture.
- Trained and tested various object detection models.
- Excellent understanding of SDLC, Agile and Scrum development methodology.
- Have good experience, working with ARIMA parametric time series models.
- Created customs reports and dashboards using simple drag and drop functionality by use of Sageworks.
- Assist in creation of forecasting tools and models using mathematical and statistical techniques, to determine continuous improvement opportunities.
- Experience in tools and concepts such as Hadoop, MapReduce, Spark 1.6, PySpark, SparkSQL, HDFS, Hive 1.X., related to Big Data.
- Extracted data from HDFS and prepared data for exploratory analysis through data munging.
- Proficient in Predictive Modelling, methods related to Data Mining, Factor Analysis, ANOVA, Hypothetical Testing, Normal Distributions and other advanced statistical and econometric techniques.
- Worked as a member of an experienced optimization team to design creative variants to be A/B tested. Focused on developing, improving, and testing the subscriptions landing pages as well as the crosswords page.
- Hands on experience and in provisioning virtual clusters under Amazon Web Service (AWS) cloud which includes services like Elastic compute cloud (EC2), S3, and EMR.
- Performing Map Reduce jobs in Hadoop and implemented spark analysis using Python, for performing machine learning & predictive analytics on AWS platform.
- Experienced in Data Integration, Data Validation and Data Quality control for ETL process and Data Warehousing, using MS Visual Studio SSIS, SSAS, SSRS.
- Worked with complex applications such as MATLAB and SPSS to develop neural networks and performed cluster analysis.
- Strong experience and good knowledge in data visualization using tools such as Tableau for creating line and scatterplots, Bar-charts, Histograms, Pie-chart, Dot-charts, Boxplots, Timeseries, Error Bars, Multiple Charts types, Multiple Axes, subplots etc.
- Automated recurring reports using SQL and Python and visualized them on BI platform like Tableau.
- Experience in visualization tools like Tableau 9.X, 10.X for creating dashboards.
- Used version control tools like Git 2.X and VM.
- Passionate about gleaning insightful information from massive data assets and developing a culture of sound, data-driven decision making.
- Experience in Visual Basic for Applications and VB programming languages to work with developing applications.
- Taking responsibility for technical problem solving, creatively meeting product objectives and developing best practices
- Excellent communication skills (verbal and written) to communicate with clients and team prepare + deliver effective presentations.
- Ability to maintain a fun, casual, professional and productive team atmosphere.