Akshay
813-203-9929
San Francisco, CA 94101
Data Engineer
13 years experience W2
Summary

  • Roles:
    Senior Data Engineer

    Summary:
    Playing with Big Data has been my passion. That's what I do and that's what I love to do.

    Extensive experience (8.5 years) working with large volumes of data and building scalable data pipelines.
    Passionate about Big Data technologies, with experience in the Hadoop and AWS frameworks.

    Skills:
    Big Data Technologies: Hadoop (HDFS), MapReduce, Hive, Spark, Kafka, AWS Kinesis, Lambda scripts, S3, AWS Glue, DynamoDB
    Programming: Python, Scala, Shell scripting
    Platform: Amazon Web Services
    Database: Redshift, DB2, Oracle, MySQL
    Data Analytics & Business Intelligence: Tableau
    Software Methodology: Agile, Waterfall

Experience
Data Engineer
Information Technology
Oct 2015 - present

--Building data pipelines and ETL processes for all products using the Hadoop framework, Python, and ETL tools.
--Writing Python scripts for ETL jobs.
--Using Sqoop to import and export data between traditional RDBMSs such as Oracle and HDFS.
--Writing Hive queries to fetch data from HDFS and complex SQL queries to fetch data from RDBMSs.
--Writing Scala scripts to migrate data between Hadoop clusters, performing transformations and automating the entire process.

Sqoop, SQL, RDBMS, Python, Oracle, Hadoop, ETL, Data Engineering, Hive, HDFS
Academic Project
Education
Aug 2014 - Apr 2015

Data Mining and Predictive Analysis of Internet Users for a Telecom Company:

  • Performed data analysis using data mining techniques such as neural networks, naive Bayes, and decision trees in SAS and Weka to identify potential Internet customers.

Regression Analysis on eBay Auction Data:

  • Built a linear regression model in R on eBay auction data to identify patterns in price changes. Used different sampling techniques to determine effective samples in the data.

Statistical Data Mining Project on Nielsen Data:

  • Identified the best TV program within each Nielsen-assigned DMA and compared the effect of SANDY on TV viewing, using modelling techniques such as linear regression, non-linear regression, classification trees, and ANOVA tables.

Analysis, Data Analysis, Data Mining, Linear Regression, Neural Networks, Weka, SAS, CL
Technology Analyst
Information Technology
Feb 2012 - Jun 2014

Projects:

  1. Mort Project
  2. Rental Project
  3. Experian Property Database
  4. High Availability Data Services

Roles:

  • Responsible for gathering and analyzing functional requirements and creating high-level and low-level design documents.
  • Manipulated, cleansed, and processed data using Excel, SQL, and QualityStage.
  • Used DataStage to extract, transform, and load (ETL) source data from transaction systems.
  • Created data reports and performed data reconciliation, auditing, and quality monitoring.
  • Designed, developed and deployed new ETL functionality.
  • Proposed to clients the architectural changes to the existing BI data warehouse that new functionality required, and built them.
  • Provided quantitative and qualitative data to business users.
  • Carried out data processing and applied statistical techniques.
  • Worked with business analysts as onshore coordinator to understand requirements, advise on new methods, and suggest improvements.
  • Managed and mentored different module teams simultaneously.
  • Worked directly with the client and used problem-solving skills to win more business.

Data Warehousing, Data Services, SQL, Requirements Gathering, ETL, Business Analysis, Auditing, Analysis, Data Analysis, Analytics, Database Design, DataStage, Microsoft Excel
Software Engineer
Information Technology
Jan 2011 - Feb 2012
  • Developed two ETL applications from scratch, which became a core part of the project's credit risk calculation.
  • Handled issue support, bug fixing, and maintenance for all live applications.
ETL, Software Engineer
System Engineer
Aug 2008 - Jan 2011
Involved in coding, enhancement, and production support of developed applications. Automated the process for one of the applications using DataStage and UNIX scripts.
DataStage, UNIX, Production Support, Systems Engineering
Education
Management Information Systems, General
University of South Florida 2015
Electronics and Telecommunication
Pune Institute of Computer Technology 1978
MIS
not provided
Certifications
IBM InfoSphere DataStage V8.0
IBM Desktop Systems
IBM InfoSphere DataStage V8.5
IBM Desktop Systems
IBM Certified Cognos Report Developer
Certificate in Analytics and Business Intelligence
Skills
ETL | 2021 | 6
DataStage | 2014 | 4
SQL | 2021 | 4
Systems Engineering | 2011 | 3
Analysis | 2015 | 2
Analytics | 2014 | 2
Auditing | 2014 | 2
Business Analysis | 2014 | 2
Data Analysis | 2015 | 2
Data Engineering | 2021 | 2
Data Services | 2014 | 2
Data Warehousing | 2014 | 2
Database Design | 2014 | 2
Hadoop | 2021 | 2
HDFS | 2021 | 2
Hive | 2021 | 2
Microsoft Excel | 2014 | 2
Oracle | 2021 | 2
Production Support | 2011 | 2
Python | 2021 | 2
RDBMS | 2021 | 2
Requirements Gathering | 2014 | 2
Sqoop | 2021 | 2
UNIX | 2011 | 2
Software Engineer | 2012 | 1
Agile Methodology | 0 | 1
AWS | 0 | 1
AWS S3 | 0 | 1
Big Data | 0 | 1
Business Intelligence | 0 | 1
CL | 2015 | 1
Cognos | 0 | 1
Credit/Collections Analysis | 0 | 1
Data Mining | 2015 | 1
DB2 | 0 | 1
Fraud Analysis | 2014 | 1
Kafka | 0 | 1
Linear Regression | 2015 | 1
Linux | 0 | 1
MapReduce | 0 | 1
MySQL | 0 | 1
Neural Networks | 2015 | 1
SAP GRC | 0 | 1
SAS | 2015 | 1
Scripting | 0 | 1
SDLC | 0 | 1
Shell Scripts | 0 | 1
Spark | 0 | 1
Tableau | 0 | 1
Unit Testing | 2014 | 1
UNIX Shell Scripting | 0 | 1
Weka | 2015 | 1
Awards
Insta Award: Performer of the Year in a project