Moe
Moeredshift92@gmail.com
929-589-6467
New York, NY 10001
347-468-0742
HADOOP DEVELOPER
10 years experience W2
0
Recommendations
Average rating
342
Profile views
Summary

  • Big Data Engineer/ Senior developer with several years of professional experience involving hands-on experience working in all phases of development life cycle involving strong Python / Java programming skills
  • Strong experience installing, configuring, and using Hadoop ecosystem components
  • Hadoop MapReduce, HDFS, HBase, Hive, Sqoop, Pig, Zookeeper, Storm, Spark, Kafka and Flume, setting up and integrating Hadoop ecosystem tools
  • HBase, Hive, Pig, Sqoop etc.
  • Experience with machine learning algorithms, involving implementation of multi-stage pipelines for Data pre-processing, feature engineering, and model training. Created data management pipelines, supported data migration efforts from legacy to on-cloud
  • Experience operating data warehouses or data lakes
  • DB2 and AWS redshift. Worked with large data sets using Python and Pyspark to extract data from the S3 buckets. Used Amazon S3 plugin for PyTorch to scale data loaders efficiently for accessing data stored in S3 buckets
  • Worked with Spark ML file formats to create functional programming for researching and designing the data flows.
  • Used Informatica to perform ETL operations such as transformations, filter, joins, and merge on source files, raw files and systems. Implemented the Big Data solution using Hadoop, hive and Informatica to pull/load the data into the HDFS system.
  • Analyzed financial products, issuers, exchange rates, and prices to help facilitate financial transactions involving reference data within the company.

Experience
Education
Master's in Business Administration
University of Wales
Skills
Apache
2021
5
Python
2021
5
Linux
2021
4
AWS
2021
3
Eclipse
2021
3
Hadoop
2021
3
Hadoop Developer
2021
3
HDFS
2021
3
Hive
2021
3
Java
2018
3
MapReduce
2021
3
MySQL
2018
3
Oozie
2021
3
Oracle
2021
3
Pig
2021
3
Spark
2021
3
Sqoop
2021
3
Tableau
2021
3
Ubuntu
2018
3
WebServices
2018
3
.NET
2016
2
ETL
2021
2
Flume
2021
2
Git
2016
2
Groovy
2016
2
JavaScript
2016
2
Jenkins
2016
2
Junit
2016
2
Machine Learning
2021
2
Maven
2016
2
REST
2016
2
Selenium
2016
2
SOAP
2016
2
SVN
2016
2
UI
2016
2
Big Data
2017
1
Data Analysis
2017
1
Data Cleansing
2017
1
Data Integration
2021
1
Data Warehousing
2018
1
Elasticsearch
2018
1
Hbase
2021
1
impala
2017
1
JSON
2018
1
Metadata
2017
1
MongoDB
2021
1
OpenShift
2018
1
Shell Scripts
2018
1
SQL
2017
1
Tableau Desktop
2018
1
Teradata
2021
1
UNIX
2021
1
Data Architecture
0
1
Data Integrity
0
1
Data Modeling
0
1
Data Profiling
0
1
HTML
0
1
J2EE
0
1
JDBC
0
1
jQuery
0
1
JSP
0
1
MS Azure
0
1
node.js
0
1
XML
0
1