Sign In
Looking for talent?
Check out our hiring section
Login to your account
Remember me?
Login
Forgot password?
Not a user yet?
Click here to register.
LOADING
Select Login
Uploaded File
Balaji
pb_2910@yahoo.com
206-604-2864
13809 NE 11th Street
Bellevue, WA 98005
Big data Developer
15 years experience
W2
0
Recommendations
Average rating
118
Profile views
Summary
9.9 years of experience in IT which includes Analysis, Design and Development of Big Data using Hadoop and SPARK and data bases includes SQLSERVER, My SQL, ORACLE, TERADATA, IMS DB and DB2.
Around 4.5+ years of work experience on Big Data with hands on experience in Hadoop ecosystem components like Hadoop Map reduce, HDFS, Zookeeper, Hive, Hbase, Sqoop, Apache NIFI, Oozie and SPARK, CTRL-M, KAFKA.
Good Understanding of Hadoop architecture and Hands on experience with Hadoop components such as YARN, Job Tracker, Task Tracker, Name Node, Data Node, Application master and Map Reduce concepts and HDFS Framework.
Experience in using Apache Ambari for installation and management of single-node and multi-node Hadoop cluster (Ambari 2.5).
Experience in Data load management, importing & exporting data using SQOOP.
Experience in scheduling and monitoring jobs thru Oozie, Zookeeper and CTRL-M.
Experience in writing Map Reduce programs & UDF's for Hive in java.
Experience in integrating Hive and Hbase for effective operations.
Experience in writing end to end Spark data processing (2.3) as a part of Azure cost usage project.
Experience in Developing Scala program by implementing Spark Streaming by integrating with Kafka and triggering via Nifi.
Having good knowledge in writing scripts using Bash shell in Linux.
Experience in understanding and querying Databases like TERADATA, ORACLE, MYSQL, SQL SERVER and integrating with Hadoop HDFS storage.
Worked on different file formats (ORCFILE, AVRO, TEXTFILE) and different Compression Codecs (GZIP, SNAPPY).
Strong understanding of Data warehouse concepts, ETL, data modeling experience using Normalization, Business Process Analysis, Reengineering, Dimensional Data Modeling, Physical & Logical data modeling.
Experience in JAVA concepts like Oops, collections, Multithreading, JDBC with strong analytical and problem solving skills and ability to follow through with projects from inception to completion.
Utilized Apache Hadoop environment by Hortonworks.
Knowledge in software Development Life Cycle (Requirements Analysis, Design, Development, Testing, Deployment and Support).
Implemented end to end cloud data processing framework by creating Azure hdinsights Spark cluster and Azure ADF pipelines and integrating with SQL server at the source and Azure BLOB storage accounts at the end.
Have good interpersonal communication skills, strong problem solving skills, explore/adopt to new technologies with ease and a good team member.
TECHNICAL PROFICIENCY – BIGDATA
OPERATING SYSTEMS LINUX, UNIX, WINDOWS
HADOOP ECO SYSTEM Hadoop (HDFS and Map-Reduce), YARN, Zookeeper, Hive, Oozie
PROCESSING FRAMEWORK Map Reduce and APACHE SPARK
DATA INGESTION SQOOP, KAFKA
Others Apache NIFI
LANGUAGES/SCRIPT Shell Script(bash), Java, Scala, Python
Cluster Management Hortonworks, Ambari
Operating Systems Unix, Windows, Linux (CentOS, Ubuntu, Redhat)
Development Tools Maven, Jenkins, GITHUB, Gerrit, gitbash
Cloud Technologies AWS EC2, EMR, S3 bucket, Azure Hd insights, Azure data factory, Azure BLOB storage account.
Experience
Edit Skills
Non-cloudteam Skill
Education
Bachelor's in Pollachi
Mahalingam college of Engineering and Technology 2009
Record has not been verified.
Skills
ETL
2019
6
Java
2019
6
Devops
2020
4
Git
2020
4
Hadoop
2020
4
Hbase
2019
4
HDFS
2019
4
Hive
2019
4
Jenkins
2020
4
MapReduce
2019
4
Oozie
2019
4
Sqoop
2019
4
Ambari
2020
3
Data Validation
2019
3
Data Warehousing
2019
3
Eclipse
2020
3
Python
2019
3
Stored Procedure
2019
3
Teradata
2019
3
UNIX
2019
3
Apache
2016
2
Big Data
2020
2
Informatica
2019
2
Maven
2017
2
MySQL
2016
2
node.js
2016
2
Data Cleansing
2016
1
Data Migration
2016
1
Pig
2016
1
Spark
2020
1
XML
2017
1
AWS
0
1
BaSH
0
1
CentOS
0
1
Data Modeling
0
1
JDBC
0
1
Linux
0
1
MS Azure
2020
1
Oracle
0
1
RedHat
0
1
SQL
2020
1
SQL Server
2020
1
Ubuntu
0
1
Windows
0
1