Data mining projects follow the Cross-Industry Standard Process for Data Mining (CRISP-DM), which has six major phases: business understanding, data understanding, data preparation, modeling, evaluation, and deployment.
Requirements
Responsibilities:
- Hands-on experience in the Hadoop ecosystem, especially Hive and Cloudera/Hue
- Time-series analysis using Hive
- Should have written, or helped write, a Java web service that uses data in Cloudera/Hadoop (Hive)
- Synchronized transaction management using WebLogic (inserts only)
- Work with Java and database experts to come up with efficient Hive/NoSQL solutions and intermediate tables in Hive
Skills Required:
Good to Have:
- Loading data and designing partitions in Hadoop
- Handling structured and semi-structured data
- Tuning a Hadoop implementation (Cloudera installation)