- 岗位要求
You have a good understanding of data modeling and experience with data engineering tools and platforms such as Kafka, Spark, and Hadoop
You have built large-scale data pipelines and data-centric applications using any of the distributed storage platforms such as HDFS, S3, NoSQL databases (Hbase, Cassandra, etc.) and any of the distributed processing platforms like Hadoop, Spark, Hive, Oozie, and Airflow in a production setting
Hands-on experience in MapR, Cloudera, Hortonworks, and/or cloud (AWS EMR, Azure HDInsights, Qubole, etc.) based Hadoop distributions
You are comfortable taking data-driven approaches and applying data security strategy to solve business problems
Working with data excites you: you can build and operate data pipelines, and maintain data storage, all within distributed systems
Strong communication and client-facing skills with the ability to work in a consulting environment