职位详情

暂时没有符合条件的职位

研发 北京,成都,西安
详情 收起

薪酬:16K-25K  |  学历要求:硕士及以上  |  工作年限:2年以上

岗位职责
You will partner with teammates to create complex data processing pipelines in order to solve our clients’ most ambitious challenges You will collaborate with Data Scientists in order to design scalable implementations of their models You will pair to write clean and iterative code based on TDD Leverage various continuous delivery practices to deploy data pipelines Advise and educate clients on how to use different distributed storage and computing technologies from the plethora of options available Develop modern data architecture approaches to meet key business objectives and provide end-to-end data solutions Create data models and speak to the tradeoffs of different modeling approaches
岗位要求
You have a good understanding of data modeling and experience with data engineering tools and platforms such as Kafka, Spark, and Hadoop You have built large-scale data pipelines and data-centric applications using any of the distributed storage platforms such as HDFS, S3, NoSQL databases (Hbase, Cassandra, etc.) and any of the distributed processing platforms like Hadoop, Spark, Hive, Oozie, and Airflow in a production setting Hands-on experience in MapR, Cloudera, Hortonworks, and/or cloud (AWS EMR, Azure HDInsights, Qubole, etc.) based Hadoop distributions You are comfortable taking data-driven approaches and applying data security strategy to solve business problems Working with data excites you: you can build and operate data pipelines, and maintain data storage, all within distributed systems Strong communication and client-facing skills with the ability to work in a consulting environment
Mazr
Mazr

ThoughtWorks_Developer

  • 平均简历处理率 95%
  • 平均简历处理时间 38天
  • 当前简历处理率 100%
  • 当前简历处理时间 20天