
Associate Product Architect
Yantran LLC, Seattle, WA, United States
Competent Big Data/Databricks engineer who is independent, results-driven, and capable of taking business requirements and building out the technologies to take them to production.
• Big Data Engineer with expert-level experience in the Hadoop ecosystem and real-time analytics tools, including PySpark/Scala Spark/Hive/Hadoop CLI/MapReduce/Storm/Kafka/Lambda Architecture/Mongo
• Familiar with job scheduling challenges in Hadoop
• Experienced in creating and submitting Spark jobs
• Experience in performance tuning and scalability
• Experience working with real-time stream processing technologies such as Spark Structured Streaming and Kafka
• Expertise in Python/Spark and their related libraries and frameworks
• Experience building pipelines and handling the work involved in deploying them
• Unix/Linux expertise; comfortable with the Linux operating system and shell scripting
• Experience with Azure Cache
• Design, development, and unit and integration testing of complex data pipelines that handle large data volumes to derive insights
• Ability to optimize code to run efficiently within the stipulated SLA
• Excellent problem-solving skills, with attention to detail and a focus on quality and timely delivery of assigned tasks