Sr. Software Engineer, Data
We are looking for skilled engineers with an eye for building and optimizing distributed systems. From data ingestion and processing to storage optimization, we work closely with engineers and the product team to build highly scalable systems that tackle real-world data problems. Our customers depend on us to provide accurate, real-time, and fault-tolerant solutions to their ever-growing data needs. The senior-level engineer position is highly technical, with responsibility for leading the development, validation, publishing, and maintenance of logical and physical data models that support various OLTP and analytics environments.
About this role:
Design and implement planet-scale distributed data platforms, services, and frameworks, including solutions for high-volume and complex data collection, processing, transformation, and analytical reporting
Work with the application development team to implement data strategies, build data flows, and develop conceptual data models
Understand and translate business requirements into data models supporting long-term solutions
Analyze data system integration challenges and propose optimized solutions
Research and identify effective data designs, new tools, and methodologies for data analysis
Provide guidance and expertise to the development community on effectively implementing data models and building high-throughput data access services
Provide technical leadership in all phases of a project, from discovery and planning through implementation and delivery
Qualifications:
6+ years of hands-on experience in architecture, design, or development of enterprise data solutions, applications, and integrations
Ability to conceptualize and articulate ideas clearly and concisely
Excellent algorithm, data structure, and coding skills, with programming experience in Java, Python, or Scala
Proficiency in SQL
Experience building products using at least one technology from each of the following categories:
*Relational Stores (e.g. Postgres, MySQL, or Oracle)
*Columnar or NoSQL Stores (e.g. BigQuery, ClickHouse, or Redis)
*Distributed Processing Engines (e.g. Apache Spark, Apache Flink, or Celery)
*Distributed Queues (e.g. Apache Kafka, AWS Kinesis, or GCP Pub/Sub)
Experience with software engineering standard methodologies (e.g. unit testing, code reviews, design documents)
Experience working with GCP, Azure, AWS, or similar cloud platform technologies a plus
Excellent written and verbal communication skills
Bonus points for contributions to the open source community