Senior Machine Learning Engineer, Data Platform
Skills :     Machine Print Operator
Job Description:

PubMatic (Nasdaq: PUBM) is an independent technology company maximizing customer value by delivering digital advertising’s supply chain of the future.

PubMatic’s sell-side platform empowers the world’s leading digital content creators across the open internet to control access to their inventory and increase monetization by enabling marketers to drive return on investment and reach addressable audiences across ad formats and devices.

Since 2006, our infrastructure-driven approach has allowed for the efficient processing and utilization of data in real-time. By delivering scalable and flexible programmatic innovation, we improve outcomes for our customers while championing a vibrant and transparent digital advertising supply chain.

Job Description

PubMatic is seeking a Sr. Machine Learning Engineer with big data experience who can work on building the next-generation ML platform. The ideal candidate is a self-motivated problem solver with a strong background in big data tech stack, software design and development.

If you get excited about building a highly impactful machine learning platform that processes large datasets, in a creative and fast-paced open cultured environment, then you should consider applying for this position.

Responsibilities

  • Build, design, and implement our highly scalable, fault-tolerant, highly available big data platform to process terabytes of data and provide customers with in-depth analytics.
  • Developing Big Data pipelines using modern technology stacks such as Spark, Hadoop, Kafka, HBase, Hive, etc.
  • Developing analytics applications ground up using modern technology stacks such as Java, Spring, Tomcat, Jenkins, REST APIs, JDBC, Amazon Web Services, and Hibernate;
  • Building data pipeline to automate high-volume data collection and processing to provide real-time data analytics.
  • Work collaboratively with the Machine Learning and monetization team to make democratize data for analysis and impact.
  • Build solutions to help the monetization team to run experiments at a fast pace and analyze data accurately to calculate impact.
  • Have a good understanding of the engineering tech stack and ML algorithms to make data processing jobs powering these algorithms more efficient and scalable.
  • Develop systems to objectively monitor the impact of various experimental changes on machine learning algorithms, clearly highlighting both positive and negative outcomes.
  • Managing Hadoop Map Reduce and Spark Jobs & solving any ongoing issues with operating the cluster;
  • Expertise in developing Implementation of professional software engineering best practices for the full software development life cycle, including coding standards, performing code reviews, committing to Github, preparing documents in Confluence, continuous delivery using Jenkins, automated testing, and operations.
  • Participate in Agile/Scrum processes such as Sprint Planning, Sprint Retrospective, Backlog grooming, User story management, work item prioritization, etc.
  • Keep in regular touch with the quality engineering team which ensures the quality of the platforms/products and performance SLAs of Java-based microservices and Spark-based data pipelines.
  • Support customer issues over email or JIRA(bug tracking system), and provide updates, and patches to customers to fix the issues.
  • Discuss with the Technical Writing team about the technical documents that are published on the documentation portal.

Qualifications

  • 3-5 years coding experience in Java,
  • Solid computer science fundamentals including data structure and algorithm design, and creation of architectural specifications.
  • Expertise in developing Implementation of professional software engineering best practices for the full software development life cycle, including coding standards, code reviews, source control management, documentation, build processes, automated testing, and operations.
  • A passion for developing and maintaining a high-quality code and test base, and enabling contributions from engineers across the team.
  • Expertise in big data technologies like Hadoop, Spark, Kafka, HBase, etc would be an added advantage.
  • Experience in developing and delivering large-scale big data pipelines, real-time systems & data warehouses would be preferred.
  • Demonstrated ability to achieve stretch goals in a very innovative and fast-paced environment.
  • Demonstrated ability to learn new technologies quickly and independently.
  • Excellent verbal and written communication skills, especially in technical communications.

Compensation And Benefits

Base Salary Range: $160,000 - $180,000

In accordance with applicable law, the above salary range provided is PubMatic’s reasonable estimate of the base salary for this role. The actual amount may vary, based on non-discriminatory factors such as location, experience, knowledge, skills, and abilities. In addition to salary PubMatic also offers a bonus and a competitive benefits package.

Additional Information

Return to Office: PubMatic employees throughout the globe have returned to our offices via a hybrid work schedule (3 days “in the office” and 2 days “working remotely”) that is intended to maximize collaboration, innovation, and productivity among teams and across functions.

Benefits: Our benefits package includes the best of what leading organizations provide such as paid leave programs, paid holidays, healthcare, dental and vision insurance, disability and life insurance, commuter benefits, physical and financial wellness programs, unlimited DTO in the US (that we actually require you to use!), reimbursement for mobile and fully stocked pantries plus in-office catered lunches 4 days per week.

Diversity and Inclusion: PubMatic is proud to be an equal opportunity employer; we don’t just value diversity, we promote and celebrate it. We do not discriminate on the basis of race, religion, color, national origin, gender, sexual orientation, age, marital status, veteran status, or disability status