{"has_next":true,"jobs":[{"id":"4b838594-7738-4715-8dfb-17cd9c747ea8","company_id":"3029e985-56bf-4ac2-9ae1-df4cdd53b12f","title":"Principal GenAI Data Engineer ","slug":"principal-genai-data-engineer-9a655ea2","description":"About Zscaler \n Zscaler accelerates digital transformation to ensure our customers can be more agile, efficient, resilient, and secure. As an AI-forward enterprise , we are constantly pushing the envelope, leveraging the world’s largest security data lake to power our cloud-native Zero Trust Exchange platform. This innovation protects our customers from cyberattacks and data loss by securely connecting users, devices, and applications in any location.\n Here, impact in your role matters more than title and trust is built on results. We say, impact over activity. We seek innovators who actively use AI to amplify their impact and who thrive in an environment where we leverage intelligent systems to stay ahead of evolving threats. We believe in transparency and value constructive, honest debate —we’re focused on getting to the best ideas, faster. We build high-performing teams that can make an impact quickly and with high quality. To do this, we are building a culture of execution centered on customer obsession , collaboration, ownership, and accountability.\n We value high-impact, high-accountability with a sense of urgency where you’re enabled to do your best work and embrace your potential. If you’re driven by purpose, thrive on solving complex challenges, and want to be part of the team that’s helping to secure the AI age, we invite you to bring your talents to Zscaler and help shape the future of cybersecurity.\n Role \n We are looking for a Principal GenAI Data Engineer to join our IT Data Strategy team. This role is fully remote within the US, reporting to the Senior Manager, Enterprise AI Data Platform. 
We are seeking an experienced technical leader to drive the design and implementation of enterprise-grade Generative AI data ingestion, knowledge preparation, and platform architectures that enable scalable, production-ready GenAI applications. This role focuses on architecting robust pipelines and platforms for ingesting, processing, governing, and serving structured and unstructured enterprise data for AI/LLM workloads. The ideal candidate combines deep expertise in enterprise data architecture, unstructured data pipelines, GenAI platform engineering, and strong software engineering skills in Python.\n What you’ll do (Role Expectations) \n \n Architect enterprise-scale GenAI data platforms for ingestion, transformation, enrichment, and serving of structured and unstructured data\n Design scalable pipelines for enterprise knowledge ingestion from diverse data sources including documents, SaaS platforms, knowledge bases, collaboration tools, and databases\n Define architecture for metadata extraction, chunking, enrichment, embeddings generation, and knowledge preparation workflows\n Design AI-ready data models and storage strategies for vector, graph, and hybrid knowledge systems\n Architect scalable unstructured data processing pipelines for text, images, PDFs, tables, and multimodal content\n \n Who You Are (Success Profile) \n \n You act like an owner. Your passion for the mission fuels your bias for action. You operate with integrity because you genuinely care about the outcome. You adapt to what’s needed, navigating seamlessly between high-level strategy and hands-on execution.\n You are a problem-solver. You seek out challenges because you are energized by finding solutions, knowing that solving the hard problems delivers the biggest impact.\n You lead with integrity. You do the right thing, even when it’s hard. 
You hold yourself and others to a high standard of accountability and build trust by matching your words with consistent, transparent action.\n You think at scale. You connect your day-to-day work to the larger company mission and think globally. You build solutions, processes, and teams that are not just effective today but are built to last and support a high-growth, global organization.\n You are a high-trust collaborator. You are ambitious for the team, not just yourself. You embrace our challenge culture by giving and receiving ongoing feedback—knowing that candor delivered with clarity and respect is the truest form of teamwork and the fastest way to earn trust.\n \n What We’re Looking for (Minimum Qualifications) \n \n Expert-level Python programming and software engineering capabilities\n Experience building distributed/scalable data pipelines for AI workloads\n Strong understanding of unstructured data extraction and processing pipelines\n Experience with vector databases, graph databases, and metadata/knowledge storage systems\n Hands-on experience with clustering, entity recognition algorithms, and modern retrieval strategies (including RAG, search, and agentic AI workflows)\n \n What Will Make You Stand Out (Preferred Qualifications) \n \n Deep understanding of AI-ready data platform design principles and the ability to bridge platform/data engineering with GenAI/LLM application requirements\n Experience with LLMOps / GenAIOps frameworks such as LangSmith, Evaluation Framework like Arize","salary_min":182000,"salary_max":260000,"location":"Remote 
(US)","workplace":"hybrid","job_type":"full-time","experience_level":"principal","tags":["security","embeddings","data-pipeline","agents","generative-ai","llm","data-engineering"],"apply_url":"https://job-boards.greenhouse.io/zscaler/jobs/5142526007","is_featured":false,"is_sticky":false,"status":"active","published_at":"2026-05-26T18:53:33Z","expires_at":"2026-06-29T14:09:18.592777Z","created_at":"2026-05-27T14:09:33.406845Z","updated_at":"2026-05-30T14:09:18.706338Z","company_name":"Zscaler","company_slug":"zscaler","company_logo_url":"https://www.google.com/s2/favicons?domain=zscaler.com\u0026sz=128","quality_score":90,"url":"https://aidevboard.com/job/4b838594-7738-4715-8dfb-17cd9c747ea8"},{"id":"84de85ca-e855-456f-bf90-bc014d3a1e3b","company_id":"714f360f-a244-487d-b3f0-0c43518a9e66","title":"Software Engineer II, Data","slug":"software-engineer-ii-data-6e0c0be7","description":"About Pinterest: \n Millions of people around the world come to our platform to find creative ideas, dream about new possibilities and plan for memories that will last a lifetime. At Pinterest, we’re on a mission to bring everyone the inspiration to create a life they love, and that starts with the people behind the product.\n Discover a career where you ignite innovation for millions, transform passion into growth opportunities, celebrate each other’s unique experiences and embrace the  flexibility to do your best work. Creating a career you love? It’s Possible.\n At Pinterest, AI isn't just a feature, it's a powerful partner that augments our creativity and amplifies our impact, and we’re looking for candidates who are excited to be a part of that. To get a complete picture of your experience and abilities, we’ll explore your foundational skills and how you collaborate with AI.\n Through our interview process, what matters most is that you can always explain your approach, showing us not just what you know, but how you think. 
You can read more about our AI interview philosophy and how we use AI in our recruiting process here .\n As a Software Engineer (Data), you are a full stack data engineer that loves solving business problems with data. You work with key business and product leads, analysts and data scientists to understand the business domain and how data can empower them. You engage with fellow engineers to develop better data platforms to make the process of producing data and deriving insights easy and efficient. You are passionate about the quality of the data you produce and take pride in having your data drive our business.\n  \n What you’ll do: \n \n Understand the business drivers and analytical use-cases and translate these to data products.\n Explore new technologies and learn new techniques to solve business problems creatively.\n Think big and drive the strategy for better data quality within Pinterest.\n Design, implement and maintain pipelines that produce business critical data reliably and efficiently using cloud technology.\n Become the voice of business within engineering, and of engineering within business.\n Create data visualizations that allow easy consumption of the data learnings and insights.\n Collaborate with many teams from Product, Engineering and Business to produce relevant data solutions that can be used across multiple use cases. \n Leverage AI to seek faster execution (i.e. draft, prototype, outline) and explore alternative options (i.e. iterate, compare approaches)\n Leverage AI to synthesize information (summarize, distill themes) and automate repeatable tasks (documentation, reporting, QA checks)\n \n  \n What we’re looking for: \n \n 2+ years of experience with big data (Hive, Iceberg, Presto, Spark, SparkSQL, Scala, Airflow), and scripting language (Python). 
Data visualization technologies (Tableau, Looker, Superset) a plus.\n Hands-on experience in principled data warehouse design, data pipeline design and development, and data visualization.\n Experience using large language models and developing AI agents to boost productivity\n Great communication skills. You should be able to directly communicate with senior business leaders, embed yourself with business teams, and present solutions to business stakeholders.\n Experience in working independently and driving projects end to end.\n Strong analytical skills. \n Demonstrated ability to use AI to improve speed and quality in your day-to-day workflow for relevant outputs.\n Strong track record of critical evaluation and verification of AI-assisted work (e.g., testing, source-checking, data validation, peer review).\n High integrity and ownership: you protect sensitive data, avoid over-reliance on AI, and remain accountable for final decisions and deliverables.\n Bachelor’s or Master’s degree in a relevant field such as Data Engineering, or equivalent experience\n \n  \n Relocation Statement: \n \n This position is not eligible for relocation assistance. Visit our PinFlex page to learn more about our working model.\n \n  \n In-Office Requirement Statement: \n \n We let the type of work you do guide the collaboration style. That means we're not always working in an office, but we continue to gather for key moments of collaboration and connection.\n This role will need to be in the office for in-person collaboration 1-2 times per half and therefore can be situated anywhere in Ontario.\n \n  \n #LI-HYBRID\n #LI-CH1\n At Pinterest we believe the workplace should be equitable, inclusive, and inspiring for every employee. In an effort to provide greater transparency, we are sharing the base salary range for this position. Final salary is based on a number of factors including location, travel, relevant prior experience, or particular skills and expertise. 
\n Information regarding the culture at Pinterest and benefits available for this position can be found  here . \n Canada based applicants only\n $139,378 — $189,378 CAD \n Our Commitment t","salary_min":139378,"salary_max":189378,"location":"Toronto, Canada","workplace":"onsite","job_type":"full-time","experience_level":"junior","tags":["data-pipeline","llm","agents","data-engineering"],"apply_url":"https://www.pinterestcareers.com/jobs/?gh_jid=7901817","is_featured":false,"is_sticky":false,"status":"active","published_at":"2026-05-25T17:26:50Z","expires_at":"2026-06-29T14:08:26.696644Z","created_at":"2026-05-27T14:08:40.273078Z","updated_at":"2026-05-30T14:08:26.848471Z","company_name":"Pinterest","company_slug":"pinterest","company_logo_url":"https://www.google.com/s2/favicons?domain=www.pinterest.com\u0026sz=128","quality_score":90,"url":"https://aidevboard.com/job/84de85ca-e855-456f-bf90-bc014d3a1e3b"},{"id":"4dbf93be-05aa-4e04-9e99-1e7793a74816","company_id":"714f360f-a244-487d-b3f0-0c43518a9e66","title":"Sr. Data Scientist, GenAI \u0026 Labeling Platforms","slug":"sr-data-scientist-genai-labeling-platforms-e43ce988","description":"About Pinterest: \n Millions of people around the world come to our platform to find creative ideas, dream about new possibilities and plan for memories that will last a lifetime. At Pinterest, we’re on a mission to bring everyone the inspiration to create a life they love, and that starts with the people behind the product.\n Discover a career where you ignite innovation for millions, transform passion into growth opportunities, celebrate each other’s unique experiences and embrace the  flexibility to do your best work. Creating a career you love? It’s Possible.\n At Pinterest, AI isn't just a feature, it's a powerful partner that augments our creativity and amplifies our impact, and we’re looking for candidates who are excited to be a part of that. 
To get a complete picture of your experience and abilities, we’ll explore your foundational skills and how you collaborate with AI.\n Through our interview process, what matters most is that you can always explain your approach, showing us not just what you know, but how you think. You can read more about our AI interview philosophy and how we use AI in our recruiting process here .\n Pinterest brings millions of people the inspiration to create a life they love. Advancements in Generative AI have opened up a wealth of opportunities for improvements in productivity and labeling quality, and we've only scratched the surface of its capabilities. Early results show strong promise for LLM-assisted labeling — reducing time and cost, focusing human rater efforts on higher-value problems, and improving the accuracy of our learnings.\n  \n This role focuses on advancing the science and systems behind labeling, evaluation, and GenAI-enabled workflows. The work spans LLM-assisted labeling, human-in-the-loop quality systems, prompt and rubric design, model evaluation, and methods for improving the speed, consistency, and usefulness of judgment-based data.\n  \n We're looking for a strong senior individual contributor to execute high-impact technical work in this space, partner cross-functionally to turn successful ideas into durable platform capabilities, and grow with the team as the space evolves.\n  \n What you’ll do: \n We are looking for an experienced and highly capable Data Scientist to help us drive step function improvements in our data labeling capabilities at Pinterest. 
In this role, you will:\n \n Execute high-impact scientific work across GenAI-powered labeling and evaluation systems\n Identify opportunities where LLMs and related methods can improve quality, speed, coverage, and cost efficiency\n Develop prototypes that demonstrate value in areas such as prompt optimization, task decomposition, quality estimation, routing, and human-in-the-loop workflows\n Design experiments and measurement frameworks to evaluate model performance, workflow outcomes, and operational tradeoffs\n Partner with engineering, product, and data science teams to productionize successful approaches\n Apply standards for trustworthiness, including bias measurement, calibration, quality control, and responsible oversight\n Contribute to reusable methods and frameworks that can scale across teams and use cases\n Support more junior scientists and contribute to the technical health of the team\n \n  \n What we’re looking for: \n \n 6+ years of combined post-graduate academic and industry experience (or PhD + 3 years) applying scientific methods to real-world problems on large-scale data\n Strong hands-on experience as an individual contributor solving technically complex, high-impact data science or ML problems\n Experience applying LLMs or other generative AI techniques to practical workflows, systems, or products\n Ability to turn ambiguous problems into rigorous analyses, experiments, and prototypes\n Track record of writing high-quality code and using technical work to influence product or platform direction\n Solid cross-functional collaboration skills and experience working effectively across teams\n Business and product sense with the ability to define meaningful success metrics\n Self-directed learning mindset and comfort working in a rapidly evolving technical landscape\n Experience with labeling systems, evaluation frameworks, human judgment workflows, or internal AI tooling is strongly preferred\n \n  \n Relocation Statement: \n \n \n We 
recognize that the ideal environment for work is situational and may differ across departments. What this looks like day-to-day can vary based on the needs of each organization or role. \n \n  \n In-Office Requirement Statement: \n \n \n This role will need to be in the office for in-person collaboration 1-2 times/quarter and therefore can be situated anywhere in the country. \n This position is not eligible for relocation assistance. Visit our PinFlex page to learn more about our working model.\n \n  \n #LI-NM4\n #LI-REMOTE\n At Pinterest we believe the workplace should be equitable, inclusive, and inspiring for every employee. In an effort to provide greater transparen","salary_min":139764,"salary_max":287749,"location":"San Francisco, CA","workplace":"remote","job_type":"full-time","experience_level":"senior","tags":["llm","generative-ai","data-engineering","data-science"],"apply_url":"https://www.pinterestcareers.com/jobs/?gh_jid=7923203","is_featured":false,"is_sticky":false,"status":"active","published_at":"2026-05-18T18:22:15Z","expires_at":"2026-06-29T14:08:27.096244Z","created_at":"2026-05-27T14:08:40.655054Z","updated_at":"2026-05-30T14:08:27.209786Z","company_name":"Pinterest","company_slug":"pinterest","company_logo_url":"https://www.google.com/s2/favicons?domain=www.pinterest.com\u0026sz=128","quality_score":90,"url":"https://aidevboard.com/job/4dbf93be-05aa-4e04-9e99-1e7793a74816"},{"id":"df1eb663-3291-469f-99fa-889a704b06f9","company_id":"714f360f-a244-487d-b3f0-0c43518a9e66","title":"Data Scientist II, Infrastructure","slug":"data-scientist-ii-infrastructure-76c04719","description":"About Pinterest: \n Millions of people around the world come to our platform to find creative ideas, dream about new possibilities and plan for memories that will last a lifetime. 
At Pinterest, we’re on a mission to bring everyone the inspiration to create a life they love, and that starts with the people behind the product.\n Discover a career where you ignite innovation for millions, transform passion into growth opportunities, celebrate each other’s unique experiences and embrace the  flexibility to do your best work. Creating a career you love? It’s Possible.\n At Pinterest, AI isn't just a feature, it's a powerful partner that augments our creativity and amplifies our impact, and we’re looking for candidates who are excited to be a part of that. To get a complete picture of your experience and abilities, we’ll explore your foundational skills and how you collaborate with AI.\n Through our interview process, what matters most is that you can always explain your approach, showing us not just what you know, but how you think. You can read more about our AI interview philosophy and how we use AI in our recruiting process here .\n Pinterest brings millions of people the inspiration to create a life they love. Behind that experience is a complex infrastructure ecosystem that powers reliability, performance, measurement, and efficiency across the platform. As Pinterest grows, it’s increasingly important that we understand these systems clearly so we can make smarter decisions for both Pinners and the business.\n  \n We’re looking for a Data Scientist to join our Infrastructure Data Science team. In this role, you’ll partner with engineering and cross-functional teams to make Pinterest’s infrastructure more measurable, intelligible, and actionable. Depending on the area, your work may span app performance, shopping infrastructure, metrics quality, infrastructure governance, or site reliability. You’ll help build the data foundations, measurement systems, and analytical frameworks that enable Pinterest to optimize core technical systems and make better product and infrastructure decisions. 
\n What you’ll do: \n In this role, you will partner closely with engineering and cross-functional teams to improve how Pinterest measures, understands, and optimizes its infrastructure:\n \n Partner with engineering teams to define, measure, and improve the health, quality, and efficiency of Pinterest’s infrastructure systems.\n Build and refine metrics, dashboards, and analytical frameworks that make complex technical systems more understandable and actionable.\n Strengthen data foundations by improving metric definitions, auditing data quality, and contributing to pipeline and measurement improvements where needed.\n Design and analyze experiments, investigations, and deep dives to quantify the impact of infrastructure changes on user experience, reliability, and business outcomes.\n Translate ambiguous technical problems into clear analyses and actionable recommendations for engineering and platform partners.\n Support high-priority investigations and decision-making related to infrastructure performance, reliability, cost, and measurement quality.\n Identify opportunities to improve how Pinterest measures and optimizes infrastructure across a range of domains, such as performance, shopping infrastructure, governance, metrics quality, and site reliability. 
\n \n What we’re looking for: \n \n Masters degree in a relevant field such as Statistics, Applied Math, Biostatistics, or equivalent experience.\n Strong SQL and analytical programming skills, with experience working through messy, imperfect data and building reliable metrics and datasets.\n Experience partnering on or contributing to production-ready data pipelines, measurement systems, or foundational data work that improves data quality and usability.\n Solid foundation in experimentation and measurement, with the ability to design analyses, interpret results rigorously, and partner effectively with engineers and other cross-functional stakeholders.\n Demonstrated ability to translate ambiguous problems into clear analytical workstreams and actionable recommendations.\n Strong cross-functional communication skills, with the ability to explain technical findings clearly to engineering, product, and platform stakeholders.\n Ability to operate independently, prioritize across both longer-term projects and fast-turn inbound requests, and drive work forward in a dynamic environment.\n Curiosity and a builder mindset, with excitement for improving messy systems and creating more scalable, trustworthy measurement foundations.\n \n  \n Relocation Statement: \n \n \n We recognize that the ideal environment for work is situational and may differ across departments. What this looks like day-to-day can vary based on the needs of each organization or role. \n \n  \n In-Office Requirement Statement: \n \n This position is not eligible for relocation assistance. 
Visit our PinFlex page to l","salary_min":114297,"salary_max":235319,"location":"San Francisco, CA","workplace":"remote","job_type":"full-time","experience_level":"mid","tags":["data-pipeline","data-science","data-engineering","infrastructure"],"apply_url":"https://www.pinterestcareers.com/jobs/?gh_jid=7816424","is_featured":false,"is_sticky":false,"status":"active","published_at":"2026-05-18T18:21:18Z","expires_at":"2026-06-29T14:08:25.367154Z","created_at":"2026-05-27T14:08:38.877001Z","updated_at":"2026-05-30T14:08:25.481605Z","company_name":"Pinterest","company_slug":"pinterest","company_logo_url":"https://www.google.com/s2/favicons?domain=www.pinterest.com\u0026sz=128","quality_score":90,"url":"https://aidevboard.com/job/df1eb663-3291-469f-99fa-889a704b06f9"},{"id":"56169506-5603-4cc9-8a02-2c4a36957dba","company_id":"9d70a126-16ff-4f5c-9f36-2133735865d3","title":"Senior Data Engineer","slug":"senior-data-engineer-6a00ba6a","description":"Who We Are \n Verkada is transforming how organizations protect their people and places with an integrated, privacy-sensitive AI-powered platform that includes solutions for video security, access control, air quality sensors, alarms, intercoms, and visitor management. \n We’ve got serious momentum in the market: more than 30,000 customers (including 100+ of the Fortune 500), a $5.8B valuation , more than $1 billion in annualized bookings, and backing from CapitalG, Sequoia Capital, General Catalyst, Felicis Ventures, Next47 and more. Physical AI is one of the most consequential technology shifts of our time, and Verkada is at the center of it.\n You can look at all kinds of communities to see our platform’s impact in the world. It's the retailer that uses our agentic AI to deter theft before it happens. The warehouse that uses AI-powered alerts to make sure its team is protected on the floor with proper PPE. The school that’s alerted to a threat in real-time and triggers a lockdown in seconds, not minutes. 
We’re rapidly scaling this impact: today, more than 2 million Verkada devices are deployed across 170+ countries. \n About the Role \n As a member of our Data Platforms and Analytics Team and reporting directly to the Head of Data, you will be responsible for developing the core enterprise data warehouse infrastructure, data models, and pipelines at Verkada. We aim to provide a single reporting source of truth for enterprise data with clear business data definitions to empower internal Finance, Sales, Marketing, Product and HR teams to make informed data driven decisions. Our strategy emphasizes automation, scalable architecture, and accuracy, while providing iterative improvements over time.\n We are committed to a thriving in-office culture. This role requires you to be onsite at our HQ in San Mateo, CA.\n What You'll Do \n \n Engineer and maintain efficient, scalable warehouse infrastructure that facilitates high-quality, accurate insights and reporting.\n Design, implement and manage automated data pipelines from various data sources including databases, API endpoints, business systems, and data lakes.\n Collaborate across departments to develop bronze, silver, and gold data models, enforcing business alignment and data governance.\n Partner with Finance, Sales, Marketing, Product, and HR stakeholders to define data pipeline sources, data modeling requirements, and data quality standards.\n Oversee the entire project lifecycle, moving initiatives from initial design through to production leveraging development standards such as Github PR reviews and Jira sprint board management.\n Create and deploy strategies to maintain data security, integrity, and regulatory compliance.\n Provide leadership and guidance to grow and mentor future members of the data engineering team.\n \n What You Bring \n \n Bachelor's or Master's degree in Computer Science or a related technical field.\n Minimum of 5 years of professional data engineering experience.\n Advanced skill in 
Python and SQL.\n Expertise with cloud warehouses such as BigQuery, Snowflake, or Databricks leveraging DBT as a data modeling framework.\n Expertise in managing data lakes with open source file formats such as Apache Iceberg, Delta Lake or Apache Hudi\n Proven track record in constructing automated pipelines using Airflow, Dagster, Fivetran, and / or Airbyte from various operational databases, API endpoints, business systems, and data lakes.\n Expert-level proficiency in SQL and Python is required.\n Experience building / managing data observability and data quality platforms such as BigEye, Monte Carlo and Great Expectations is a plus\n Familiarity or experience building vector databases for Generative AI use cases is a plus.\n Experience building Gen AI agents to optimize development workflows within data engineering is a plus.\n Experience building Gen AI agents for providing support for business intelligence related inquiries is a plus\n Must be willing and able to work onsite five days per week.\n \n Employee Benefits \n Verkada is committed to fostering a workplace environment that prioritizes the holistic health and wellbeing of our employees and their families by offering comprehensive wellness perks, benefits, and resources. 
Our benefits and perks programs include, but are not limited to:\n \n Healthcare programs that can be tailored to meet the personal health and financial well-being needs - Premiums are 100% covered for the employee under at least one plan and 80% for family premiums under all plans\n Nationwide medical, vision and dental coverage\n Health Saving Account (HSA) with annual employer contributions and Flexible Spending Account (FSA) with tax saving options\n Expanded mental health support\n Paid parental leave policy \u0026 fertility benefits\n Time off to relax and recharge through our paid holidays, firmwide extended holidays, flexible PTO and personal sick time\n Professional development stipend\n Wellness/fitness benefits\n Healthy lunches provided daily","salary_min":140000,"salary_max":210000,"location":"San Mateo, CA","workplace":"onsite","job_type":"full-time","experience_level":"senior","tags":["generative-ai","data-pipeline","agents","embeddings","healthcare","data-engineering"],"apply_url":"https://job-boards.greenhouse.io/verkada/jobs/5139539007","is_featured":false,"is_sticky":false,"status":"active","published_at":"2026-05-16T00:13:28Z","expires_at":"2026-06-29T14:09:29.268442Z","created_at":"2026-05-16T14:10:01.242564Z","updated_at":"2026-05-30T14:09:29.379654Z","company_name":"Verkada","company_slug":"verkada","company_logo_url":"https://www.google.com/s2/favicons?domain=verkada.com\u0026sz=128","quality_score":90,"url":"https://aidevboard.com/job/56169506-5603-4cc9-8a02-2c4a36957dba"},{"id":"482d8713-6d68-47f1-bc6f-cae1db3e912c","company_id":"76c63eb7-c307-4322-8c2b-c20216feec49","title":"Senior Engineering Manager, Data Engineering","slug":"senior-engineering-manager-data-engineering-ed22f67c","description":"Get to Know Us\n\nHorizon3.ai http://Horizon3.ai is a fast-growing, remote cybersecurity company dedicated to the mission of enabling organizations to proactively find, fix and verify exploitable attack vectors before criminals exploit them. 
Our flagship product, the NodeZero™ platform, delivers production-safe autonomous pentests and other key assessment operations that scale across the largest internal, external, cloud, and hybrid cloud environments. NodeZero has been adopted by organizations of all sizes, from small educational institutions to government agencies and Global 100 enterprises. It is used by IT Ops/SecOps teams, consulting pentesters, and MSSPs and MSPs.\n\nWe are a fusion of former U.S. Special Operations cyber operators, startup engineers \u0026 operators, and formerly frustrated cybersecurity practitioners. We're committed to helping solve our common security problems: ineffective security tools and false positives, resulting in alert fatigue, blind spots, “checkbox” security culture, cybersecurity skills shortage, and the long lead time and expense of hiring outside consultants. Collectively, we are a team of learn-it-alls, committed to a culture of respect, collaboration, ownership, and results.\n\nAs a remote-first company, we require a minimum 25 Mbps consumer-grade broadband connection.\n\n\nWhat You’ll Do\n\nYou will lead the team that provides the internal data platform that powers analytics and operational decision-making across Horizon3, to make high-quality, trustworthy data available to the business.\n\nYou will:\n\n - Drive and lead execution on a modernization of Horizon3’s data architecture.\n\n - Define data quality and timeliness standards. Drive data quality, observability and pipeline robustness efforts to provide reliable and performant access to data to consumers.\n\n - Act as a product owner to capture needs of product teams, BI teams, and other customers and manage the roadmap of data engineering initiatives.\n\n - Grow a team of data engineers, infrastructure engineers, and data analysts. 
Establish a culture of collaboration and engineering excellence.\n\n\n\nWhat You’ll Bring:\n\nPlatform Leadership \u0026 Architecture\n\n - Demonstrated expertise leading teams designing and operating cloud data warehouses in production (e.g., Redshift, Snowflake, Databricks, BigQuery).\n\n - Hands-on experience with the modern data stack: dbt for transformation, a pipeline orchestrator (Airflow, Dagster, or similar), and managed ingestion tooling (Fivetran, Airbyte, etc.).\n\n - Experience implementing data quality frameworks and observability: defining pipeline SLAs, detecting and alerting on anomalies, and establishing tiered data sets with quality guarantees (e.g., medallion architecture).\n\n - Able to partner with and influence peer engineering teams, driving alignment on shared standards such as pipeline patterns, data contracts, and quality guarantees across teams that own their own data sources.\n\n\n\nTeam Building \u0026 Engineering Culture\n\n - Experience building or significantly growing a small data engineering team, including hiring, onboarding, and establishing engineering norms.\n\n - Proven ability to define and instill engineering culture: code review standards, definition of done, incident response, documentation practices, and a culture of ownership.\n\n - Drives high-impact architecture decisions rigorously: requires design documents, runs structured reviews, builds consensus, and ensures decisions are well-reasoned before commitment.\n\n - Demonstrated experience acting as product owner for a platform or infrastructure team: maintaining a roadmap, triaging inbound requests, managing internal customer expectations, and making prioritization tradeoff decisions against capacity.\n\n\n\nTechnical Expertise:\n\n - Proficient in SQL. Able to read, write, and review data transforms and data quality checks in SQL.\n\n - Proficient in Python. Able to review pipeline code and guide engineering decisions.\n\n - Competent in data analysis. 
Able to investigate anomalies, validate data quality issues, and find insight in data.\n\n\n\nNice to Haves:\n\n - Familiarity with AWS data services (Redshift, Athena, S3/Glue)\n\n - Experience with Databricks (Delta Lake, Unity Catalog, Spark)\n\n - Familiarity with Argo Workflows or Kubernetes-native job orchestration\n\n - Experience with high volume streaming data (Kafka, PubSub)\n\n - Experience supporting data science or ML workflows.\n\n - Exposure to cybersecurity, network telemetry, APM, or other high-volume operational SaaS data.\n\n \n\nTravel Requirements\n\nWe are a fully remote company, and this job may require up to 10% travel to be successful.\n\n\n\nCompensation and Values\n\nAt Horizon3, we believe that our people are our greatest asset, and our compensation philosophy reflects this core value. We are committed to fostering an environment where all employees feel valued, respected, and rewarded for their contributions. Our compensation structure is designed to be fair, competitive, and transparent, ensurin","salary_min":260000,"salary_max":280000,"location":"Remote (US)","workplace":"hybrid","job_type":"full-time","experience_level":"senior","tags":["data-pipeline","cloud","security","data-engineering"],"apply_url":"https://jobs.ashbyhq.com/horizon3ai/7dbb4b68-db0b-4a34-b131-e376c651f1c2/application","is_featured":false,"is_sticky":false,"status":"active","published_at":"2026-05-05T17:39:13.879Z","expires_at":"2026-06-29T14:06:11.679839Z","created_at":"2026-05-06T14:07:13.575587Z","updated_at":"2026-05-30T14:06:11.785966Z","company_name":"Horizon3 AI","company_slug":"horizon3-ai","company_logo_url":"https://www.google.com/s2/favicons?domain=horizon3.ai\u0026sz=128","quality_score":90,"url":"https://aidevboard.com/job/482d8713-6d68-47f1-bc6f-cae1db3e912c"},{"id":"efe98185-f104-4129-8d20-2518c860a685","company_id":"4c0fefc3-173a-4227-a823-4d67d3e70ff0","title":"Senior Software Engineer, 
Data","slug":"senior-software-engineer-data-c12c0fe3","description":"Persons in these roles are expected to work from our offices in Seattle. On-site requirements vary based on position and team. If you have questions about on-site work arrangements for this role, please ask your recruiter.\n Our base salary range is $126,000 - $189,000, and in addition we have generous bonus plans to provide a competitive compensation package. \n Who You Are: \n The Allen Institute for AI (Ai2) is hiring a Senior Data Engineer to build the data infrastructure behind AI research agents that explore and reason over scholarly literature. You'll work on the Semantic Scholar corpus, expanding what it covers and improving the quality of what’s already there, and create the APIs and tooling that these agents rely on at scale.\n This role sits at the intersection of data engineering and applied ML. You'll own pipelines, design schemas, and ship production services, but you'll also apply practical ML techniques (entity resolution, text classification, embedding-based similarity) to improve data quality and enrich metadata at scale, directly shaping what the agents can do. We're looking for a strong engineer who is comfortable across that full range.\n Who We Are:  \n The Agentic Applications team builds open, production-grade systems that power scientific discovery and large-scale AI research. We focus on creating high-quality structured datasets, integrating diverse content types, and enabling downstream applications across search, citation analysis, and model training. 
The team combines strong engineering practices with close collaboration across Ai2’s product and research orgs to deliver tools and infrastructure used by millions of researchers and developers worldwide.\n Your Next Challenge: \n \n Improve the coverage and quality of the Semantic Scholar corpus across academic papers, patents, and new domain-specific datasets\n Build and maintain scalable data pipelines for corpus integration, citation resolution, and metadata enrichment\n Develop and deploy ML models for entity disambiguation, author linking, and topic classification\n Design and extend APIs that expose structured scholarly data to academic researchers and AI agent workflows\n Contribute to dashboards and tools for evaluating data quality and model precision\n Collaborate across engineering and research teams to ensure maintainability, test coverage, and robust deployment\n \n What You’ll Need: \n Required:\n \n Bachelor's degree and 8+ years of technical experience; relevant experience may substitute for education.\n Strong Python engineering skills, especially for building and maintaining data pipelines\n Experience with SQL and schema design in production settings (PostgreSQL preferred)\n Familiarity with ML workflows (training classifiers, tuning models, deploying for inference), particularly for large-scale or ambiguous structured datasets\n Comfortable working with structured data formats (XML/JSON/Parquet) and writing ETL code\n Experience with workflow orchestration tools (Airflow or similar) and cloud infrastructure (AWS, S3, Docker)\n Strong communicator and a strong sense of ownership for results\n \n Preferred:\n \n Experience with author disambiguation, entity resolution, or record linkage problems\n Experience applying vector-based similarity or topic modeling techniques to real-world corpora at scale\n Exposure to citation networks or scholarly data systems (e.g., arXiv, OpenAlex, USPTO)\n Familiarity with building APIs or data services consumed 
by automated or agent-based workflows\n \n Physical Demands and Work Environment: \n The physical demands described here are representative of those that must be met by a team member to successfully perform the essential functions of this position. Reasonable accommodations may be made to enable individuals with disabilities to perform the functions.\n \n Must be able to remain in a stationary position for long periods of time. \n The ability to communicate information and ideas so others will understand. Must be able to exchange accurate information in these situations. \n The ability to observe details at close range.\n Can work under deadlines.\n \n A Little More About Ai2: \n Ai2 is a Seattle based non-profit AI research institute founded in 2014 by the late Paul Allen. Our mission is building breakthrough AI to solve the world’s biggest problems. We develop foundational AI research and innovation to deliver real-world impact through large-scale open models, data, robotics, conservation, and beyond.\n In addition to Ai2’s core mission, we also aim to contribute to humanity through our treatment of each member of the Ai2 Team. Some highlights are:\n \n We are a learning organization – because everything Ai2 does is ground-breaking, we are learning every day. 
Similarly, through weekly Ai2 Academy lectures, a wide variety of world-class AI experts as guest speakers, and our commitment to your personal on-going education, Ai2 is a place where you will have opportunities to continue learning alongside yo","salary_min":126000,"salary_max":189000,"location":"Seattle, WA","workplace":"onsite","job_type":"full-time","experience_level":"senior","tags":["robotics","nlp","cloud","agents","data-pipeline","healthcare","data-engineering"],"apply_url":"https://job-boards.greenhouse.io/thealleninstitute/jobs/7872631","is_featured":false,"is_sticky":false,"status":"active","published_at":"2026-05-02T04:38:40Z","expires_at":"2026-06-29T14:16:43.627368Z","created_at":"2026-05-06T14:23:24.690747Z","updated_at":"2026-05-30T14:16:43.737571Z","company_name":"Allen Institute for AI","company_slug":"allen-institute-for-ai","company_logo_url":"https://www.google.com/s2/favicons?domain=allenai.org\u0026sz=128","quality_score":90,"url":"https://aidevboard.com/job/efe98185-f104-4129-8d20-2518c860a685"},{"id":"ff0a1b6f-6b9a-4d92-86e7-a1ed47fe0488","company_id":"e3915539-5a8f-4461-9f26-06366a918674","title":"Product Data Engineer, Autonomous Airpower","slug":"product-data-engineer-autonomous-airpower-df70934e","description":"Anduril Industries is a defense technology company with a mission to transform U.S. and allied military capabilities with advanced technology. By bringing the expertise, technology, and business model of the 21st century’s most innovative companies to the defense industry, Anduril is changing how military systems are designed, built and sold. Anduril’s family of systems is powered by Lattice OS, an AI-powered operating system that turns thousands of data streams into a realtime, 3D command and control center. 
As the world enters an era of strategic competition, Anduril is committed to bringing cutting-edge autonomy, AI, computer vision, sensor fusion, and networking technology to the military in months, not years.\n ABOUT THE JOB   \n Anduril is seeking a Product Data Engineer to join our Fury Team in Columbus, Ohio.  The Product Data Engineer is responsible for change control activities supporting the Autonomous Airpower production line.  As a Product Data Engineer  you will work closely with the leaders of critical Anduril programs to manage a highly dynamic and fast-paced design, engineering, and manufacturing environment. The Product Data Engineer will have a massive impact on capturing and defining the baseline for a program’s requirements, design, testing, manufacturing, and sustainment data structures (requirements baseline, EBOM, MBOM, SOM, BOP).   \n WHAT YOU'LL DO \n \n Be a partner to Product and Program teams by: \n \n Working with Anduril system, design, and manufacturing engineers to generate, populate and submit configuration item change requests.\n Preparing critical analyses to support change requests while maintaining configuration integrity and traceability (item dependency analyses, test analyses, effectivity plans, inventory disposition analyses etc.)\n Supporting the execution and implementation of change notices across system engineering, design engineering, manufacturing engineering, and supply chain domains.\n Identifying and executing continuous improvement opportunities by building data-driven perspectives of the full product configuration landscape.   
\n \n \n \n Be the leader of critical configuration management activities and forums by: \n \n Supporting configuration managers in managing critical program technical forums (Test Review Board, Change Control Board etc.).\n Working with program teams to prepare for critical technical forums (the Change Review Board, the Change Control Board) including preparing business cases, test data, effectivity plans, inventory disposition plans, etc.\n Documenting and distributing conclusions and decisions emerging from configuration management forums.   \n \n \n \n Conduct periodic product configuration record audits by: \n \n Supporting configuration management reviews and audits, including FCA/PCA and providing direction before, during and after the events to prevent or resolve issues.\n Reconciling released bills of materials against baseline bills of materials by confirming all variances have signed-off change notices.\n Reconciling product as-built-records against released bills of materials by confirming all variances have quality deviations.\n Identifying, analyzing, and resolving non-conformances; finding and resolving root-cause failure modes.   \n \n \n \n Be an expert resource for the Product Teams for Engineering Release and Change Management processes and standards by: \n \n Using Teamcenter software to manage Engineering changes, including part revisions, EBOM/MBOM/SBOM updates, documentation modifications, and associated workflows.\n Providing guidance and support to Engineering teams in configuring the Engineering, Manufacturing, and Service Bills of Materials (EBOM, MBOM, SBOM)\n Supporting Configuration Managers and Engineers in defining and maintaining configurable product data sets (i.e., 150% BOMs, product variant dictionaries and schema).\n Being a Teamcenter super-user able to train product team users on workflows.   
\n \n \n \n Be part of the team defining the future of digital engineering at Anduril by: \n \n Partnering with engineering teams to set requirements and identify improvement opportunities in existing ways of working by creating tools or enhancing existing workflows.\n Building an end-to-end digital thread that maintains an accurate product definition, system by system, from the definition of requirements to the as-maintained record.\n Identifying and piloting opportunities to leverage Anduril’s AI capabilities and tools to unlock increased speed, accuracy, and efficiency within Anduril’s configuration management activities.   \n \n \n REQUIRED QUALIFICATIONS   \n \n 2-5 years of configuration analysis or equivalent experience.   \n \n \n 2-5 years of demonstrated experience working in high-complexity, high-rate production environments (aerospace, automotive, security systems, medical devices, nuclear).   \n \n \n Familiar with configuration management principles and practices (EIA649/C, MIL-HDBK-61B).   
\n \n \n Experience with pro","salary_min":113000,"salary_max":149000,"location":"Ashville, OH","workplace":"onsite","job_type":"full-time","experience_level":"junior","tags":["computer-vision","cloud","payments","data-engineering"],"apply_url":"https://boards.greenhouse.io/andurilindustries/jobs/5107205007?gh_jid=5107205007","is_featured":false,"is_sticky":false,"status":"active","published_at":"2026-04-28T19:56:54Z","expires_at":"2026-06-29T14:06:45.303048Z","created_at":"2026-04-30T05:50:13.095034Z","updated_at":"2026-05-30T14:06:45.414411Z","company_name":"Anduril","company_slug":"anduril","company_logo_url":"https://www.google.com/s2/favicons?domain=anduril.com\u0026sz=128","quality_score":90,"url":"https://aidevboard.com/job/ff0a1b6f-6b9a-4d92-86e7-a1ed47fe0488"},{"id":"56705546-5164-4608-b2fc-e84123e40134","company_id":"01b03876-5b01-47ab-82b9-696293e861b9","title":"Data Engineer II","slug":"data-engineer-ii-20336077","description":"Who we are \n Samsara (NYSE: IOT) is the pioneer of the Connected Operations™ Cloud, which is a platform that enables organizations that depend on physical operations to harness Internet of Things (IoT) data to develop actionable insights and improve their operations. At Samsara, we are helping improve the safety, efficiency and sustainability of the physical operations that power our global economy. Representing more than 40% of global GDP, these industries are the infrastructure of our planet, including agriculture, construction, field services, transportation, and manufacturing — and we are excited to help digitally transform their operations at scale. \n Working at Samsara means you’ll help define the future of physical operations and be on a team that’s shaping an exciting array of product solutions, including Video-Based Safety, Vehicle Telematics, Apps and Driver Workflows, and Equipment Monitoring. As part of a recently public company, you’ll have the autonomy and support to make an impact as we build for the long term. 
\n About the role: \n Samsara’s Revenue Operations AI \u0026 Data Team is building the future of how we go to market — with intelligence, personalization, and speed. We’re a high-impact team of builders, scientists, and strategists focused on transforming sales operations through AI. Our mission is to help sellers reach the right customer at the right time with the right message — and to put everything they need at their fingertips, whether that’s data from Salesforce, context from a past call, or content that wins deals.\n As a Data Engineer II, you’ll own the data platforms that power Samsara’s GTM AI engine. You’ll be responsible for building, scaling, and optimizing our Databricks data store, visualization store, and AI store, while also enabling large-scale generative AI jobs in Databricks. Your work will ensure that our AI applications are grounded in clean, reliable, and well-structured data from CRM pipelines, CS Systems to GenAI-powered copilots. You’ll partner closely with data scientists, AI engineers, and business stakeholders to deliver the infrastructure that fuels innovation at scale.\n This role is remote and open to candidates residing in the US  except the San Francisco Bay Metro Area, NYC Metro Area, and Washington, D.C. Metro Area.\n You should apply if: \n \n You want to impact the industries that run our world: Your efforts will result in real-world impact—helping to keep the lights on, get food into grocery stores, reduce emissions, and most importantly, ensure workers return home safely.\n You are the architect of your own career: If you put in the work, this role won’t be your last at Samsara. 
We set up our employees for success and have built a culture that encourages rapid career development, countless opportunities to experiment and master your craft in a hyper growth environment.\n You’re energized by our opportunity: The vision we have to digitize large sectors of the global economy requires your full focus and best efforts to bring forth creative, ambitious ideas for our customers.\n You want to be with the best: At Samsara, we win together, celebrate together and support each other. You will be surrounded by a high-calibre team that will encourage you to do your best.\n \n In this role, you will:   \n \n Build and maintain ETL/ELT data pipelines in Databricks and Spark, ensuring data is ingested, transformed, and delivered reliably for analytics and AI use cases.\n Develop and evolve logical and physical data models to support reporting, experimentation, and advanced workflows (e.g., scoring models, signal generation).\n Implement monitoring, alerts, and testing for data quality, timeliness, and lineage to ensure trustworthy data delivery.\n Support workflow orchestration with Databricks Jobs, DBT, or equivalent scheduling tools to operate at scale.\n Contribute to data pipelines and tooling that support retrieval-augmented generation (RAG), vector integrations, or embedding workflows.\n Design and optimize bulk GenAI data pipelines in Databricks to support generative AI applications at scale.\n Partner with AI engineers and data scientists to enable experimentation, model training, and production-grade deployments.\n Develop frameworks for data ingestion, transformation, governance, and monitoring across CRM, sales, and revenue systems.\n Work with RevOps, sales, and customer success stakeholders to translate business needs into data requirements and stable technical implementations.\n \n Minimum requirements for the role: \n \n 2-3 years of industry experience in data engineering, with significant experience building large-scale data 
platforms.\n Hands-on experience working with a modern data technology stack, such as Databricks, DBT, Redshift, RDS, Snowflake or similar solutions.\n Proficiency in Python and SQL, with experience in designing robust ETL/ELT pipelines.\n Experience orchestrating data workflows at scale and enabling machine learning or ","salary_min":101745,"salary_max":153900,"location":"Remote (US)","workplace":"hybrid","job_type":"full-time","experience_level":"junior","tags":["rag","generative-ai","data-pipeline","embeddings","code-generation","data-engineering"],"apply_url":"https://www.samsara.com/company/careers/roles/7859406?gh_jid=7859406","is_featured":false,"is_sticky":false,"status":"active","published_at":"2026-04-28T19:37:09Z","expires_at":"2026-06-29T14:03:54.236761Z","created_at":"2026-04-30T05:48:10.859436Z","updated_at":"2026-05-30T14:03:54.344933Z","company_name":"Samsara","company_slug":"samsara","company_logo_url":"https://www.google.com/s2/favicons?domain=www.samsara.com\u0026sz=128","quality_score":90,"url":"https://aidevboard.com/job/56705546-5164-4608-b2fc-e84123e40134"},{"id":"13a6bdb5-cf40-4e9e-85f4-af6c9c89b189","company_id":"63839083-85dd-4aa0-b128-254fc82866e5","title":"Senior / Staff Software Engineer, ML Datasets \u0026 Data Pipelines","slug":"senior-staff-software-engineer-ml-datasets-data-pipelines-d6def817","description":"Waabi, founded by AI visionary Raquel Urtasun, is the leader in Physical AI. With a world-class team, we're unlocking the next era of autonomous transportation with technology that's powering commercial autonomous trucks and robotaxis. Waabi is backed by and partners with world leaders in AI, automotive, logistics, and deep tech.\n\nWith offices in Toronto, San Francisco, Dallas, and Pittsburgh, Waabi is growing quickly and looking for diverse, innovative and collaborative candidates who want to impact the world in a positive way. 
To learn more visit: www.waabi.ai\n\n\nAs a Senior/Staff Software Engineer embedded within our Autonomy \u0026 Algorithms team, you will build the scalable ML data pipelines necessary to train and evaluate Waabi’s autonomous driving platform. Working closely with world-renowned scientists and engineers, you will solve complex data challenges to accelerate our launch of fully driverless vehicles.\n \nYou Will..\n- Design and implement data pipelines using real-world driving data and Waabi World (our high-fidelity simulator) to train and evaluate deep learning models.\n- Optimize data formats, caching, and dataloading to drive highly efficient ML training and evaluation at scale.\n- Improve data sampling and composition for deep data introspection to track model performance and uncover critical edge-case scenarios.\n- Champion engineering excellence by writing high-quality, well-structured, and rigorously tested code.\n- Help drive project roadmap planning, prioritization, and delivery.\nQualifications:\n- BS or MS in Computer Science, Machine Learning, or a related technical field, with 4+ years of industry experience.\n- Proficiency in Python and strong software engineering fundamentals, including experience with deep learning frameworks such as PyTorch, TensorFlow, or JAX.\n- Hands-on experience building distributed ETL and data processing pipelines.\n- Direct experience managing ML pipelines, including dataset management, dataloading, and optimization.\n- Strong understanding of cloud job orchestration, monitoring, and instrumentation best practices.\n- A collaborative, open-minded approach with a passion for tackling hard problems in autonomous technology and a strong willingness to mentor others.\n \nBonus Points:\n- Experience with optimizing large scale distributed training pipelines and/or highly optimized ML inference pipelines.  
\n- Experience with MapReduce (Apache Hadoop/Spark) or orchestration frameworks (Apache Airflow, Apache Beam, Google Cloud Dataflow, AWS Step Functions).\n- Experience solving data challenges specific to autonomous driving.\n- Familiarity with linear algebra (projections, transforms) and 3D geometry.\n- Experience working with multimodal sensor data (e.g., LiDAR, RADAR, camera).\n","salary_min":148000,"salary_max":260000,"location":"Toronto, Canada","workplace":"onsite","job_type":"full-time","experience_level":"lead","tags":["pytorch","autonomous-vehicles","cloud","data-pipeline","tensorflow","deep-learning","distributed-systems","data-engineering"],"apply_url":"https://jobs.lever.co/waabi/81ed817e-cb56-45f8-aac6-e645610798da/apply","is_featured":false,"is_sticky":false,"status":"active","published_at":"2026-03-26T16:35:32.938Z","expires_at":"2026-06-29T14:05:44.788887Z","created_at":"2026-04-13T09:41:54.316786Z","updated_at":"2026-05-30T14:05:44.902302Z","company_name":"Waabi","company_slug":"waabi","company_logo_url":"https://www.google.com/s2/favicons?domain=waabi.ai\u0026sz=128","quality_score":90,"url":"https://aidevboard.com/job/13a6bdb5-cf40-4e9e-85f4-af6c9c89b189"},{"id":"d5c58ff8-1093-4c47-80e6-a3b88d596560","company_id":"a0000000-0000-0000-0000-000000000001","title":"Analytics Data Engineering Manager, Product","slug":"analytics-data-engineering-manager-product-f864f076","description":"About Anthropic \n Anthropic’s mission is to create reliable, interpretable, and steerable AI systems. We want AI to be safe and beneficial for our users and for society as a whole. 
Our team is a quickly growing group of committed researchers, engineers, policy experts, and business leaders working together to build beneficial AI systems.\n About the role\n As an Analytics Data Engineering Manager focused on Product, you will build and lead the analytics engineering team responsible for creating the data foundations that enable data-driven decision making across Anthropic’s Product organization.  You will oversee the development of scalable data solutions for Product pillars – including Consumer, Claude Code, Enterprise \u0026 Verticals, Growth, Platform Product – managing a team of analytics engineers and working closely with stakeholders across Data Science, Product, and Engineering to ensure teams have access to reliable, accurate metrics that can scale with our company’s growth.\n In this role, you will balance hands-on technical leadership with people management, setting the strategic vision for product data foundations while developing and mentoring team members.  
You will partner closely with Product Data Scientists, Product Managers, and Product Engineers to understand how users interact with Claude, how to measure product quality and growth, and how to transform raw event logs into insightful data marts that power product decisions.\n Responsibilities :\n \n \n Build and scale the Product Analytics Engineering team, including hiring and mentoring a team of high-performing analytics engineers embedded with Product pillars\n \n Define and execute the strategic roadmap for product data foundations and analytics capabilities\n \n Oversee the design and implementation of scalable data pipelines, data models, and analytics solutions that transform raw product event logs into canonical datasets and insightful data marts\n \n Partner with Data Science, Product, and Engineering leadership to understand data needs and translate them into technical requirements\n \n Establish and maintain high data integrity standards, SLAs, alerting, and best practices for the team\n \n Drive the development of foundational data products, dashboards, and tools to enable self-serve analytics; partner with the Data Science team to build innovative data tools using Claude to scale data-driven decisions across Product teams\n \n Foster a culture of technical excellence, continuous learning, and data-driven decision making\n \n Serve as a technical thought leader for data modeling, ETL processes, and product analytics infrastructure\n \n You might be a good fit if you have: \n \n \n 5+ years of experience managing analytics engineering or data engineering teams, preferably in a scaling startup environment\n \n 8+ years of total experience in analytics engineering, data engineering, or similar data-focused roles\n \n Deep expertise in data modeling, ETL pipelines, and data warehouse architecture\n \n Strong technical foundation with expertise in SQL, Python, dbt, and modern data stack tools\n \n Proven track record of building and leading high-performing 
teams\n \n Experience partnering with Data Science, Product, and Engineering leaders to deliver key product metrics and user behavior insights\n \n Demonstrated ability to balance strategic thinking with hands-on technical leadership\n \n Strong communication skills with the ability to translate complex technical concepts for diverse audiences\n \n Experience scaling analytics functions from early stage to maturity in rapidly changing environments\n \n Track record of establishing data governance, quality standards, and best practices\n \n A bias for action and urgency, not letting perfect be the enemy of the effective\n \n A “full-stack mindset”, not hesitating to do what it takes to solve a problem end-to-end\n \n A passion for Anthropic’s mission of building helpful, honest, and harmless AI\n The annual compensation range for this role is listed below. \n For sales roles, the range provided is the role’s On Target Earnings (\"OTE\") range, meaning that the range includes both the sales commissions/sales bonuses target and annual base salary for the role.\n Annual Salary:\n $370,000 — $450,000 USD \n Logistics \n Minimum education: Bachelor’s degree or an equivalent combination of education, training, and/or experience\n Required field of study:  A field relevant to the role as demonstrated through coursework, training, or professional experience\n Minimum years of experience: Years of experience required will correlate with the internal job level requirements for the position\n Location-based hybrid policy: Currently, we expect all staff to be in one of our offices at least 25% of the time. However, some roles may require more time in our offices.\n Visa sponsorship:  We do sponsor visas! However, we aren't able to successfully sponsor visas for every role and every candidate. 
But if we make you an offer, we will make","salary_min":370000,"salary_max":450000,"location":"San Francisco, CA","workplace":"hybrid","job_type":"full-time","experience_level":"lead","tags":["alignment","data-pipeline","data-science","data-engineering"],"apply_url":"https://job-boards.greenhouse.io/anthropic/jobs/5125387008","is_featured":false,"is_sticky":false,"status":"active","published_at":"2026-02-20T00:49:54Z","expires_at":"2026-06-29T14:00:10.12713Z","created_at":"2026-05-10T14:00:12.031668Z","updated_at":"2026-05-30T14:00:10.236952Z","company_name":"Anthropic","company_slug":"anthropic","company_logo_url":"https://www.google.com/s2/favicons?domain=anthropic.com\u0026sz=128","quality_score":90,"url":"https://aidevboard.com/job/d5c58ff8-1093-4c47-80e6-a3b88d596560"},{"id":"f0061aa7-7ffd-49a1-a92d-feac6070c604","company_id":"9e823d7c-7833-447b-9466-a78c031a7a48","title":"Senior Software Engineer - Data Engineering - Moloco Commerce Media","slug":"senior-software-engineer-data-engineering-moloco-commerce-media-b3b8b584","description":"About Moloco: \n Moloco builds some of the most powerful AI advertising solutions in the world. Our name—short for \"machine learning company\"—reflects our core mission: democratizing access to the advanced AI that has historically been reserved for tech giants. Led by machine learning pioneers who built some of the most successful ad systems at Google, including YouTube's monetization engine and key search advertising technologies, we're transforming how businesses grow and compete in the digital economy.\n Built with AI from day one, Moloco’s planet-scale machine learning platform powers a suite of solutions for advertising growth and monetization. Moloco Ads is an AI-powered platform that delivers real business outcomes for mobile app marketers through performance-based user acquisition. 
Moloco Commerce Media enables retailers and marketplaces to build revenue-generating ad businesses that balance user experience and advertiser performance.\n Moloco is headquartered in Silicon Valley, with offices in Seattle, New York, San Francisco, Seoul, Beijing, Singapore, Gurgaon, Tokyo, Shanghai, London, Tel Aviv, and Berlin.\n Moloco is a truly rewarding place to work and in an exciting period of growth, which you could be a part of. Join us today and apply now!\n The Impact You’ll Be Contributing to Moloco \n \n You will be responsible for developing an ML-based online advertising platform for the rapidly growing retail media industry.\n You will play a pivotal role designing, developing and optimizing the data infrastructure and pipelines to build robust and scalable solutions that power Moloco’s data-driven products.\n You will mentor others on the team and have the opportunity to lead high impact projects.\n \n The Opportunity \n \n Design and develop complex data pipelines and ETL processes for manipulating and managing big data.\n Improve cost effectiveness of data pipelines, storage, and databases, both independently and in collaboration with cross-functional teams.\n Design and implement data quality and governance processes to ensure data accuracy, consistency, and compliance.\n Enhance system scalability, availability, and performance across Moloco Commerce Media’s data infrastructure.\n Develop internal tools to boost developer productivity and system efficiency.\n Participate in cross-functional projects and initiatives, providing technical expertise and guidance.\n Collaborate with teams such as Infrastructure, Machine Learning, Data Science, Analytics, and Production to build the best ad platform.\n Tech lead a small team to contribute to high-impact projects.\n \n How Do I Know if the Role is Right For Me? 
\n \n Bring 6+ years of software engineering experience using modern languages such as Java, C#, Go, C++, Python, and foundational SQL knowledge, with proficiency in SQL or NoSQL database technologies.\n Demonstrate experience with computer systems architecture, operating systems, or distributed systems.\n Possess extensive experience with cloud platforms (e.g., AWS, Azure, Google Cloud) or equivalent on-premise systems.\n Apply strong programming fundamentals, testing practices, and knowledge of common algorithms and data structures.\n Exhibit system design and development skills, especially in large-scale environments.\n Work with at least one of the following: data modeling, analytics, data management, or big data processing/MapReduce.\n Display strong analytical and troubleshooting skills in ambiguous or evolving environments.\n Maintain a growth mindset, staying updated on emerging technologies and industry trends, and sharing knowledge with others.\n Communicate and collaborate effectively across teams.\n Deep, hands-on experience in ads infrastructure (bidding, pacing, budget optimization, or measurement) at large revenue scale a plus. \n \n \n Compensation \u0026 Benefits \n U.S.-based employees have access to medical, dental, and vision insurance, a 401(k) plan with company match, short-term and long-term disability coverage, basic life insurance, and well-being benefits and perks. U.S.-based employees also receive up to 12 scheduled paid holidays per calendar year and one Thrive Day off per quarter. Additionally, all employees have Flexible Time Off (FTO). \n The successful candidate may be eligible for a bonus and equity awards. Eligibility and amounts are determined by performance and the terms of the applicable plans. \n The location for this role is listed above. For base pay range purposes, location-based compensation is grouped into the following regions. Your region is determined by your assigned work location. 
\n \n Region A: Menlo Park Office, New York Office, SF Bay Area, New York Metro Area \n Region B:  Seattle Office, Seattle, Los Angeles/Orange County, San Diego, Austin, Boston, Washington DC Metro Area, Miami \n Region C: All other U.S. locations \n \n Salary Ranges: \n Region A:\n $188,000 — $259,440 USD \n Region B:\n $169,200 — $233,496 USD \n Region C:\n $159,800 — $220,524 USD \n Moloco Thrive: Benefits a","salary_min":159800,"salary_max":220524,"location":"Menlo Park, CA","workplace":"onsite","job_type":"full-time","experience_level":"senior","tags":["data-pipeline","distributed-systems","data-engineering","platform"],"apply_url":"https://job-boards.greenhouse.io/moloco/jobs/7607957003","is_featured":false,"is_sticky":false,"status":"active","published_at":"2026-02-09T21:36:44Z","expires_at":"2026-06-29T14:14:32.84962Z","created_at":"2026-04-22T15:53:15.596111Z","updated_at":"2026-05-30T14:14:32.959529Z","company_name":"Moloco","company_slug":"moloco","company_logo_url":"https://www.google.com/s2/favicons?domain=moloco.com\u0026sz=128","quality_score":90,"url":"https://aidevboard.com/job/f0061aa7-7ffd-49a1-a92d-feac6070c604"},{"id":"6d89dd83-c08c-4bc0-8753-b6484ed59168","company_id":"34cd55a7-59d0-4c54-bde4-216aadd50eae","title":"Software Engineer, Data","slug":"software-engineer-data-360da71c","description":"About HeyGen \n At HeyGen, our mission is to make visual storytelling accessible to all. Over the last decade, visual content has become the preferred method of information creation, consumption, and retention. But the ability to create such content, in particular videos, continues to be costly and challenging to scale. Our ambition is to build technology that equips more people with the power to reach, captivate, and inspire audiences. Learn more at  www.heygen.com .  Visit our Mission and Culture doc here . 
\n Position Summary \n A Software Engineer with data engineering responsibilities to bridge the gap between core application development and large-scale data infrastructure.  You will help build the data foundational layers for our next-generation features. This role is not just about moving data—it’s about enabling AI models to function in real-time, building robust pipelines for multimedia, and powering engaging user experiences. This team is currently working on cutting-edge features including PPT-to-video converters and interactive, conversational video capabilities.\n Core Responsibilities \n \n Build \u0026 Scale Data Pipelines: Design, develop, and maintain robust batch and real-time data pipelines (using Python, Go, Spark, Kafka) that ingest and transform massive multi-modal data—text, audio, and video—to train and run AI models.\n Power Intelligent Features: Collaborate with ML engineers to implement data structures and APIs for new, exciting features like PPT-to-video automation and interactive AI avatars that require low-latency data fetching.\n Data Lakehouse Infrastructure: Architect and manage data lakehouse solutions (e.g., Snowflake, Databricks, Apache Iceberg) to store and query unstructured media data efficiently, enhancing storage and computation efficiency.\n Data Reliability \u0026 Observability: Implement data quality checks, data contracts, and monitoring to ensure high reliability of data, preventing downtime in production video generation.\n Productize Data: Transform raw data into structured, actionable data products that can be easily consumed by front-end applications, API endpoints, and AI agents.\n \n Qualifications \n \n Bachelor’s/Master’s degree in Computer Science, Engineering, or a related field.\n 3-5+ years of experience as a Backend Software Engineer with heavy data processing responsibilities.\n Strong proficiency in Python (for ETL/scripting) and SQL (for data modeling).\n Experience with cloud platforms (AWS/GCP) and data 
technologies like Kafka, Spark, and Snowflake/Databricks.\n Experience or interest in Computer Vision/Generative AI data processing.\n Proactive, \"owner\" mindset; ability to operate in a fast-paced, startup environment.\n \n What HeyGen Offers \n \n Competitive salary and benefits package.\n Dynamic and inclusive work environment focused on innovation and creativity.\n Opportunities for professional growth and skill development.\n Collaborative culture that values teamwork and employee input.\n Access to state-of-the-art technologies and tools.\n \n Salary Range $180,000 – $220,000 + equity + benefits Please note that the salary information is a general guideline only.  HeyGen considers factors such as scope and responsibilities of the position, candidate's work experience, education/training, key skills, and internal equity, as well as location, market and business considerations when extending an offer.  As part of our total rewards package, HeyGen offers comprehensive benefits including equity, a 401k plan, health benefits, generous PTO, a parental leave program and emotional health resources.\n HeyGen is an Equal Opportunity Employer. 
We celebrate diversity and are committed to creating an inclusive environment for all employees.\n Join us at HeyGen and be part of a team that's reshaping the world of video creation through innovative technology!","salary_min":180000,"salary_max":220000,"location":"San Francisco, CA","workplace":"onsite","job_type":"full-time","experience_level":"senior","tags":["agents","generative-ai","computer-vision","data-pipeline","data-engineering"],"apply_url":"https://job-boards.greenhouse.io/heygen/jobs/5044951007","is_featured":false,"is_sticky":false,"status":"active","published_at":"2026-02-06T01:02:40Z","expires_at":"2026-06-29T14:13:33.604123Z","created_at":"2026-04-16T15:55:47.380084Z","updated_at":"2026-05-30T14:13:33.717018Z","company_name":"HeyGen","company_slug":"heygen","company_logo_url":"https://www.google.com/s2/favicons?domain=heygen.com\u0026sz=128","quality_score":90,"url":"https://aidevboard.com/job/6d89dd83-c08c-4bc0-8753-b6484ed59168"},{"id":"3b9ef003-2e06-4a55-a9d0-1e93cd528a20","company_id":"9bad7e3a-74e6-4dae-87c5-f3e9f0e72bd0","title":"Data Engineer","slug":"data-engineer-ec7e1ff9","description":"The Data team leverages data from our autonomous vehicles and operations to determine autonomy and service readiness. We provide the foundation for strategic decision-making at Zoox. You will develop and implement the next generation of our data pipeline to ensure visibility into our business as we scale toward the launch of an autonomous mobility service. You will define the system and build the pipeline to enable Zoox to develop and scale with a data-first culture.\n \nYou will join a diverse, experienced team with rapidly growing scope and responsibility while also having access to one of the most unique data sets in the autonomous vehicle industry. 
Hence, we are seeking all skill levels to grow with the team.\n","salary_min":180000,"salary_max":230000,"location":"Foster City, CA","workplace":"onsite","job_type":"full-time","experience_level":"senior","tags":["autonomous-vehicles","data-pipeline","data-engineering","data-science"],"apply_url":"https://jobs.lever.co/zoox/c73446f6-3e66-4e3d-a694-f768e86038e9/apply","is_featured":false,"is_sticky":false,"status":"active","published_at":"2025-12-08T20:02:03.735Z","expires_at":"2026-06-29T14:05:47.230708Z","created_at":"2026-04-13T09:41:58.352322Z","updated_at":"2026-05-30T14:05:47.350815Z","company_name":"Zoox","company_slug":"zoox","company_logo_url":"https://www.google.com/s2/favicons?domain=zoox.com\u0026sz=128","quality_score":90,"url":"https://aidevboard.com/job/3b9ef003-2e06-4a55-a9d0-1e93cd528a20"},{"id":"176dc5a9-2f72-4ae8-8fce-7461368ef485","company_id":"a0000000-0000-0000-0000-000000000001","title":"Analytics Data Engineer","slug":"analytics-data-engineer-8e32d185","description":"About Anthropic \n Anthropic’s mission is to create reliable, interpretable, and steerable AI systems. We want AI to be safe and beneficial for our users and for society as a whole. Our team is a quickly growing group of committed researchers, engineers, policy experts, and business leaders working together to build beneficial AI systems.\n About the Role\n As an Analytics Engineer, you will be an early member of the Data Science \u0026 Analytics team building the foundation to scale analytics across our organization. You will collaborate with key stakeholders in Engineering, Product, GTM and other areas to build scalable solutions to transform data into key metrics reporting and insights. You will be responsible for ensuring teams have access to reliable, accurate metrics that can scale with our company’s growth. You will also lead your own projects to enable self-serve insights to help teams make data-driven decisions. 
\n Responsibilities :\n \n Understand the data needs of stakeholder teams in terms of key data models and reporting, and translate that into technical requirements\n Define, build and manage key data pipelines in dbt that transform raw logs into canonical datasets\n Establish high data integrity standards and SLAs to ensure timely, accurate delivery of data\n Develop insightful and reliable dashboards to track performance of core metrics that will deliver insights to the whole company\n Build foundational data products, dashboards and tools to enable self-serve analytics to scale across the company\n Influence the future roadmap of Product and GTM teams from a data systems perspective\n Become an expert in our organization’s data models and the company's data architecture\n \n You might be a good fit if you have: \n \n 5+ years of experience as an Analytics Data Engineer or similar Data Science \u0026 Analytics roles, preferably partnering with GTM and Product leads to build and report on key company-wide metrics.\n A passion for the company's mission of building helpful, honest, and harmless AI.\n Expertise in building multi-step ETL jobs, building robust data models through tooling like dbt; proficiency with workflow management platforms like Airflow and version control management tools through GitHub.\n Expertise in SQL and Python to transform data into accurate, clean data models.\n Experience building data reporting and dashboarding in visualization tools like Hex to serve multiple cross-functional teams.\n A bias for action and urgency, not letting perfect be the enemy of the effective.\n A “full-stack mindset”, not hesitating to do what it takes to solve a problem end-to-end, even if it requires going outside the original job description.\n Experience building an Analytics Data Engineering (or similar) function at start-ups. 
\n A strong disposition to thrive in ambiguity, taking initiative to create clarity and forward progress.\n The annual compensation range for this role is listed below. \n For sales roles, the range provided is the role’s On Target Earnings (\"OTE\") range, meaning that the range includes both the sales commissions/sales bonuses target and annual base salary for the role.\n Annual Salary:\n $275,000 — $370,000 USD \n Logistics \n Minimum education: Bachelor’s degree or an equivalent combination of education, training, and/or experience\n Required field of study:  A field relevant to the role as demonstrated through coursework, training, or professional experience\n Minimum years of experience: Years of experience required will correlate with the internal job level requirements for the position\n Location-based hybrid policy: Currently, we expect all staff to be in one of our offices at least 25% of the time. However, some roles may require more time in our offices.\n Visa sponsorship:  We do sponsor visas! However, we aren't able to successfully sponsor visas for every role and every candidate. But if we make you an offer, we will make every reasonable effort to get you a visa, and we retain an immigration lawyer to help with this.\n We encourage you to apply even if you do not believe you meet every single qualification. Not all strong candidates will meet every single qualification as listed.  Research shows that people who identify as being from underrepresented groups are more prone to experiencing imposter syndrome and doubting the strength of their candidacy, so we urge you not to exclude yourself prematurely and to submit an application if you're interested in this work. We think AI systems like the ones we're building have enormous social and ethical implications. We think this makes representation even more important, and we strive to include a range of diverse perspectives on our team. Your safety matters to us. 
To protect yourself from potential scams, remember that Anthropic recruiters only contact you from @anthropic.com email addresses. In some cases, we may partner with vetted recruiting agencies who will identify themselves as working on behalf of Anthropic. Be cautious","salary_min":275000,"salary_max":370000,"location":"San Francisco, CA","workplace":"hybrid","job_type":"full-time","experience_level":"senior","tags":["data-pipeline","alignment","data-engineering","data-science"],"apply_url":"https://job-boards.greenhouse.io/anthropic/jobs/4956672008","is_featured":false,"is_sticky":false,"status":"active","published_at":"2025-10-17T18:13:10Z","expires_at":"2026-06-29T14:00:10.046748Z","created_at":"2026-05-10T14:00:11.950413Z","updated_at":"2026-05-30T14:00:10.156579Z","company_name":"Anthropic","company_slug":"anthropic","company_logo_url":"https://www.google.com/s2/favicons?domain=anthropic.com\u0026sz=128","quality_score":90,"url":"https://aidevboard.com/job/176dc5a9-2f72-4ae8-8fce-7461368ef485"},{"id":"60b9288c-b278-4210-8785-36ba8e6eaa50","company_id":"ed18bbda-3537-4b44-9295-c7b575fce0ff","title":"Principal Machine Learning \u0026 Data Engineer ","slug":"principal-machine-learning-data-engineer-ea3f6c48","description":"Who we are  \n At Twilio, we’re shaping the future of communications, all from the comfort of our homes. We deliver innovative solutions to  hundreds of thousands of businesses  and empower millions of developers worldwide to craft personalized customer experiences.\n Our dedication to remote-first work , and strong culture of connection and global inclusion means that no matter your location, you’re part of a vibrant team with diverse experiences making a global impact each day. As we continue to revolutionize how the world interacts, we’re acquiring new skills and experiences that make work feel truly rewarding. Your career at Twilio is in your hands. We use Artificial Intelligence (AI) to help make our hiring process efficient. 
That said, every hiring decision is made by real Twilions!\n See yourself at Twilio \n Join the team as Twilio’s next L5 Machine Learning \u0026 Data Engineer to lead the design, build, and operation of the internal ML-and-data platform that powers every customer interaction. You will architect cloud-native pipelines, model-serving infrastructure, and developer tooling that allow Twilio’s product teams to iterate rapidly and safely at scale, advancing our mission to unlock the imagination of builders.\n  \n Responsibilities \n In this role, you’ll:\n \n Architect and evolve Twilio’s end-to-end ML and real-time data platforms for reliability, security, and cost efficiency.\n Design scalable feature stores, streaming and batch pipelines, and low-latency model-serving layers on AWS.\n Implement MLOps best practices—automated testing, CI/CD, monitoring, and rollback—for hundreds of daily deployments.\n Own system design reviews, threat modeling, and performance tuning for high-volume communications workloads.\n Lead cross-functional engineering efforts, breaking down complex initiatives into executable roadmaps.\n Mentor staff and senior engineers, raising the technical bar through code reviews and pair programming.\n Partner with Product, Security, and Compliance to meet stringent privacy and governance requirements (HIPAA, SOC 2, GDPR).\n Champion a culture of experimentation, data-driven decision-making, and continuous improvement.\n \n Qualifications  \n Twilio values diverse experiences from all kinds of industries, and we encourage everyone who
meets the required qualifications to apply. If your career is just starting or hasn't followed a traditional path, don't let that stop you from considering Twilio. We are always looking for people who will bring something new to the table!\n \n Required: \n \n Bachelor’s or higher in Computer Science, Engineering, Mathematics, or equivalent practical experience.\n 7+ years building and operating production data or machine-learning systems at scale.\n Expert fluency in Python and one compiled language (Java, Scala, Go, or C++).\n Hands-on mastery of distributed data frameworks (Spark/Flink), SQL/NoSQL stores, and streaming platforms (Kafka/Kinesis).\n Demonstrated success designing cloud-native architectures on AWS, including Terraform-managed infrastructure.\n Deep knowledge of container orchestration (Kubernetes/EKS), service-mesh networking, and autoscaling strategies.\n Practical experience implementing MLOps tooling such as MLflow, Kubeflow, SageMaker, or Vertex AI.\n Strong grasp of model-lifecycle concerns—feature engineering, offline/online parity, A/B testing, drift detection, and retraining.\n Proven ability to lead technical projects end-to-end and influence without authority across multiple teams.\n Exceptional written and verbal communication skills, with a bias toward clarity and action.\n \n  \n Desired: \n \n Graduate degree focused on machine learning, distributed systems, or applied statistics.\n Contributions to open-source ML or data infrastructure projects.\n Experience with privacy-enhancing technologies (differential privacy, homomorphic encryption) or on-device inference.\n Background in conversational AI, real-time communications, or large-language-model deployment at scale.\n Exposure to compliance-heavy environments (HIPAA, PCI-DSS) and secure multi-tenant design patterns.\n Published research, patents, or conference talks in ML systems or data engineering.\n \n  \n Location \n This role will be remote, but is not eligible to be hired in
CA, CT, NJ, NY, PA, WA.\n  \n Travel  \n We prioritize connection and opportunities to build relationships with our customers and each other. For this role, you may be required to travel occasionally to participate in project or team in-person meetings.\n  \n What We Offer Working at Twi","salary_min":217000,"salary_max":271300,"location":"Remote (US)","workplace":"remote","job_type":"full-time","experience_level":"principal","tags":["distributed-systems","mlops","fine-tuning","healthcare","data-engineering","machine-learning"],"apply_url":"https://job-boards.greenhouse.io/twilio/jobs/7155492","is_featured":false,"is_sticky":false,"status":"active","published_at":"2025-08-08T17:48:51Z","expires_at":"2026-06-29T14:09:09.832267Z","created_at":"2026-04-13T12:08:56.512282Z","updated_at":"2026-05-30T14:09:09.943707Z","company_name":"Twilio","company_slug":"twilio","company_logo_url":"https://www.google.com/s2/favicons?domain=twilio.com\u0026sz=128","quality_score":90,"url":"https://aidevboard.com/job/60b9288c-b278-4210-8785-36ba8e6eaa50"},{"id":"f8682c7a-87d1-455b-9b14-7bf04986597a","company_id":"e09c6869-c06f-4a6b-bf08-dd5cd3138ee1","title":"Backend Engineer, Growth and Data","slug":"backend-engineer-growth-and-data-47a43546","description":"ABOUT HEBBIA\n\nThe AI platform for investors and bankers that generates alpha and drives upside.\n\nFounded in 2020 by George Sivulka and backed by Peter Thiel and Andreessen Horowitz, Hebbia powers investment decisions for BlackRock https://www.blackrock.com/us/individual/about-us/about-blackrock?cid=ppc:blk_us:corpaffairs_us_br_reputationmediamanagement_na_exact_ol:google:brand_nonprod:ol\u0026gclsrc=aw.ds\u0026gad_source=1\u0026gad_campaignid=21584717446\u0026gbraid=0AAAAACc6WDFbRb5bgB6zxVFLE_7yIy25I\u0026gclid=CjwKCAiA9aPKBhBhEiwAyz82J2uDlcIPVsy0fhSZMS_rp_OsGerYzYFPFfLo4TlN8K4eCHWzPfvysRoC7oQQAvD_BwE, KKR https://www.kkr.com/, Carlyle https://www.carlyle.com/, Centerview https://www.centerviewpartners.com/, and 40% of the 
world’s largest asset managers. Our flagship product, Matrix, delivers industry-leading accuracy, speed, and transparency in AI-driven analysis. It is trusted to help manage over $30 trillion in assets globally.\n\nWe deliver the intelligence that gives finance professionals a definitive edge. Our AI uncovers signals no human could see, surfaces hidden opportunities, and accelerates decisions with unmatched speed and conviction. We do not just streamline workflows. We transform how capital is deployed, how risk is managed, and how value is created across markets.\n\nHebbia is not a tool. Hebbia is the competitive advantage that drives performance, alpha, and market leadership.\n\n\n\n\nTHE TEAM\n\nThe Growth and Data team at Hebbia is the engine that fuels the entire AI platform with the data it needs to reason and retrieve. We’re responsible for sourcing, indexing, and enriching content from every corner of the knowledge universe — spanning private enterprise data, public data, and third-party platforms — and delivering it seamlessly to power user workflows and agentic research.\n\nWe build robust integrations with platforms like Snowflake, S3, SharePoint, Dropbox, and beyond, enabling organizations to securely unify their data ecosystems. From discovery and search to retrieval-augmented deep research within chat matrix frameworks, the data we deliver quite literally powers every part of the AI platform.\n\n\n\n\nTHE ROLE\n\nAs a Backend Software Engineer on Hebbia’s Growth and Data team, you will build and maintain the powerful backend systems that drive user engagement and fuel Hebbia's continued expansion. Your role involves architecting and implementing robust APIs, services, and infrastructure that empower customers with tailored, high-value experiences—such as personalized data views, streamlined management tools, and powerful integrations that uniquely amplify the value of Hebbia. 
Collaborating closely with product teams, designers, and frontend engineers, you'll take ownership of core backend features from initial design through deployment, ensuring scalability, reliability, and performance. Your technical expertise, strategic thinking, and proactive problem-solving will directly impact customer success, unlocking innovative solutions that accelerate adoption and growth of Hebbia’s platform.\n\n\n\n\nRESPONSIBILITIES\n\n 1. Own critical system components: Take complex requirements and turn them into robust, scaled solutions that solve real customer needs.\n\n 2. Unlock O(1) universal indexing: Build and iterate on our high-scale document build system that enables constant time latency for indexing any content in the world, regardless of data volume.\n\n 3. Drive performance optimization: Architect and implement performance-tuning solutions to ensure our systems operate efficiently at scale, minimizing latency and maximizing throughput across millions of documents.\n\n 4. Mentor and guide: Provide technical leadership, mentorship, and guidance to junior engineers, fostering a culture of learning and growth.\n\n\n\n\nWHO YOU ARE\n\n - Bachelor's or Master's degree in Computer Science, Data Science, Statistics, or a related field. 
A strong academic background with coursework in data structures, algorithms, and software development is preferred.\n\n - 5+ years software development experience at a venture-backed startup or top technology firm, with a focus on backend software engineering.\n\n - Proficiency in building backend and API systems using technologies such as Python, Java, or Go.\n\n - Extensive experience with cloud platforms (e.g., AWS)\n\n - Working experience with one or more of the following: Kafka, ElasticSearch, PostgreSQL, and/or Redis \n\n - Ability to analyze complex problems, propose innovative solutions, and effectively communicate technical concepts to both technical and non-technical stakeholders.\n\n - Proven experience in leading software development projects and collaborating with cross-functional teams. Strong interpersonal and communication skills to foster a collaborative and inclusive work environment.\n\n - Enthusiasm for continuous learning and professional growth. A passion for exploring new technologies, frameworks, and software development methodologies.\n\n - Embraces rapid prototyping with an emphasis on user fee","salary_min":160000,"salary_max":300000,"location":"New York, 
NY","workplace":"onsite","job_type":"full-time","experience_level":"senior","tags":["llm","agents","rag","search","backend","data-engineering"],"apply_url":"https://jobs.ashbyhq.com/hebbia-ai/1710a563-14df-45c5-a6b1-a62adcdead89/application","is_featured":false,"is_sticky":false,"status":"active","published_at":"2025-07-09T22:17:06.975Z","expires_at":"2026-06-29T14:05:02.185834Z","created_at":"2026-04-13T09:40:59.242443Z","updated_at":"2026-05-30T14:05:02.301893Z","company_name":"Hebbia","company_slug":"hebbia","company_logo_url":"https://www.google.com/s2/favicons?domain=hebbia.ai\u0026sz=128","quality_score":90,"url":"https://aidevboard.com/job/f8682c7a-87d1-455b-9b14-7bf04986597a"},{"id":"f5fd58d4-db31-4a4c-bffb-ee8ec0375455","company_id":"ff51c80a-dce9-4cb4-b2e6-9c060d25ef55","title":"Software Engineer - Kafka","slug":"software-engineer-kafka-934b5e1b","description":"About Applied Intuition\n Applied Intuition, Inc. is powering the future of physical AI. Founded in 2017 and now valued at $15 billion, the Silicon Valley company is creating the digital infrastructure needed to bring intelligence to every moving machine on the planet. Applied Intuition services the automotive, defense, trucking, construction, mining and agriculture industries in three core areas: tools and infrastructure, operating systems, and autonomy. Eighteen of the top 20 global automakers, as well as the United States military and its allies, trust the company’s solutions to deliver physical intelligence. Applied Intuition is headquartered in Sunnyvale, California, with offices in Washington, D.C.; San Diego; Ft. Walton Beach, Florida; Ann Arbor, Michigan; London; Stuttgart; Munich; Stockholm; Bangalore; Seoul; and Tokyo. Learn more at applied.co .\n We are an in-office company, and our expectation is that employees primarily work from their Applied Intuition office 5 days a week. 
However, we also recognize the importance of flexibility and trust our employees to manage their schedules responsibly. This may include occasional remote work, starting the day with morning meetings from home before heading to the office, or leaving earlier when needed to accommodate family commitments. \n About the role \n \n \n As the data engine team, our goal is to provide a central data and machine learning platform that can be used across all verticals of the company. We’re looking for generalist engineers that want to build the foundations of a new workflow that spans:\n \n Data ingestion from a production fleet\n Data processing and storage (TBs/car/day)\n Labeling infrastructure\n Machine learning infrastructure\n \n This group has a  massive scope to define how product verticals across the company are deployed at scale for our customers given data and machine learning infrastructure is at the heart of the autonomy problem. We’re still very early in development so if you are interested in the “0-1” stage of building up a new team that interacts with teams across the company, this is a good fit.\n \n At Applied Intuition, you will: \n \n Design and build large-scale data platforms to support our AI research and autonomy stack development, handling petabytes of multimodal sensor data from real-world driving scenarios\n Work on data curation and tagging platforms that enable efficient dataset discovery, labeling workflows, and quality assessment across diverse driving conditions\n Build high-performance data processing systems using modern distributed computing frameworks to transform raw sensor data into training-ready formats\n Use the following technologies: Apache Spark, Apache Hudi, Trino, Apache Kafka, Flyte, Kubernetes, Python, Golang, Java\n \n We're looking for someone who has: \n \n A Bachelor's degree in Computer Science, Software Engineering, or equivalent\n 2+ years of professional experience\n Strong backend engineering experience\n 
Problem-solving skills and experience working with cross-functional teams in fast-paced environments\n \n Nice to have: \n \n Hands-on experience with modern data stack technologies including Apache Spark, Hudi, Trino, Kafka, or similar distributed data processing frameworks\n Knowledge of data lake architectures, streaming systems, and workflow orchestration platforms like Flyte\n \n Compensation at Applied Intuition for eligible roles includes base salary, equity, and benefits. Base salary is a single component of the total compensation package, which may also include equity in the form of options and/or restricted stock units, comprehensive health, dental, vision, life and disability insurance coverage, 401k retirement benefits with employer match, learning and wellness stipends, and paid time off. Note that benefits are subject to change and may vary based on jurisdiction of employment.\n Applied Intuition pay ranges reflect the minimum and maximum intended target base salary for new hire salaries for the position. The actual base salary offered to a successful candidate will additionally be influenced by a variety of factors including experience, credentials \u0026 certifications, educational attainment, skill level requirements, interview performance, and the level and scope of the position.\n Please reference the job posting’s subtitle for where this position will be located. For pay transparency purposes, the base salary range for this full-time position in the location listed is: $153,000 - $222,000 USD annually.\n Don’t meet every single requirement? If you’re excited about this role but your past experience doesn’t align perfectly with every qualification in the job description, we encourage you to apply anyway. You may be just the right candidate for this or other roles. \n Applied Intuition is an equal opportunity employer and federal contractor or subcontractor. 
Consequently, the parties agree that, as applicable, they will abide by the requirements of 41 CFR 60-1.4(a), 41 CFR 60-300.","salary_min":153000,"salary_max":222000,"location":"Sunnyvale, CA","workplace":"onsite","job_type":"full-time","experience_level":"junior","tags":["distributed-systems","data-pipeline","cloud","data-engineering"],"apply_url":"https://boards.greenhouse.io/appliedintuition/jobs/4584510005?gh_jid=4584510005","is_featured":false,"is_sticky":false,"status":"active","published_at":"2025-07-09T18:23:48Z","expires_at":"2026-06-29T14:03:40.777524Z","created_at":"2026-04-13T09:39:24.361226Z","updated_at":"2026-05-30T14:03:40.891501Z","company_name":"Applied Intuition","company_slug":"applied-intuition","company_logo_url":"https://www.google.com/s2/favicons?domain=appliedintuition.com\u0026sz=128","quality_score":90,"url":"https://aidevboard.com/job/f5fd58d4-db31-4a4c-bffb-ee8ec0375455"},{"id":"97441040-c542-4b27-9b90-33067e17d2d4","company_id":"386fe9d9-0b35-4d37-bdcf-c61d636cf918","title":"Senior Data Engineer","slug":"senior-data-engineer-c2a23312","description":"About EliseAI\n\nAt EliseAI, we're improving the industries that matter most: housing and healthcare. 
Everyone needs a place to live and access to quality healthcare, yet both are often harder to secure than they should be.\n\nBy integrating AI agents deeply into existing workflows, we make them more efficient, reduce costs, and improve the experience for everyone.\n\n\n\n - Housing: We simplify how renters tour apartments, sign leases, submit maintenance requests, and stay connected with their property team—bringing everything they need for their home into one place.\n\n - Healthcare: We make it easy to schedule appointments, complete intake forms, and we help patients communicate with providers, so everyone can focus on health instead of paperwork.\n   \n   \n\nWith EliseAI, organizations reduce manual work, improve accessibility, and deliver a seamless experience across essential services. We recently raised a $250 million Series E round https://www.eliseai.com/blog/eliseai-raises-250m-series-e led by Andreessen Horowitz to accelerate this mission.\n\n\n\nAbout The Role\n\nAs a Senior Data Engineer, you will be a crucial member of our Engineering team, driving our mission to leverage data as a strategic asset. You’ll work closely with cross-functional teams, including product, operations, and analytics, to build out the data infrastructure that powers our decision-making processes. Your work will directly impact how we develop products, optimize our operations, and deliver value to our customers.\n\n\n\nKey Responsibilities\n\nWe are seeking a skilled and experienced Senior Data Engineer to join our team. The ideal candidate will completely own our data pipeline, playing a crucial role in designing, implementing, and maintaining our data infrastructure and ETL pipelines. In addition, in this role, you’ll lead the engineering team in making sure all code processes produce meaningful data that is stored in a performant manner. This data process is integral in powering client facing reporting as well as informing internal decisions and strategy. 
\n\n\n\nCore Objectives\n\n - The role involves building and maintaining high-quality data solutions and large-scale data pipelines that process vast amounts of data\n\n - Optimize data flow and collection for cross-functional teams\n\n - Design core data objects and models for the entire company\n\n - Implement and maintain data quality checks and monitoring systems\n\n - Develop client-facing reporting solutions\n\n - Additionally, the position supports data-driven decision-making through the creation of metrics datasets and dashboards\n\n - Engineers will collaborate across teams to optimize and improve data quality while delivering user-centric experiences\n\n\n\nMove at rocket speed, build something massive.\n\nWe’re scaling fast, solving real client problems with precision and ambition. Here, you own your impact; full autonomy, no micromanagement, no fluff. We hire the best, expect the best, and give you the masterclass of your career. It’s hard, it’s intense, and it’s the most rewarding work you’ll ever do. 
If you’re hungry, driven, and ready to build something massive, climb aboard.\n\n\n\nRequirements\n\nTechnical skills\n\n - 4+ years of professional experience in data engineering\n\n - Programming proficiency in python\n\n - Experience with PostgreSQL or other relational databases\n\n - Proven experience in data modeling and database design\n\n - Experience with ETL design, implementation, and maintenance\n\n - Knowledge of data warehousing solutions, particularly Snowflake\n\n - Understanding of development workflows and best practices\n\nProfessional skills\n\n - Strong problem-solving skills and attention to detail\n\n - Strong communication and collaboration skills\n\n - Ability to work independently and as part of a team\n\n - Excellent organizational and time management skills\n\n - Adaptability to learn and work with new technologies\n\n - Passion for maintaining high data quality standards\n\nPreferred Qualifications\n\n - Experience with data quality management and monitoring tools\n\n - Familiarity with AWS services\n\n - Familiarity with data visualization techniques and tools\n\n - Knowledge of machine learning pipelines and tools\n\n - Experience with real-time data streaming platforms\n\n - Familiarity with DBT, Airflow\n\nMindset\n\n - Entrepreneurial and ambitious; gritty, ‘roll-up-your sleeves’ attitude\n\n - Hardworking\n\n - Action-orientated with the ability to influence others\n\n - Self-starter mentality, yet a team player with a collaborative approach \n\n - Ego-free\n\n - Direct and clear communicator\n\n - Willingness to work in person at our office 4-5 days a week\n\n\n\nWhy join\n\nGrowth and impact. It’s not often that you can get in on the ground floor of a funded (unicorn! https://www.eliseai.com/blog/eliseai-raises-250m-series-e) startup that’s scaling so fast. That means that instead of following a playbook, you’ll be writing it. 
Every single day you will be challenged to identify how we can scale and execute on","salary_min":240000,"salary_max":300000,"location":"New York, NY","workplace":"onsite","job_type":"full-time","experience_level":"senior","tags":["agents","cloud","healthcare","data-pipeline","data-engineering"],"apply_url":"https://jobs.ashbyhq.com/eliseai/59d7a2a6-cb05-456f-bde7-5337f1d88589/application","is_featured":false,"is_sticky":false,"status":"active","published_at":"2025-06-20T22:55:20.126Z","expires_at":"2026-06-29T14:17:33.064746Z","created_at":"2026-04-17T02:26:11.424316Z","updated_at":"2026-05-30T14:17:33.185881Z","company_name":"EliseAI","company_slug":"eliseai","company_logo_url":"https://www.google.com/s2/favicons?domain=eliseai.com\u0026sz=128","quality_score":90,"url":"https://aidevboard.com/job/97441040-c542-4b27-9b90-33067e17d2d4"},{"id":"1547eec7-3eee-4dd5-a84d-aabde5c3293c","company_id":"6ea0f41a-b13e-481a-b410-5195f391f939","title":"Senior Data Engineer","slug":"senior-data-engineer-746a79b0","description":"About the Role \n Together AI is looking for a Senior Data Engineer to help define, build, and operate the data infrastructure that handles millions of events every day to power Together’s mission-critical systems. As a Senior Data Engineer, you will work with our Data and Commerce engineering team to scale the data processing components of Together’s usage-based billing system, real-time customer-facing analytics product, and internal business intelligence tools. 
You will work across both cloud-native services and globally distributed data centers.\n If you thrive in fast-paced environments and have a passion for defining and building early-stage data platforms for a rapidly scaling and data-intensive company, this is for you.\n Requirements \n \n 5+ years of demonstrated experience in building large scale, fault tolerant, distributed data platforms, stream processing pipelines, ETLs, etc\n Expert-level skills in designing, building, and operating stream processing pipelines with services like AWS Kinesis, Apache Kafka, or Redpanda\n Expert-level knowledge of building real-time customer facing analytics systems using services like AWS TimeStream or Clickhouse\n Proficiency in writing and maintaining Infrastructure as Code (IaC) using tools like Terraform, AWS CDK, or Pulumi\n Proficiency in version control practices and integrating IaC with CI/CD pipelines.\n Proficiency in implementing and managing GitOps workflows with tools such as ArgoCD, Github Actions, TeamCity, or similar\n Proficiency in one or more of Golang, Rust, Python, Java, or TypeScript\n Bachelor’s or Master’s degree in Computer Science, Computer Engineering, or a related technical field, or equivalent practical experience\n Experience with Kubernetes, or containers a plus\n \n Responsibilities \n \n Identify, design, and develop foundational data infrastructure components capable of handling millions or billions of events daily\n Analyze and improve the robustness and scalability of existing data processing infrastructure\n Partner with product teams to understand functional requirements and deliver solutions that meet business needs\n Write clear, well-tested, and maintainable infra-as-code and software for both new and existing systems\n Conduct design and code reviews, create developer documentation, and develop testing strategies for robustness and fault tolerance\n Participate in an on-call rotation to address critical incidents when necessary\n \n 
About Together AI \n Together AI is a research-driven artificial intelligence company. We believe open and transparent AI systems will drive innovation and create the best outcomes for society, and together we are on a mission to significantly lower the cost of modern AI systems by co-designing software, hardware, algorithms, and models. We have contributed to leading open-source research, models, and datasets to advance the frontier of AI, and our team has been behind technological advancement such as FlashAttention, Hyena, FlexGen, and RedPajama. We invite you to join a passionate group of researchers and engineers in our journey in building the next generation AI infrastructure.\n Compensation \n We offer competitive compensation, startup equity, health insurance and other competitive benefits. The US base salary range for this full-time position is: $160,000 - $240,000 + equity + benefits. Our salary ranges are determined by location, level and role. Individual compensation will be determined by experience, skills, and job-related knowledge.\n Equal Opportunity \n Together AI is an Equal Opportunity Employer and is proud to offer equal employment opportunity to everyone regardless of race, color, ancestry, religion, sex, national origin, sexual orientation, age, citizenship, marital status, disability, gender identity, veteran status, and more.\n Please see our privacy policy at  https://www.together.ai/privacy","salary_min":160000,"salary_max":240000,"location":"San Francisco, CA","workplace":"onsite","job_type":"full-time","experience_level":"senior","tags":["payments","data-pipeline","cloud","data-engineering"],"apply_url":"https://job-boards.greenhouse.io/togetherai/jobs/4737079007","is_featured":false,"is_sticky":false,"status":"active","published_at":"2025-05-15T19:57:13Z","expires_at":"2026-06-29T14:01:49.453323Z","created_at":"2026-04-13T09:37:38.100914Z","updated_at":"2026-05-30T14:01:49.557926Z","company_name":"Together 
AI","company_slug":"together-ai","company_logo_url":"https://www.google.com/s2/favicons?domain=together.ai\u0026sz=128","quality_score":90,"url":"https://aidevboard.com/job/1547eec7-3eee-4dd5-a84d-aabde5c3293c"}],"page":1,"per_page":20,"total":95,"total_pages":5}
