Data Scientist
full-time
junior
Posted 1 month ago
About this role
ABOUT SEMGREP
Semgrep, the leader in code security for builders, empowers invention without friction. Teams catch, flag, and fix real issues before they ship, powered by security that learns as they build. Semgrep secures code as it’s written and provides guardrails that pave the road for developers to move fast and stay secure. Built for builders and trusted by security, Semgrep lives where developers work, delivering fixes without breaking flow, and giving security teams visibility, control, and confidence. Semgrep gets smarter as you build, with AI that learns your context to cut false positives and prioritize reachable vulnerabilities, validated by 95% of security reviewers across 6M+ findings. Semgrep makes zero false positives a reality with AppSec teams triaging 80% fewer false positives across Code and Supply Chain, dramatically shrinking the backlog.
Founded in San Francisco and backed by Menlo Ventures, Felicis Ventures, Lightspeed Venture Partners, Redpoint Ventures, and Sequoia Capital, Semgrep is recognized by Gartner in Application Security Testing and is trusted by leading organizations, including Snowflake, Dropbox, and Figma. Learn more at semgrep.dev http://semgrep.dev.
ABOUT THE ROLE
You will be an early member of Semgrep’s data team. Your mission will be to define how an entire company uses data, always striving to best improve our users’ security. You will work on a diverse set of problems, touching every aspect of the startup: extracting product insights from usage metrics, determining business strategy from market data, crafting production data pipelines, and defining where to direct our security research. This is a growth role: while you will start as an individual contributor, initially contributing to a quarter-long project while growing your technical skill set and domain knowledge to start taking on more responsibility and influencing data and business decisions within the company/
Along the way, you will work with a dedicated group of full-stack, backend, and infrastructure engineers, as well as security researchers and program-analysis developers. You will learn what it means to have “secure-by-default” code. You will meet and collaborate with security-industry scions. And, as a member of our team, you’ll be a part of the decisions that make a high-growth startup successful. Your work will be critical to our mission. Every feature you build will have a measurable impact on our users’ lives. We’re excited to see what you do.
WHAT YOU'LL DO
- Contribute to specific data science projects and initiatives at Semgrep; discovering each department’s most pressing data problems, and proactively identifying the most critical areas to focus your efforts
- Bring your wide knowledge of data-science approaches to each problem you solve: the first day you might build a dashboard to track Board level metrics for the Engineering team, the second you might apply multivariate regression to identify important product features, the third you might apply active-learning techniques to guide data collection and labeling
- Iteratively tackle problems as a series of experiments, proving the value of your work with proof-of-concept to ever more refined results
- Convince your peers of your conclusions with clear data visualizations and well-reasoned explanation
- Help grow your team through the recruitment and hiring of top data talent
YOU ARE IDEAL FOR THIS ROLE IF YOU HAVE
- 2+ years of experience in data and strategy fields
- Knowledge of data-science approaches; this may include machine-learning algorithms, optimization methods or symbolic artificial-intelligence, but should also include statistical methods and “good-enough” heuristics — and the taste to know when to use each
- Experience clearly visualizing information and experimental results across the full company stack: Board-level, leadership team, and individual team leads
- Sufficient familiarity with production data-processing pipelines to construct them working together with generalist infrastructure engineers; tools we use include S3, FiveTran, DBT, Snowflake, Metabase, Retool, Sagemaker/JupyterNotebook (Python)
- Aptitude delivering technical projects via rapid iterative development
- Experience working on a small team in a fast-paced environment and are willing to experiment with different approaches before settling on the best and most elegant solution given time constraints
- Excellent, proactive communication, both verbal and written
SOME EXAMPLE PROJECTS THAT YOU MIGHT WORK ON INCLUDE
- Build a client-facing dashboard showing scan time metrics over time to show how the product is improving
- Work together with Product leadership to identify the correct north-star metrics to measure Product usage and what features to build next
- Partner with the rule-writing team to identify the most impactful rules and languages to focus on in real-time
- Build out cleaned/medallio
Similar Jobs
Related searches:
Hybrid Jobs
Junior Jobs
Hybrid Junior Jobs
Junior Data EngineeringJunior Generative AIJunior Data ScienceJunior Machine LearningJunior Fintech & Payments AIJunior AI Safety & Security
AI Jobs in San Francisco
Data Engineering in San FranciscoGenerative AI in San FranciscoData Science in San FranciscoMachine Learning in San FranciscoFintech & Payments AI in San FranciscoAI Safety & Security in San Francisco
paymentsdata-pipelinefine-tuningsecuritydata-science