Sonatype

Senior Data Scientist

Sonatype  •  Hyderabad, IN (Onsite)  •  14 days ago
Apply
AI can make mistakes so check important info. Chat history is never stored.
65
AI Success™

Job Description

Sonatype is the software supply chain security company. We provide the world’s best end-to-end software supply chain security solution, combining the only proactive protection against malicious open source, the only enterprise grade SBOM management and the leading open source dependency management platform. This empowers enterprises to create and maintain secure, quality, and innovative software at scale.
As founders of Nexus Repository and stewards of Maven Central, the world’s largest repository of Java open-source software, we are software pioneers and our open source expertise is unmatched. We empower innovation with an unparalleled commitment to build faster, safer software and harness AI and data intelligence to mitigate risk, maximize efficiencies, and drive powerful software development.
More than 2,000 organizations, including 70% of the Fortune 100 and 15 million software developers, rely on Sonatype to optimize their software supply chains.

The Opportunity       We’re looking for a Senior Data Scientist to join our growing AI & Data Science team.       You’ll operate as an internal AI consultant and technical lead, helping multiple teams across Sonatype apply machine learning and generative AI to real-world problems.       You’ll explore complex datasets, design experiments, build models, and collaborate closely with product engineering, and security experts to turn research ideas into practical, scalable solutions.       This role is ideal for someone who thrives on autonomy, loves translating ambiguous ideas into working systems, and enjoys working across boundaries rather than in a single product lane.

What You’ll Do

  • Lead applied AI projects from concept to impact — prototype, validate, and help teams deploy practical ML and GenAI solutions.
  • Collaborate cross-functionally: Partner with product, engineering, and research teams to scope problems, identify opportunities, and co-develop solutions.
  • Act as an internal consultant: Advise teams on ML/AI best practices, model evaluation, and productive use of generative technologies.
  • Design robust experiments and establish evaluation pipelines for model reliability, accuracy, and business impact.
  • Bridge research and production: Package research insights into usable APIs, tools, or workflows for other teams.
  • Explore new techniques (e.g., LLMs, embeddings models, retrieval-augmented generation, agentic workflows) to enhance developer and security experiences.
  • Share knowledge and mentor peers, helping elevate the organization’s AI literacy and capabilities.

What We’re Looking For

  • 6+ years of experience in applied data science, machine learning, or AI research
  • Strong Python skills and hands-on experience with ML/AI libraries and platforms such as Databricks, OpenAI API, and Scikit-learn
  • Comfortable working with large, messy, or unstructured datasets — you know how to turn chaos into features, insights, and beautiful visualizations
  • Deep familiarity with LLMs and GenAI ecosystems (e.g. OpenAI, Claude, Hugging Face): skilled in prompt engineering, parameter tuning, and evaluating model behavior against ground truth
  • Experience taking ML or GenAI systems from prototype to production, even if small-scale or incremental
  • Strong analytical thinking, experimentation skills, and appreciation for trustworthy, data-driven evaluation
  • Proficiency with Git and collaborative code workflows (GitHub or similar)
  • A balanced mindset — equally comfortable exploring research ideas and implementing production-ready systems
  • Proactive and self-directed: you don’t wait for perfect specs; you find meaningful problems and drive them to completion

Bonus Points

  • Experience with AI-assisted coding tools (Copilot, Claude Code, Codex, etc.)
  • Familiarity with agentic workflows, Model Context Protocol (MCP), and tool-use integrations
  • Exposure to cybersecurity, anomaly detection, or code analysis
  • Understanding of MLOps practices (MLflow, AWS SageMaker, model serving, or monitoring)
At Sonatype, we value diversity and inclusivity. We offer perks such as parental leave, diversity and inclusion working groups, and flexible working practices to allow our employees to show up as their whole selves. We are an equal-opportunity employer, and we do not discriminate on the basis of race, religion, color, national origin, gender, sexual orientation, age, marital status, veteran status, or disability status. If you have a disability or special need that requires accommodation, please do not hesitate to let us know.

Sonatype

About Sonatype

The Sonatype journey started 15 years ago, just as the concept of “open source” software development was gaining steam. From our humble beginning as core contributors to Apache Maven, to supporting the world’s largest repository of open source components (Central), to distributing the world's most popular repository manager (Sonatype Nexus Repository), we’ve played a meaningful role in helping the world embrace the power of open innovation.

Over time, we witnessed the staggering volume and variety of open source libraries that began flowing into every development environment in the world. We understood that when open source components are properly managed, they provide a tremendous energy for accelerating innovation. Conversely, when unmanaged, open source "gone wild"​ can lead directly to security vulnerabilities, licensing risks, enormous rework, and waste.

Our vision today is simple.

We are laser focused on helping organizations continuously harness all of the good that open source has to offer, without any of the risk. In order to do this, we have invested in knowing more about the quality of open source than anyone else in the world. This investment takes the form of machine learning, artificial intelligence, and human expertise, which in aggregate produces highly curated intelligence that is infused into every Sonatype product. Organizations equipped with Sonatype products make better decisions, innovate faster at scale, and rest comfortably knowing that their applications always consist of the highest quality open source components.

Industry
IT & Software
Company Size
501-1,000 employees
Headquarters
Fulton, MD
Year Founded
2008
Social Media