Return to jobs Return to jobs

Principal Data Engineer

BitBio

Clock

Posted over 30 days ago...

Expired

Join BitBio as they are looking for a Principal Data Engineer

Overview

icon Salary

No salary declared 😔

icon Location

Cambridge, UK

icon Nomad Friendly?
Tick

98% Remote- UK

icon Expires

Expires at anytime

Organisation summary

bit.bio is a cutting-edge biotech company emerging from the University of Cambridge. Specializing in the fusion of synthetic and stem cell biology, we aim to revolutionize human cell reprogramming for research, drug discovery, and cell therapy. We're driven by the mission to craft the future of medicine and are on the lookout for innovative minds eager to contribute to the advancement of science and therapeutics. Our culture thrives on science, collaboration, openness, curiosity, creativity, and mutual respect.

Role Summary

  • Construct and manage data pipelines for scientific and bioinformatics teams.
  • Design data architectures, models, and storage solutions, ensuring robust data infrastructure.
  • Utilize data analytics and business intelligence tools for insightful decision-making.
  • Implement data quality management and compliance with standards like ISO and GxP.
  • Advise on data analysis infrastructure and develop APIs for dataset accessibility.
  • Manage data storage and establish archival processes.
  • Monitor data and infrastructure usage through dashboards.

Role Requirements

  • Bachelor's degree in Computer Science, Mathematics, Statistics, or related field, or equivalent experience.
  • Proven expertise in database development and data management, particularly in the Life Sciences sector.
  • Strong collaboration skills and a data-centric problem-solving approach.
  • Proficiency in Linux, AWS, big data tools (e.g., Hadoop, Spark), and data workflow tools (e.g., Airflow).
  • Experience with object-oriented and scripting languages like Python, JavaScript, Scala.
  • Understanding of FAIR principles and experience with GxP compliant systems.
  • Beneficial to have experience in biotech scale-ups, team management, and BI tools.

bit.bio is an award-winning spinout from the University of Cambridge. Our breakthrough technology combines synthetic and stem cell biology for the precise, efficient and consistent reprogramming of human cells used in research, drug discovery, and cell therapy. At bit.bio, we are passionate about engineering human cells that will enable the medicine of the future. To do this we need talented and curious people who want to make an impact on the future of science and therapeutics.

As a team of individuals, we value science, collaboration, openness, curiosity and creativity. We are united by trust and respect for each other.

Location: Babraham Research Campus, Cambridge

Type: Full time, permanent 

Start: Immediate

Salary: Competitive / Hours: 40 p/w

Office based position (hybrid working available)

Your role in our team:

We are looking for a highly experienced and passionate Principal Data Engineer to partner with and support our Scientists and Bioinformaticians by addressing their data management challenges. In this highly collaborative role you will help acquire, curate, store and retrieve any type of data and metadata required by the teams. You will design the databases required by different teams to manage and make their data searchable.

As Principal Data Engineer you will support the development of a data pipeline for continuous ingestion of new datasets from external/internal sources into the relevant databases. This will involve the Principal Data Engineer working closely with the Bioinformaticians to set up a scalable analysis platform, integrated with API calls to the dataset.

Your key responsibilities will include:

  • Develop pipeline to acquire and ingest data required by the Scientists and the Bioinformaticians
  • Develop and oversee the design of data architectures, including data models, data integration, and data storage solutions. Collaborate with the IT team to implement robust data infrastructure and ensure seamless data flow across systems.
  • Drive the use of data analytics and business intelligence tools to derive valuable insights from data. Work closely with Scientists and Bioinformaticians to develop advanced analytics solutions. Promote data-driven decision-making across the organization.
  • Implement data quality management processes to identify and rectify data errors or inconsistencies. Monitor data quality metrics and take corrective actions as needed. Ensure data accuracy and reliability.
  • Ensure compliance with relevant standards such as ISO and GxP and data security best practice
  • Advise on optimized data analysis infrastructure and resources
  • Develop libraries and APIs for access to the dataset
  • Provide secure access to data and resources
  • Manage storage of raw data set and setup archival process
  • Determine best practices for data management and storage
  • Monitoring data, lab equipment and IT infrastructure usage, with dashboard

You…

  • Have a Bachelors’ degree in a field with a strong quantitative and informatics aspect (such as Computer Science/ Mathematics/ Statistics) or equivalent experience, followed by significant experience developing and working with a variety of databases and data sets, preferably within the Life Science sector
  • Have demonstrable experience independently supporting the data needs of small groups of data scientists and informaticians 
  • Are a strong collaborator, used to working cross-functionally across all levels within a growing organisation
  • Have a deep interest in data as a resource, with instinct and passion for supporting missions that are data-driven and depend on getting data provisioning right
  • Have a proactive problem-solving approach and a high level of initiative, along with good ability to organise yourself well at work
  • Have excellent written and verbal communication skills

With essential experience in…

  • Significant experience developing and working with a variety of databases and data sets
  • Additional extensive, deep experience in data engineering with demonstrable experience of;
    • Developing/ optimising high-volume data pipelines, large datasets and big-data architectures
    • Successfully and independently serving data needs of a small group of data scientists/ informaticians and other users of data
    • Performing analysis on at least a couple of types of data sets to understand their properties and advising end-user teams on their value
    • Successfully building processes for transforming data, creating unique data structures to suit end uses, ensuring sufficiency of metadata, and developing methods for automated delivery of data sets (software tools, APIs)
    • Working on building and using data stores in AWS
    • Working with a variety of stakeholders and cross-functional teams, performing analysis of their data requirements and documenting it
    • Working and programming in the Linux environment
    • Managing multiple requests and priorities in a fast paced environment
    • Big data tools and stream-processing systems such as: Hadoop, Spark, Kafka, Storm, Spark-Streaming
    • Relational SQL and NoSQL databases, including Postgres and Cassandra.
    • Data pipeline and workflow management tools: Luigi, Airflow, etc.
    • AWS cloud services: EC2, S3, Glue, Athena, API Gateway, Redshift
    • Experience with object-oriented and scripting languages: Python, JavaScript, Scala, etc.
    • Designing and building APIs (RESTful, etc.)
    • Ontologies such as Gene Ontology, and ontological modelling tools and editors such as W3C Wiki, Basic Formal Ontology, etc.
    • Understanding of FAIR principles
    • Experience having built GxP systems

and possibly...

  • Experience working within a rapidly expanding, scale up biotech environment
  • Experience in the direct management of small teams
  • Practical experience working to ISO and GxP standards
  • Experience with BI tools such as Power BI, Kibana and AWS Quicksight
  • Exposure to cost optimisation in relation to data management

More reasons to join us:

bit.bio provides a vibrant and dynamic work environment in an exciting, fast-moving time for biology. We work with cutting edge technologies and with our world-leading scientific advisory board. We conduct pioneering work with real-world impact.

We trust our people to make significant contributions early on with opportunities to be involved in projects that are key to the success and growth of our young company. We invest in people, creating opportunities for personal development in an inclusive multi-skilled team with ambitious goals that provide opportunities to learn on the job from each other.

Creativity and open minds are encouraged for everyone to contribute to the success of the company.

For information on how we will manage your data please see our Candidate Privacy Notice

Medal
Computer

FOR ORGANISATIONS

Your progressive people partner

Post your jobs, become a Top 1% Employer and more. We work with organisations who aspire to do things differently.

Learn More
*** 🚨 Announcing Top 1% Employer: Escape Verified 💥 ***