We are seeking a highly motivated bioinformatician to join Ensembl (www.ensembl.org), a world leading provider of genomics data resources and bioinformatics software tools.
The Darwin Tree of Life (DToL) project plans to sequence, assemble and annotate all 66000 eukaryotic species in the UK. This will offer an unprecedented molecular-level insights into evolution and biodiversity. For the first time we will be able to ask questions on a genomic level of whole ecosystems. Ensembl plans to lead the way in annotating the diverse range of species that the DToL project will encompass.
As part of Ensembl Genebuild, you will join a team of bioinformaticians and developers who are experts in gene structure annotation. Your role will be to help with the effort to expand our annotation pipelines to produce the most high-quality annotation possible across the diverse range of species covered by DToL. Certain taxonomic groups, such as plants and invertebrate metazoa, pose specific challenges for gene annotation including very large evolutionary distances between species, a general lack of well characterised proteins and a sparsity of well annotated references species.
To address these issues, we will be developing new methods for genome annotation that exploit maximum utility from the available data. In particular we will be deploying and building upon the latest software for mapping and correcting long and short read transcriptome data sets. We will add hint-guided ab initio methods to our pipelines to help annotate data-sparse genomes. We will also examine the limits for cross-species mapping of both nucleotide and protein data.
The Ensembl Genebuild team have domain area expertise in software development, large-scale compute, big data, pipeline workflows and automation. We collaborate with consortia and communities from all over the world to annotate new genomes. Current and future projects for the team include: scaling up gene annotation pipelines, implementing machine learning techniques to improve gene structure annotation, working with long-read (PacBio IsoSeq, Nanopore) transcriptomic data to extending existing annotations and identify new splice variants.
Your main responsibility will be the development and deployment our large-scale annotation system to produce high-quality gene annotation. More specifically, you will:
- Produce high-quality, evidence-based gene sets for species across the eukaryotic tree-of-life, including protein-coding genes, noncoding RNA genes and pseudogenes
- Contribute significantly to the design of new annotation methods for non-vertebrate genomes
- Work in a release-based environment and coordinate with other teams
- Collaborate with international partners on genome projects
- Participate in training users in our annotation methods and workflows
- Work with state-of-the-art primary data to help improve gene structure annotation
You should hold an MSc, PhD or equivalent experience in Computer Science, Bioinformatics, Genetics or a related field.
You will be to write, understand and maintain complex code. You will also have domain experience in some of the following:
- Genome annotation
- Methods for DNA/RNA sequencing and sequence alignment
- Relational databases
- Scaling and optimisation
- Machine learning
- Non-vertebrate biology
You will have good communication and interpersonal skills, and be a self-starter who can manage their own time to meet the needs of several projects. The key attributes sought are the ability to work in a team, excellent attention to detail, solid problem-solving skills, and the desire to learn and improve. Furthermore, you should demonstrate your ability to communicate both biological and computational ideas (orally and in writing), time management to deadlines, and a desire to work in an international environment.
You might also have
Previous experience of processing large biological data sets in a production environment would be advantageous, including an understanding of compute clusters, pipeline workflows, software design and automation. Evidence of working in a dynamic, team-based environment or contributing to a large, shared code-base is desirable.
Why join us
At EMBL-EBI, we help scientists realise the potential of ‘big data’ in biology by enabling them to exploit complex information to make discoveries that benefit mankind. Working for EMBL-EBI gives you an opportunity to apply your skills and energy for the greater good. As part of the European Molecular Biology Laboratory (EMBL), we are a non-profit, intergovernmental organisation funded by 22 member states and two associate member states. We are located on the Wellcome Genome Campus near Cambridge in the UK, and our 600 staff are engineers, technicians, scientists and other professionals from all over the world.
EMBL is an inclusive, equal opportunity employer offering attractive conditions and benefits appropriate to an international research organisation. The remuneration package comprises a competitive salary, a comprehensive pension scheme and health insurance, educational and other family related benefits where applicable, as well as financial support for relocation and installation. For more information about pay and benefits click here
We have an informal culture, international working environment and excellent professional development opportunities but one of the really amazing things about us is the concentration of technical and scientific expertise – something you probably won’t find anywhere else.
If you’ve ever visited the campus you’ll have experienced first-hand our friendly, collegial and supportive atmosphere, set in the beautiful Cambridgeshire countryside. Our staff also enjoy excellent sports facilities including a gym, a free shuttle bus, an on-site nursery, cafés and restaurant and a library.
What else you need to know:
To view a copy of the full job description please click here: https://www.embl.de/jobs/searchjobs/index.php?ref=EBI01527&newlang=1&src...
To apply please submit a covering letter and CV through our online system.
Applications are welcome from all nationalities - visa information will be discussed in more depth with applicants selected for interview.
EMBL-EBI is committed to achieving gender balance and strongly encourages applications from women, who are currently under-represented at all levels. Appointment will be based on merit alone.
The initial contract is for a period of three years with the possibility of a fixed-term extension.
Applications will close at 23:00 British time on the date listed above.