About the TeamTheVelankar teammaintains macromolecular structure databases that form essential resources for biologists and other life scientists worldwide. PDBe is a founding partner of the Worldwide Protein Data Bank organisation, which maintains the global archive of 3D structural data on macromolecules the Protein Data Bank (PDB). The PDBe team also develops the PDBe Knowledge Base (PDBe-KB) and AlphaFold Protein Structure Database (AFDB). The PDBe team is international and inter-disciplinary and consists of expert data curators, bioinformaticians, scientific software developers and IT specialists.Your roleWe seek a skilled and motivated Data Engineer to join our dynamic team. As a Data Engineer, you will play a crucial role in optimising and enhancing our data pipelines, ensuring efficient data processing, storage and retrieval. You will work closely with cross-functional teams to analyse requirements, propose new data pipeline architectures, and implement solutions to improve performance and scalability.The tasks for this post include the following:Analyse existing data pipelines and identify areas for improvement, optimisation, and scalability.Work closely with Bioinformaticians and annotators to integrate data pipelines with existing systems and applications.Monitor data pipeline performance, troubleshoot issues, and implement solutions to ensure reliability and efficiency.Stay current with industry trends and best practices in data engineering and recommend new technologies or tools to enhance data infrastructure.Document data pipelines, processes, and workflows for internal reference and knowledge sharing.Join us in shaping the future of structural biology data. In this role, youll use your IT skills and creative ideas to support and scale vital resources like the PDB, PDBe, PDBe-KB and AFDBensuring they remain robust, sustainable, and ready for tomorrows scientific challenges.You haveMSc in computer science, IT or a related field, or in bioinformatics with a demonstrated IT expertiseExpert in Data Modelling and Advanced SQLProficiency in Python programmingProficiency in ETL (Extract, Transform, Load) processes and tools for large-scale data processing.Strong understanding of relational databasesStrong understanding of relational databases with hands-on experience across multiple RDBMS platforms:PostgreSQL: Deep knowledge of PostgreSQL database architecture, performance tuning, partitioning strategies, indexing techniques, and query optimisationOracle: Extensive experience with Oracle databases, including PL/SQL, Oracle-specific features, and performance optimisationMySQL/MariaDB: Familiarity with alternative RDBMS platforms for data migration and compatibility scenariosExperience with database migrationProven experience in migrating databases between different RDBMS platforms, specifically:Oracle to PostgreSQL migration: Hands-on experience with Oracle to PostgreSQL migration projects, including understanding of compatibility layer (pg_proguard), data type mapping, stored procedure conversion, trigger migration, and handling Oracle-specific features in PostgreSQLData migration best practices: Experience with migration tools such as Oracle Data Pump, GoldenGate, custom ETL scripts, and data validation strategiesMigration planning: Ability to plan and execute migration projects, including downtime management, data consistency verification, and rollback strategiesCross-platform optimisation: Knowledge of leveraging PostgreSQL features to improve performance during migration scenariosProficiency in data warehousing (Redshift, BigQuery)Strong communication and collaboration skills, with the ability to work effectively in a team environment.Proficiency in oral and written EnglishYou might also havePhD in computer science, IT or a related field, or in bioinformatics with a demonstrated IT expertiseExperience in big data technologies and frameworks, such as Apache Spark, Hadoop or similar platformsHands-on experience with CI/CD (GitLab CI/GitHub Actions)Familiarity with JavaFamiliarity with Google Cloud Platform or AWSFamiliarity with data modelling techniques for AI (Artificial Intelligence) and ML (Machine Learning) applicationsFamiliarity with Neo4J or other graph databases is an added advantageFamiliarity with data visualisation (Tableau, PowerBI)Knowledge of, or affinity with, structural biology and bioinformaticsExperience working in international teamsApply now! Benefits and Contract InformationFinancial incentives: depending on circumstances, monthly family/marriage allowance of £278 monthly child allowance of £336 per child. Non resident allowance up to £569 per month. Annual salary review, pension scheme, death benefit, long-term care, accident-at-work and unemployment insurancesHybrid working arrangementsPrivate medical insurance for you and your immediate family (including all prescriptions and generous dental & optical cover)Generous time off: 30 days annual leave per year, in addition to eight bank holidaysRelocation package including installation grant (as applicable)Campus life: Free shuttle bus to and from work, on-site library, subsidised on-site gym and cafeteria, casual dress code, extensive sports and social club activities (on campus and remotely)Family benefits: On-site nursery, child sick leave, generous parental leave, holiday clubs on campus and monthly family and child allowancesContract duration: This position is a 3 year grant based contractSalary: Monthly salary starting at £3,303after tax (but excl. pension & insurances) + benefits (Total package will be dependent on family circumstances)International applicants: We recruit internationally and successful candidates are offered visa exemptions. Read more on our page for international applicants.Diversity and inclusion: At EMBL-EBI, we strongly believe that inclusive and diverse teams benefit from higher levels of innovation and creative thought. We encourage applications from women, LGBTQ+ and individuals from all nationalities.Job location: This role is based in Hinxton, near Cambridge, UK. You will be required to relocate if you are based overseas and you will receive a generous relocation package to support you.To apply, please submit a covering letter and CV via our online system. Applications will close on 06/05/2026.JBRP1_UKTJ