NLP Data Scientist

Address: EMBL-EBI - European Bioinformatics Institute
Category: Scientific

Job description

Organisation data: Chemogenomics
Job Number: EBI02211
Contract Type: Staff Member
Contract Duration-Length of Time (years/months): 3 years (Project based contract)
Advertised Grade-Grading: Grade 5 or 6 (£3,090 or £3,456 per month after tax) plus benefits depending on personal circumstances
Closing date: 16 February 2024

About the team/job

We are looking for an enthusiastic and talented (NLP) Data Scientist to join the newly initiated AI knowledge management project, initially for a period of 3 years. This position will be situated in the Chemical Biology Services Team, which also develops and delivers several globally-recognised resources including ChEMBL, ChEBI, SureChEMBL, and UniChem. We are particularly interested in applications of AI and machine learning to mine the research literature for additional types of entities relevant to drug discovery that are not currently available to Open Targets (OT), such as variants, biomarkers, tissues/cell types, adverse events, and assay conditions. This position provides a real opportunity to make a significant impact on a critical problem in drug discovery for the many users of the OT Platform, and to contribute to the open source models and code associated with biological and drug discovery entities.

You will be embedded into a multi-disciplinary project team that also includes machine learning data experts and Data Scientists/engineers. You will need to be able to demonstrate the ability to work well with colleagues and to collaborate with external partners. You must have excellent communication and interpersonal skills and enjoy working in a stimulating, international environment.

Your role

  • Collect benchmark data sets from the open domain as training sets for NLP models;
  • Collect specifications, test prototypes and deliver tools for benchmarking;
  • Develop and utilise statistically robust methods for data analysis and benchmarking;
  • Work with other team members to find and use suitable pre-trained NLP models from the public domain (e.g., HuggingFace);
  • Work with other team members to retrain publicly available NLP models on the open scientific literature available in Europe PMC to ensure they are optimised for the project's need;
  • Support the team to modernise and extend the current entity recognition workflows to cover an array of additional types of entities relevant to drug discovery;
  • Support the team in developing new machine learning, deep learning or NLP protocols to enhance curation workflows;
  • Analyse the newly extracted entity relationships as part of specific use cases (e.g., explore their scientific value);
  • Collaborate with the OT partners to assess, prioritise, validate and refine the developed methods;
  • Work closely with the OT core team for the seamless integration of data and workflows into the OT Platform;
  • Actively disseminate the outcomes of the project to the scientific community and stakeholders through well-crafted presentations and publications.

You have

  • Advanced degree (MSc, PhD) in biology, biomedical sciences or related discipline;
  • Proficiency in at least one modern programming/scripting language (e.g. Python);
  • Experience of biological data curation and knowledge of bioinformatics databases;
  • Experience with advanced big data preprocessing, cleaning, and transformation techniques specific to textual data including ontologies;
  • Good understanding of statistical methods and their application to data analysis and use of data visualisation tools and libraries (such as Matplotlib, Seaborn) to effectively communicate data insights;
  • Excellent attention to detail;
  • Strong communication skills, both in presentations and verbally;
  • Experience working in a team-oriented environment;
  • Able to work independently, to manage your time and work to deadlines.

You might also have

  • Experience working in a drug discovery and development environment;
  • Proficiency in using text analytics methods and/or machine learning tasks;
  • Knowledge of version control systems (e.g., GitHub);
  • Knowledge and practical experience with bioinformatics methods including systems biology and genetics analysis.

Why join us

Do something meaningful

At EMBL-EBI you can apply your talent and passion to accelerate science and tackle some of humankind's greatest challenges. EMBL-EBI, part of the European Molecular Biology Laboratory, is a worldwide leader in the storage, analysis and dissemination of large biological datasets. We provide the global research community with access to publicly available databases and tools which are crucial for the advancement of healthcare, food security, and biodiversity.

Join a culture of innovation

We are located on the Wellcome Genome Campus, alongside other prominent research and biotech organisations, and surrounded by beautiful Cambridgeshire countryside. This is a highly collaborative and inclusive community where our employees enjoy a relaxed atmosphere. We are committed to ensuring our employees feel valued, supported and empowered to reach their professional potential.

Enjoy lots of benefits

  • Financial incentives: Monthly family, child and non-resident allowances, annual salary review, pension scheme, death benefit, long-term care, accident-at-work and unemployment insurances;
  • Flexible working arrangements;
  • Private medical insurance for you and your immediate family (including all prescriptions and generous dental & optical cover);
  • Generous time off: 30 days annual leave per year, in addition to eight bank holidays;
  • Relocation package including installation grant (if required);
  • Campus life: Free shuttle bus to and from work, on-site library, subsidised on-site gym and cafeteria, casual dress code, extensive sports and social club activities (on campus and remotely);
  • Family benefits: On-site nursery, 10 days of child sick leave, generous parental leave, holiday clubs on campus and monthly family and child allowances;
  • Benefits for non-UK residents: Visa exemption, education grant for private schooling, financial support to travel back to your home country every second year and a monthly non-resident allowance.

For more details please see our employee benefits page.

What else you need to know

  • Contract duration: This position is a project-based 3-year contract, which will expire in June 2027;
  • International applicants: We recruit internationally and successful candidates are offered visa exemptions. Read more on our page for international applicants;
  • Diversity and inclusion: At EMBL-EBI, we strongly believe that inclusive and diverse teams benefit from higher levels of innovation and creative thought. We encourage applications from women, LGBTQ+ individuals, and people of all nationalities;
  • EMBL is a signatory of DORA. Find out how we implement best practices in research assessment in our recruitment processes here;
  • Job location: This role is based in Hinxton, near Cambridge, UK. You will be required to relocate if you are based overseas and you will receive a generous relocation package to support you;
  • How to apply: To apply please submit a cover letter and a CV through our online system. Panel interviews are planned to take place on 25th March.

Don't forget to mention EuroScienceJobs when applying.
Refer code: 2631554. NLP Data Scientist - The previous day - 2024-01-27 05:43
