Deep learning extends de novo protein modelling coverage of genomes using iteratively predicted structural constraints
More about Open Access at the Crick

Authors list

Joe Greener Shaun Kandathil David T Jones

Abstract

The inapplicability of amino acid covariation methods to small protein families has limited their use for structural annotation of whole genomes. Recently, deep learning has shown promise in allowing accurate residue-residue contact prediction even for shallow sequence alignments. Here we introduce DMPfold, which uses deep learning to predict inter-atomic distance bounds, the main chain hydrogen bond network, and torsion angles, which it uses to build models in an iterative fashion. DMPfold produces more accurate models than two popular methods for a test set of CASP12 domains, and works just as well for transmembrane proteins. Applied to all Pfam domains without known structures, confident models for 25% of these so-called dark families were produced in under a week on a small 200 core cluster. DMPfold provides models for 16% of human proteome UniProt entries without structures, generates accurate models with fewer than 100 sequences in some cases, and is freely available.

Journal details

Journal Nature Communications

Volume 10

Issue number 1

Pages 3977

Publication date 4 September 2019

Full text links

Publisher website (DOI) 10.1038/s41467-019-11994-0

Figshare View on figshare

Download Download full text [pdf]

Europe PubMed Central 31484923

Pubmed 31484923

Research case studies

Postdoctoral clinical fellows

Hello Brain!

News and reports

About us

Deep learning extends de novo protein modelling coverage of genomes using iteratively predicted structural constraints
More about Open Access at the Crick

Authors list

Abstract

Journal details

Full text links

Keywords

Crick labs/facilities

Genetic clues reveal lung cancer's next move

Ancient genomes reveal immunity adaptation in early farmers

4,000-year-old plague DNA found – the oldest cases to date in Britain

Study of ancient dog DNA traces canine diversity to the Ice Age

The Francis Crick Institute is a unique partnership between

Research case studies

Postdoctoral clinical fellows

Hello Brain!

News and reports

About us

Deep learning extends de novo protein modelling coverage of genomes using iteratively predicted structural constraints More about Open Access at the Crick

Authors list

Abstract

Journal details

Full text links

Keywords

Crick labs/facilities

Related content

Genetic clues reveal lung cancer's next move

Ancient genomes reveal immunity adaptation in early farmers

4,000-year-old plague DNA found – the oldest cases to date in Britain

Study of ancient dog DNA traces canine diversity to the Ice Age

Share the page

The Francis Crick Institute is a unique partnership between

Deep learning extends de novo protein modelling coverage of genomes using iteratively predicted structural constraints
More about Open Access at the Crick