Welcome, visitor! [ Login

 

Ding C.H., I. Dubchak. “Multi–class protein fold recognition using support vector machines and neural networks”. Bioinformatics 17,2001,pp. 349–358.

  • Listed: 25 May 2026 17 h 26 min

Description

Ding C.H., I. Dubchak. “Multi–class protein fold recognition using support vector machines and neural networks”. Bioinformatics 17,2001,pp. 349–358.

**Ding C.H., I. Dubchak. “Multi–class protein fold recognition using support vector machines and neural networks”. Bioinformatics 17,2001,pp. 349–358.**

When the field of **bioinformatics** entered the new millennium, researchers were racing to unlock the secrets hidden in protein structures. One landmark study that still resonates today is the 2001 paper by **Ding and Dubchak**, titled *“Multi‑class protein fold recognition using support vector machines and neural networks.”* Published in *Bioinformatics* (vol. 17, pp. 349‑358), this work pioneered the integration of **machine learning**—specifically **support vector machines (SVMs)** and **neural networks**—into the challenging problem of **protein fold recognition**. In this blog post, we’ll unpack the significance of their approach, explore how it shaped modern **computational biology**, and highlight why the paper remains a cornerstone for anyone interested in **protein structure prediction**.

### The Protein Fold Recognition Challenge

Proteins are the workhorses of life, and their functions are dictated by three‑dimensional shapes known as **folds**. Determining a protein’s fold from its amino‑acid sequence—without resorting to costly experimental methods like X‑ray crystallography—has long been a holy grail of **structural bioinformatics**. Traditional methods relied on sequence alignment and homology modeling, but these struggled when dealing with **remote homologs** or proteins lacking clear evolutionary relatives.

### Why Machine Learning Made a Difference

Ding and Dubchak recognized that **pattern recognition** techniques could capture subtle, non‑linear relationships between sequence features and structural classes. They introduced a **multi‑class classification framework** that combined two powerful algorithms:

1. **Support Vector Machines (SVMs)** – SVMs excel at finding optimal hyperplanes that separate data points in high‑dimensional space. By converting protein sequences into feature vectors (e.g., amino‑acid composition, predicted secondary structure, and evolutionary profiles), the SVM could discriminate among dozens of fold categories with impressive accuracy.

2. **Neural Networks** – Complementing the SVM, the authors employed feed‑forward neural networks to model complex interactions among features that might be missed by a linear classifier. The neural network’s ability to learn hierarchical representations added robustness, especially for folds with limited training examples.

The synergy of these models created a **hybrid system** that outperformed earlier single‑method approaches, achieving a notable boost in **recognition rates** across multiple benchmark datasets.

### Key Contributions and Results

– **Multi‑class capability**: Unlike binary classifiers that only distinguish “fold vs. non‑fold,” Ding & Dubchak’s system could simultaneously predict membership across **more than 30 fold classes**, a major step toward real‑world applicability.
– **Feature engineering**: The study highlighted the importance of integrating **evolutionary information** (e.g., position‑specific scoring matrices) with physicochemical descriptors, setting a template for later deep‑learning pipelines.
– **Performance metrics**: Their hybrid model reached **≈70 % accuracy** on the standard SCOP (Structural Classification of Proteins) benchmark—a remarkable achievement for 2001, when computational resources were far more limited.

### Legacy and Modern Relevance

Fast forward two decades, and the ideas introduced by Ding and Dubchak echo in today’s **deep learning** frameworks such as AlphaFold, RoseTTAFold, and various **convolutional neural network** (CNN) architectures. While modern models leverage massive datasets and GPU acceleration, the core principle remains: **combine diverse machine‑learning techniques to capture the multifaceted nature of protein sequences**.

Moreover, the paper’s emphasis on **feature selection** and **ensemble learning** continues to guide researchers designing **hybrid pipelines** for tasks like **protein–protein interaction prediction**, **functional annotation**, and **drug target identification**.

### Practical Takeaways for Bioinformatics Enthusiasts

1. **Start with robust features** – Even with cutting‑edge deep networks, the quality of input features (e.g., PSSMs, predicted disorder) heavily influences outcomes.
2. **Consider ensemble methods** – Pairing SVMs with neural networks, as demonstrated in 2001, can still yield gains, especially when data are scarce.
3. **Benchmark rigorously** – Use standardized datasets like SCOP or CATH to evaluate model performance and ensure reproducibility.

### Closing Thoughts

The 2001 Ding & Dubchak paper stands as a testament to the power of **interdisciplinary innovation**—marrying **computer science** with **molecular biology** to solve a problem that once seemed insurmountable. By pioneering a **multi‑class, hybrid machine‑learning** approach to protein fold recognition, they laid a foundation that continues to inspire **computational biologists**, **AI researchers**, and **pharmaceutical scientists** alike.

If you’re diving into **protein structure prediction** or exploring **machine‑learning applications in bioinformatics**, revisiting this classic study offers valuable insights and a historical perspective on how far the field has come—and where it’s headed next.

*Keywords: protein fold recognition, support vector machines, neural networks, bioinformatics, machine learning, protein structure prediction, computational biology, multi‑class classification, SCOP, feature engineering, ensemble learning.*

No Tags

3 total views, 3 today

  

Listing ID: N/A

Report problem

Processing your request, Please wait....

Sponsored Links

 

A. Subasi, “Automatic recognition of alertness level from EEG by using neur...

A. Subasi, “Automatic recognition of alertness level from EEG by using neural network and wavelet coefficients,” Expert System with Application, vol. 28, pp. 701-711, 2005. […]

No views yet

 

A. Subasi, “EEG signal classification using wavelet feature extraction and ...

A. Subasi, “EEG signal classification using wavelet feature extraction and a mixture of expert model,” Expert System with Application, in press. “A. Subasi, “EEG signal […]

No views yet

 

Jamal N. Al-karaki, Ahmed E. Kamal, “Routing techniques in wireless sensor ...

Jamal N. Al-karaki, Ahmed E. Kamal, “Routing techniques in wireless sensor networks: a survey”, Wireless Communications, IEEE [see also IEEE Personal Communications], vol: 11, issue: […]

No views yet

 

G. Pfurstcheller, C. Neuper, “Motor imagery and Direct Brain-Computer Commu...

G. Pfurstcheller, C. Neuper, “Motor imagery and Direct Brain-Computer Communication,” Proc. IEEE, vol. 89, pp. 1123-1134, 2001. “Motor Imagery and Direct Brain-Computer Communication” The concept […]

No views yet

 

G. Pfurstcheller, F. H. Lopes da Silva, “ Event-related EEG/MEG synchroniza...

G. Pfurstcheller, F. H. Lopes da Silva, “ Event-related EEG/MEG synchronization and desynchronizaiton: basic principles,” Clinical Neurophysiology, vol. 110, pp. 1842-1857, 1999. None

1 total views, 1 today

 

Fan Ye, Haiyun Luo, Jerry Cheng, et al. “A Two-tier Data Dissemination Mode...

Fan Ye, Haiyun Luo, Jerry Cheng, et al. “A Two-tier Data Dissemination Model for Large-scale Wireless Sensor Networks”. In Proc. of the 8th Annual International […]

No views yet

 

E. Houdayer, E. Labyt, and J. Cassim, at al, “Relationship between event-re...

E. Houdayer, E. Labyt, and J. Cassim, at al, “Relationship between event-related beta synchronization and afferent inputs: analysis of finger movement and peripheral nerve stimulations […]

1 total views, 1 today

 

A. Schl鰃l, C. Neuper, and G. Pfurtscheller, 揈stimating the mutual informati...

A. Schl鰃l, C. Neuper, and G. Pfurtscheller, 揈stimating the mutual information of an EEG-based Brain-Computer-Interface,?Biomedizinische. Technik., vol. 47, pp. 3-8, 2002. “A. Schl鰃l, C. Neuper, […]

1 total views, 1 today

 

Yan Yu, Ramesh Govindan, Deborah Estrin. “Geographical and Energy Aware Rou...

Yan Yu, Ramesh Govindan, Deborah Estrin. “Geographical and Energy Aware Routing: a recursive data dissemination protocol for wireless sensor networks”, In Seventh Annual ACM/IEEE International […]

1 total views, 1 today

 

B. Blankertz, K. R. Muller, and G. Curio, et al, “BCI Competition 2003—Prog...

B. Blankertz, K. R. Muller, and G. Curio, et al, “BCI Competition 2003—Progress and Perspectives in Detection and Discrimination of EEG Single Trials,” IEEE Trans. […]

1 total views, 1 today

 

A. Subasi, “Automatic recognition of alertness level from EEG by using neur...

A. Subasi, “Automatic recognition of alertness level from EEG by using neural network and wavelet coefficients,” Expert System with Application, vol. 28, pp. 701-711, 2005. […]

No views yet

 

A. Subasi, “EEG signal classification using wavelet feature extraction and ...

A. Subasi, “EEG signal classification using wavelet feature extraction and a mixture of expert model,” Expert System with Application, in press. “A. Subasi, “EEG signal […]

No views yet

 

Jamal N. Al-karaki, Ahmed E. Kamal, “Routing techniques in wireless sensor ...

Jamal N. Al-karaki, Ahmed E. Kamal, “Routing techniques in wireless sensor networks: a survey”, Wireless Communications, IEEE [see also IEEE Personal Communications], vol: 11, issue: […]

No views yet

 

G. Pfurstcheller, C. Neuper, “Motor imagery and Direct Brain-Computer Commu...

G. Pfurstcheller, C. Neuper, “Motor imagery and Direct Brain-Computer Communication,” Proc. IEEE, vol. 89, pp. 1123-1134, 2001. “Motor Imagery and Direct Brain-Computer Communication” The concept […]

No views yet

 

G. Pfurstcheller, F. H. Lopes da Silva, “ Event-related EEG/MEG synchroniza...

G. Pfurstcheller, F. H. Lopes da Silva, “ Event-related EEG/MEG synchronization and desynchronizaiton: basic principles,” Clinical Neurophysiology, vol. 110, pp. 1842-1857, 1999. None

1 total views, 1 today

 

Fan Ye, Haiyun Luo, Jerry Cheng, et al. “A Two-tier Data Dissemination Mode...

Fan Ye, Haiyun Luo, Jerry Cheng, et al. “A Two-tier Data Dissemination Model for Large-scale Wireless Sensor Networks”. In Proc. of the 8th Annual International […]

No views yet

 

E. Houdayer, E. Labyt, and J. Cassim, at al, “Relationship between event-re...

E. Houdayer, E. Labyt, and J. Cassim, at al, “Relationship between event-related beta synchronization and afferent inputs: analysis of finger movement and peripheral nerve stimulations […]

1 total views, 1 today

 

A. Schl鰃l, C. Neuper, and G. Pfurtscheller, 揈stimating the mutual informati...

A. Schl鰃l, C. Neuper, and G. Pfurtscheller, 揈stimating the mutual information of an EEG-based Brain-Computer-Interface,?Biomedizinische. Technik., vol. 47, pp. 3-8, 2002. “A. Schl鰃l, C. Neuper, […]

1 total views, 1 today

 

Yan Yu, Ramesh Govindan, Deborah Estrin. “Geographical and Energy Aware Rou...

Yan Yu, Ramesh Govindan, Deborah Estrin. “Geographical and Energy Aware Routing: a recursive data dissemination protocol for wireless sensor networks”, In Seventh Annual ACM/IEEE International […]

1 total views, 1 today

 

B. Blankertz, K. R. Muller, and G. Curio, et al, “BCI Competition 2003—Prog...

B. Blankertz, K. R. Muller, and G. Curio, et al, “BCI Competition 2003—Progress and Perspectives in Detection and Discrimination of EEG Single Trials,” IEEE Trans. […]

1 total views, 1 today