Work Details

Collected data from major bioinformatics databases:

NCBI

Protein Data Bank

Unified heterogeneous datasets (genes, proteins, and structures) into a single structured format

Cleaned and processed the data to ensure consistency and usability

Prepared the dataset for downstream analysis such as molecular docking and disease-gene mapping
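The unify-and-clean steps above can be illustrated with a minimal Python sketch. It is not the workflow used in the project (which lists Excel among its tools); the field names (gene_symbol, gene_id, protein_accession, pdb_id), the helper functions, and the sample values are all hypothetical and chosen only to show the idea of normalizing identifiers, dropping blanks and duplicates, and joining gene, protein, and structure rows into one flat record format.

```python
def normalize_id(value):
    """Trim whitespace and upper-case an identifier; return None if empty."""
    if value is None:
        return None
    cleaned = str(value).strip().upper()
    return cleaned or None


def unify(genes, proteins, structures):
    """Join gene, protein, and structure rows into flat records.

    All field names here are hypothetical, not a real database schema.
    """
    # Index genes by cleaned symbol; blanks and repeated symbols are dropped.
    gene_by_symbol = {}
    for g in genes:
        symbol = normalize_id(g.get("gene_symbol"))
        if symbol and symbol not in gene_by_symbol:
            gene_by_symbol[symbol] = str(g.get("gene_id", "")).strip()

    # Collect the set of structure IDs known for each protein accession.
    pdb_by_accession = {}
    for s in structures:
        acc = normalize_id(s.get("protein_accession"))
        pdb = normalize_id(s.get("pdb_id"))
        if acc and pdb:
            pdb_by_accession.setdefault(acc, set()).add(pdb)

    rows, seen = [], set()
    for p in proteins:
        symbol = normalize_id(p.get("gene_symbol"))
        acc = normalize_id(p.get("protein_accession"))
        if not symbol or not acc or symbol not in gene_by_symbol:
            continue  # skip proteins that cannot be linked to a known gene
        # A protein with no known structure is kept with an empty pdb_id.
        for pdb in sorted(pdb_by_accession.get(acc, {""})):
            key = (symbol, acc, pdb)
            if key in seen:
                continue
            seen.add(key)
            rows.append({
                "gene_symbol": symbol,
                "gene_id": gene_by_symbol[symbol],
                "protein_accession": acc,
                "pdb_id": pdb,
            })
    return rows


# Made-up sample rows with inconsistent casing, whitespace, and duplicates.
genes = [
    {"gene_symbol": " gene1 ", "gene_id": "101"},
    {"gene_symbol": "GENE1", "gene_id": "101"},
    {"gene_symbol": "", "gene_id": "102"},
]
proteins = [
    {"gene_symbol": "Gene1", "protein_accession": "p00001 "},
    {"gene_symbol": "GENE2", "protein_accession": "P00002"},
]
structures = [
    {"protein_accession": "P00001", "pdb_id": "1abc"},
    {"protein_accession": "P00001", "pdb_id": "1ABC"},
]

records = unify(genes, proteins, structures)
for record in records:
    print(record)
```

In this sketch, rows that cannot be linked to a known gene are dropped rather than kept with empty fields; a real pipeline might prefer to log them for manual review.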

Project Goal:

To build a reliable, scalable dataset that links genes, proteins, and potential drug compounds, so that these relationships can be explored from a single source in biomedical research.

Tools & Technologies:

Data Integration | Bioinformatics | Data Cleaning | Excel / Data Processing

Attached Files