Collected data from major bioinformatics databases:
NCBI
Protein Data Bank
Unified heterogeneous datasets (genes, proteins, and structures) into a single structured format
Cleaned and processed the data to ensure consistency and usability
Prepared the dataset for downstream analysis such as molecular docking and disease-gene mapping
Project Goal:
To build a reliable and scalable dataset that connects genes, proteins, and potential drug compounds, enabling better insights in biomedical research.
Tools & Technologies:
Data Integration | Bioinformatics | Data Cleaning | Excel / Data Processing