I work with a broad range of tools and technologies that take me from raw biological data to actionable insights. Below is a summary of my core competencies.

Programming and Databases

Python

For data manipulation, machine learning, and building complex bioinformatics pipelines.

Pandas, NumPy, Scikit-learn, XGBoost, Biopython

R

For statistical analysis, data visualization, and specialized bioinformatics packages.

ggplot2, dplyr, caret, Bioconductor, DESeq2

SQL

For writing queries to retrieve, filter, and manage data from relational databases.

PostgreSQL, MySQL

Bash/Shell Scripting

For automating workflows, managing large files, and creating reproducible analysis pipelines in a Linux environment.

AWK, sed, grep, Piping

Bioinformatics and Data Analysis

Genomic and NGS Analysis

End-to-end analysis of genomic data, from raw reads and quality control to variant calling and functional interpretation.

Viral Genomics, RNA-Seq, Variant Calling, FastQC, MAFFT, FreeBayes

Phylogenetics and Viral Evolution

Reconstructing evolutionary relationships to track viral transmission and identify key lineages.

MEGA X, Maximum Likelihood, Phylogenetic Trees, MSA

Machine Learning

Applying algorithms to biological and clinical datasets for prediction, classification, and feature identification.

Classification, Predictive Modeling, Cross-Validation, Feature Selection (RFE)

Genomic Tools

Familiar with command-line utilities for handling various genomic data formats and performing sequence analysis.

SAMtools, SRA Toolkit, BLAST, Clustal

Computational Epidemiology and Public Health

Predictive Health Analytics

Developing models to forecast health outcomes (e.g., disease risk, treatment success) to inform public health strategy.

Risk Stratification, Clinical Data Analysis, Health Informatics

Genomic Surveillance

Using genomic data to monitor pathogen evolution and transmission dynamics in real time to support outbreak response.

Variant Tracking, Outbreak Analysis, Molecular Epidemiology

Developer Tools and Platforms

Version Control

Daily user of Git and GitHub for code management, collaboration, and versioning.

Git, GitHub

Development Environments

Proficient with standard environments for creating reproducible research and reports.

Jupyter Notebooks, RMarkdown, VS Code

Scientific Databases

Skilled at retrieving and interpreting data from major public biological and viral data repositories.

NCBI, GEO, GISAID, UniProt, KEGG

Web Scripting and Data Mining

Data Acquisition and Automation

Using scripts for data mining, programmatically collecting data from websites, and interacting with APIs.

Beautiful Soup, Requests, Selenium, BioMart, API Integration

Wet Lab Skills

Molecular Biology Techniques

Fundamental laboratory experience providing essential context for the data I analyze.

DNA/RNA Extraction, PCR and qPCR, Gel Electrophoresis, Cell Culture, Spectrophotometry