Bioinformatics
Bioinformatics
Next batch is coming soon!
- 549 students
- Last updated 25/5/2025
Descriptions
The SmartED Bioinformatics Industrial Training Program offers a hands-on, project-driven approach to mastering modern biological data analysis. Students are trained in essential tools and techniques, from sequence alignment and DNA/RNA data retrieval to NGS analysis and differential gene expression using popular tools like FastQC, Trimmomatic, Hisat2, SAMtools, StringTie, and Ballgown. The course includes Linux (Ubuntu) and R environment setup, making it ideal for students in life sciences, biotechnology, and computational biology. By the end of the course, learners complete a capstone project on gene expression profiling using publicly available datasets from GEO or ENA.
Key Points
- Understand the Foundations of Bioinformatics
- Use Bioinformatics Databases & File Formats
- Perform Sequence Alignment & Analysis
- Analyze NGS Data from Start to Finish
- Run a Differential Gene Expression Pipeline
- Work with Linux, R, and Real Genomic Datasets
Course Lessons
- Topics: Role in biology, DNA/RNA/protein data types, file formats
- Capstone: Compare different biological data types with examples
- Homework: Identify and describe 3 bioinformatics databases
- Topics: DNA libraries, types of sequencing, alignment tools
- Capstone: Align DNA sequences using online tools (e.g., BLAST)
- Homework: Run a global and local alignment simulation
- Topics: 1st to 4th gen sequencing, QC, preprocessing
- Capstone: Prepare NGS pipeline with required tools
- Homework: Explain differences between 2nd and 3rd gen sequencing
- Topics: GEO/ENA access, Ubuntu & R setup, commands
- Capstone: Install and configure tools in Ubuntu for analysis
- Homework: Download and document a sample dataset from GEO
- Topics: FastQC, Trimmomatic, Hisat2, SAMtools
- Capstone: Build a functional pipeline and run preprocessing
- Homework: Document each step of the preprocessing pipeline
- Topics: StringTie, Ballgown, result interpretation
- Capstone: Analyze differentially expressed genes and visualize results in R
- Homework: Write a mini-report summarizing key findings and graphs
Projects
- Objective:
Conduct full gene expression profiling using an NGS dataset from GEO or ENA, running the full preprocessing and analysis pipeline.
- Requirements:
Retrieve real RNA-Seq data, Run FastQC, Trimmomatic, Hisat2, SAMtools, Assemble transcripts using StringTie, Analyze results using Ballgown in R, Present key differentially expressed genes with visualization, Submit final project report including commands, outputs, and biological interpretation.
- Format:
Individual project with report submission and optional presentation

Instructor

Mentor
This course includes:
- 25+ hours on-demand video
- Full lifetime access
- Access on mobile and TV
- Free Webinar
- Certificate of completion
After the final task and according to the results


After the final task and according to the results
Government Certified
Earn NSDC Certification
Benefits of NSDC Certification:
- Government-Recognized Credential
- Industry-Accepted Validation
- Enhanced Employability
- Added Value for Higher Education & International Opportunities
- Alignment with Skill India Mission
- National Skill Registry Entry