Mastering Python : Beyond the Basics with Pipeline Engineering

About Course

Python Module I: Basics to Advanced

  • Introduction to Python language  
  • Role of Programming in Bioinformatics
  • Installation of Python on various platforms
  • Installation of IDE-Print function, Comments, User input
  • Command line arguments and Data types
  • Variables and rules to create a variable
  • In-built functions of Python slicing and indexing in String
  • String and data formatting
  • Control statements (if-else,If-elif-else,for loop, etc)
  • Python data structure (List, Set, Tuple, etc.)
  • Methods of data structures
  • Function introduction & its requirement
  • Exception Handling, File Handling & Pandas Library

Python Module II: Packages of Biopython

  • Introduction to Biopython
  • Installation of Biopython
  • Conversion of a string into a biological sequence
  • Obtaining complement, reverse complement, transcribe, reverse transcribe, and translation from a sequence
  • Finding GC content from a sequence
  • SeqIO object: Reading various biological file formats such as Fasta & GenBank
  • Analysis of fasta sequences
  • Finding GC content of a fasta file containing multiple sequences & storing the data in a file
  • Converting a GenBank file into Fasta format

Python Module III (APPLICATION:- NGS pipeline engineering using Biopython)

  • Introduction to RNA Seq  
  • Necessary Tools installation
  • Learn how Data Retrieval is done
  • Quality Check of reads
  • Trimming and cleaning of data
  • Understanding mapping of reads on the reference genome and file formats (SAM, BAM)
  • Visualization techniques
  • Gene Expression Quantification & Analyzation
  • Gene ontology enrichment analysis
  • RNA seq pipeline development
  • Pipeline automation of RNA sequencing pipeline using Python
Show More
Bitbucket