Objectives
- Understand the principles and advantages of the Linux system
- Know and use the main Bash commands, and chain multiple commands using pipes
- Launch programs with arguments
- Gain independence to perform command line analyses
Pedagogical Content
- Introduction to the Linux system.
- File system: directory structure, paths, home directory, file and directory management.
- Principle of protections: reading file attributes, access rights, management of user groups.
- Shell usage: review of basic commands, input/output redirection, history, completion, launching programs with arguments.
- Commands relevant to bioinformatics: grep, cut, sed, sort, more, etc.
- Remote connection (ssh): how to start a session from Linux or Windows PowerShell.
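As an illustration of the kind of command chaining listed above, here is a minimal sketch. It is not taken from the course material: the file `genes.tsv`, its three columns (gene, chromosome, biotype) and its contents are hypothetical, created only for the example. It combines grep, cut, sort and sed with pipes and an output redirection.

```shell
# Hypothetical example data: a tab-separated table (gene, chromosome, biotype)
# written to a temporary directory so the sketch is self-contained.
workdir=$(mktemp -d)
printf 'geneA\tchr1\tprotein_coding\n' >  "$workdir/genes.tsv"
printf 'geneB\tchr2\tlncRNA\n'         >> "$workdir/genes.tsv"
printf 'geneC\tchr1\tprotein_coding\n' >> "$workdir/genes.tsv"
printf 'geneD\tchr3\tprotein_coding\n' >> "$workdir/genes.tsv"

# Pipe: keep protein-coding rows, extract the chromosome column,
# sort it, count identical lines, then strip the leading padding of uniq -c.
counts=$(grep 'protein_coding' "$workdir/genes.tsv" | cut -f 2 | sort | uniq -c | sed 's/^ *//')
echo "$counts"

# Redirection: write the gene names, in reverse alphabetical order, to a file.
cut -f 1 "$workdir/genes.tsv" | sort -r > "$workdir/names.txt"
last_gene=$(head -n 1 "$workdir/names.txt")
echo "$last_gene"

# Clean up the temporary directory.
rm -rf "$workdir"
```

With this sample table, the first command prints `2 chr1` and `1 chr3`, and the second prints `geneD`.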
Environments and best practices for using the BiRD cluster
Objectives
- Understand and implement the principles of reproducible science in analysis and development projects
- Acquire basic commands necessary for optimal use of the cluster
Pedagogical Content
- Introduction to reproducibility
- Best practices on code history and sharing: Git
- Conda environment
- Presentation of the computing cluster
- Introduction to workflows using Snakemake
Objectives
- Understand the key steps in RNASeq data analysis for a differential expression study
- Know how to perform command-line analysis using Snakemake.
Pedagogical Content
Day 1
- Principle of RNASeq technology: objectives and experimental design.
- Data quality assessment (FastQC, MultiQC).
- Sequence alignment to a reference genome (STAR).
Day 2
- Differential gene expression analysis (htseq-count, DESeq2).
- Functional annotation (GO, KEGG).
- Using the Snakemake workflow system.
- Comparison between RNASeq and 3’SRP methods.
The theoretical part is followed by a step-by-step pipeline run on a test dataset.
Participants will also be able to start an analysis on their own data.
The bioinformatics platforms of the Biogenouest network (ABiMS, BiRD, GenOuest and SeBiMER) offer a “FAIR-bioinfo” training course for bioinformaticians, bioanalysts and biostatisticians.
During this course, we will present the “FAIR” principles (Findable, Accessible, Interoperable, Reusable) and their application in analysis and development projects.
Theoretical presentations will be followed by hands-on use of several tools that improve the reproducibility of analyses.