Chapter 6 Exercises
These are intended to be done after completing the worked examples.
6.1 Exercise 1 — https://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE119732
Using GSE119732, confirm whether the ID column contains Ensembl IDs with version suffixes.
- Extract the first 20 IDs.
- Count how many contain a
.. - Create a new column with versions stripped.
- Map the identifiers to HGNC symbols.
6.2 Exercise 2 — https://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE122380
Using GSE122380, confirm whether the ID column contains Ensembl IDs with version suffixes.
- Extract the first 20 IDs.
- Create a new column with versions stripped.
- Map the identifiers to HGNC symbols.
- What is different about this file?