Genomics

public data mining resources

With the advancing of sequencing technologies, more and more public data are available for you to mine. One does not have to produce his own data, rather, mining public data sets can help to generate hypothesis and even publish decent papers if done properly. In this blog post, I am going to list some of the public data resources one can take advantage of. Gene Expression Omnibus Gene Expression Omnibus (GEO) is a NCBI supported public functional genomics data repository.

Continue reading

The devil 0 and 1 coordinate systems in genomics

We need to be aware that there are two genomics coordinate systems: 1 based and 0 based. There is really no mystery between these two. You EITHER count start at 0 OR at 1. However, this can make confusions when analyzing genomic data and one may make mistakes if not keep it in mind. The TWO systems See the figure below to understand the two systems. credit due to Vince Buffalo from his book Bioinformatics data skills.

Continue reading