Our SGP2 (http://genome.crg.es/software/sgp2/index.html) predictions on the mouse genome combine geneid predictions with tblastx comparison of the human genome against the mouse genome. SGP2 was run on the masked fasta sequences for the mouse genome assembly mm10 (made up of 66 genomics sequences including 21 chromosomes), and generated 35,235 gene predictions SGP2 uses homology evidences (the SRs - similarity regions) from TBLASTX, in which a masked version of the mm10 genome assembly was compared against the hg38 human genome assembly. Predictions were obtained per chromosome and output in the following formats: mm10.sgp2 mm10.sgp2.gff mm10.sgp2.gff3 mm10.sgp2.gtf mm10.sgp2.cds (nucleotide sequence of predicted sequences) mm10.prot (amino acid sequence of predicted protein sequences)