SGP predictions combine geneid predictions with tblastx comparison of the Human genome against the Mouse genome. SGP was run on the masked fasta sequences for the following human genome version: golden_path_20030410 (ncbi 33) It was also using homology evidences (the SRs - similarity regions) from TBLASTX, in which a masked version of the previous human genome assembly was comparedagainst the following mouse assembly version: (v4) mmFeb2003 (MGSCv4) Predictions were obtained per chromosome and output in the following formats: chr1.sgp (geneid) chr1.gff chr1.gtf chr1.cds (nucleotide sequence of predicted sequences) chr1.prot (amino acid sequence of predicted protein sequences) chr1_tbxsr.gff (gff output of geneid combined with tblastx)