Downloading SnpEff database into history: Create database instead!

hi, tried adding hg19/GRCh37 to history via SnpEff downloads but didnt work. it said the file was empty. also tried uploading it via the “Upload data” section in which I set the file as “snpeffdb” format but still doesnt work. I am practicing the galaxy training website (the genetic disease one). any suggestions?

Welcome @boi_incog

Let’s start over in a new topic, since time, the tool version, the server you are working at, and the Snpeff source location can all make a difference.

What is going on: after a recent update at the external location where these files are hosted (and downloaded from), there are some newer issues. It sounds like this is your use case, and the solution below is what we would recommended for you, too.

Please give this a try! You will want to create the SnpEff index using the genbank file, and output both the fasta and gff3 from the tool, then use those exact reference data for all of the upstream steps, including using the output fasta as a custom-genome.

Any questions while working through this, please let us know! :slight_smile:

You will want to plan ahead to avoid sequence identifier conflicts, especially when working with human data: genome, annotation, other annotation like this one, and where you plan to visualize the data. IGV can accept any genome but UCSC will require UCSC identifiers. You can try to convert identifiers in some files types but not all, see Replace column.

Use case: working at UCSC is important to you, or you have other reference data with UCSC (not Ensembl) identifiers. Solution: Pull in the UCSC version of the reference genome and annotation (RefSeq Genes is usually best), build the SnpEFF index, and all should be good to go!

XREf → Reference genomes at public Galaxy servers: GRCh38/hg38 example

Not sure whether this was the original issue, but the infrastructure hosting SnpEff and its databases seems to have issues since at least 12 July.
The author of SnpEff is aware of the issue, which seems to persist since the beginning of the week and seems to be seeking help from Microsoft, who are hosting the data.

see Databases not available? July 12, 2025? · Issue #596 · pcingola/SnpEff · GitHub

Until this is resolved there is just no way to download SnpEff databases whether through Galaxy or the command line.

1 Like

Yes, this is all the same issue! Thanks for explaining in a more concise way :rocket:

Everyone will need to build their own for now. We can help here.