Introducing Gene Retriever, the first data mining solution to retrieve all genes associated with a list of articles



The life sciences research community often asks: “What genes are referenced in the results of my PubMed search?” Gene Retriever, a collaboration between Acumenta Biotech and Sidra Medicine, answers this critical question by creating gene lists from the results of any PubMed search.

Comprised of software written by the Advanced Application team in Sidra’s Bio-Informatics division and the Literature Lab™ database and Gene Thesaurus™, Gene Retriever is an easy-to-use software application for Mac or Windows computers:

Gene Retriever allows users to retrieve all genes associated with a lit of articles.

How it works


Gene Retriever processes a list of PubMed IDs that you submit, and produces an analysis of the genes mentioned in the title, text and MeSH tags of each record.

Results are presented in a spreadsheet, and ranked to enable quick, comprehensive analysis. Links are provided to enable instant review of items of interest.


How accurate is Gene Retriever?

Very accurate. At the core of the Literature Lab™ database is the Acumenta Biotech Gene Thesaurus™, a repository of gene, protein and pathway nomenclature gathered from the major genomic databases and human-curated to produce searches with high precision and high recall. The Gene Thesaurus is unique because it is constantly updated on the Literature Lab platform, and human curation resolves ambiguous terminology, alias redundancy and generic terms.


analyzing your PubMed search with Gene Retriever


After running your PubMed search, click on the “Send to” link and select “File”. This will expand the window - select “PMID List” in the “Format” drop-down menu and click “Create File”. The file of PubMed IDs will be downloaded to a location that you specify on your computer.

Put this file in the Gene Retriever Input File directory, select it and click “RUN”. The analysis will begin immediately and the results will be placed in the Gene Retriever Output File directory. A list of thousands of PMIDs might take a little while to run. You can cursor over the Gene Retriever screen to see that it is at work.

The Literature Lab database starts on January 1, 1990. If your search produces abstracts going back earlier or you just want to see the genes in a specific time-frame, you can add date-range controls to your search using the following addition to your search:

Put your search in parentheses and add the syntax as shown below. You can tailor the dates to meet your interest.

( your search ) AND "1990/01/01"[EDAT] : "2019/03/31"[EDAT]

I’m interested, how do I get Gene Retriever?

It’s easy. Gene Retriever is provided with any License of the Literature Lab™ database.

Now you can easily get something you have always wanted but couldn’t get: an easy, thorough and accurate retrieval of the genes in the results of your PubMed search.

Click here for more information!