Legacy Bioinformatics Toolkit

Text2Knowledge

A compact Text2Knowledge archive for biomedical acronym discovery, gene and protein lookup, taxonomy browsing, and lightweight text annotation built around Medline, GenBank, OMIM, and related reference data.

400k+ acronym entries
2.4M+ gene-name records
80k+ supported species

What It Is

From biomedical text mining to structured knowledge work

This site preserves the original Text2Knowledge bioinformatics tools: fast lookup workflows for medical acronyms, gene and protein names, taxonomy records, literature signals, and text annotation.

It also provides context for the newer Text2Knowledge ontology platform, which carries the same text-to-structure idea into a broader setting. Instead of stopping at retrieval, that newer work focuses on organizing complex language from clinical notes, research documents, policies, and conversations into navigable knowledge models.

Together the two sites show the arc of the project: this archive remains a practical legacy toolkit for biomedical exploration, while the newer platform extends the concept toward editable ontologies, provenance-aware relationships, and queryable knowledge.

Core Tools

Pick a focused workflow

Acronym Finder

Look up a medical acronym or reverse-search a long form, then rate the candidate mappings.

400,000+ acronyms release 1.0

GeneQuery

Retrieve gene and protein interaction statements extracted from literature and ranked by confidence.

interaction lookup beta

Synonym Finder

Search a gene symbol and retrieve synonyms, accessions, taxonomy IDs, and linked phenotype identifiers.

2.4M+ names 80k+ species

Gene Mapper

Traverse the same gene index with a broader query surface, including synonym expansion and accession mapping.

mapping workflow release 1.0

Gene Tagger

Paste text and label detected genes with taxonomy and accession context, with precision and recall controls.

text annotation release 1.0

GO Digger

An archival placeholder for GO term exploration and clustering work that never fully shipped.

archive coming soon

Data Sources

Medline, GenBank, OMIM, taxonomy

The tools combine automatically extracted data with limited curated input. Expect uneven coverage, but quick, useful recall for exploratory work.

Support

Need background or the newer platform?

Use the Text2Knowledge help and FAQ for workflow notes, visit the project contact page for archive details, or explore the Text2Knowledge ontology studio to see how the broader effort has evolved beyond legacy search tools.