Get to know literature datasets in our antibody search platform

Our Antibody Database ecosystem draws from multiple sources, including the scientific publications that make up the Literature Database. The Literature Database is a collection of manually curated antibody sequences extracted from those publications. It contains information that helps antibody engineering teams quickly identify sequences and the publications they were retrieved from.

What is the Literature Database?

Biological sequences are often not deposited into standardized repositories such as GenBank. Instead, they are reported directly in scientific publications and their supplementary material. Although Natural Language Processing has come a long way, there is no reliable automated method for identifying such sequences directly in papers.

The new Literature Database includes such antibody sequences, collected through manual curation of scientific publications. Each antibody sequence is linked to its target and to the metadata of the publication it originates from, which facilitates text-based retrieval.

Benefits of the Literature Database

Our Antibody Database brings together all the major data sources to form a single, comprehensive source of data on antibodies. The Literature Database is the component of this ecosystem that draws from scientific literature.

All the new data is integrated and standardized, so searching through the database is precise (antibody-specific), fast, and seamless. Each sequence is supplemented with a rich set of metadata that allows teams to draw scientific conclusions.

How it works

You can perform a text search using keywords. The target name should be the gene name. The platform uses Natural Language Processing to assign and normalize synonyms from various databases to the target.
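To illustrate what normalizing target synonyms to a gene name can look like, here is a minimal Python sketch. It is not the platform's implementation: the synonym table, the `normalize_target` function, and the matching rule (ignore case, spaces, and hyphens) are all assumptions made for this example, and a plain lookup stands in for the Natural Language Processing step.

```python
from typing import Optional

# Hypothetical synonym table keyed by gene name. A real system would
# assemble this from several curated databases.
SYNONYMS = {
    "ERBB2": ["HER2", "HER-2", "NEU"],
    "PDCD1": ["PD-1", "PD1", "CD279"],
}


def _key(name: str) -> str:
    """Upper-case the name and keep only letters and digits,
    so that 'PD-1', 'pd1' and 'PD 1' all compare equal."""
    return "".join(ch for ch in name.upper() if ch.isalnum())


# Reverse index: every alias (and the gene name itself) points to the gene name.
LOOKUP = {
    _key(alias): gene
    for gene, aliases in SYNONYMS.items()
    for alias in [gene, *aliases]
}


def normalize_target(name: str) -> Optional[str]:
    """Return the gene name for a target keyword, or None if it is unknown."""
    return LOOKUP.get(_key(name))


if __name__ == "__main__":
    for keyword in ["her-2", "PD 1", "CD279", "unknown target"]:
        print(keyword, "->", normalize_target(keyword))
```

With a table like this, a keyword search for "HER2" and one for "ERBB2" resolve to the same target, which is why entering the gene name gives the most predictable results.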

Users can also search for antibodies by sequence. When you submit a sequence, the Antibody Database runs a sequence identity search to discover similar sequences. It compares your sequence to sequences from all of our datasets, from GenBank and Therapeutics to the Literature Database.

The search engine operates on several antibody identity metrics: sequence identity over the full variable region and over each of CDR-1, CDR-2, and CDR-3. As a result, you get a breakdown of the most similar sequences across these metrics, together with the dataset each one comes from.
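The per-region breakdown described above can be sketched in a few lines of Python. This is an illustration under stated assumptions, not the platform's algorithm: it assumes the variable region and the three CDRs have already been extracted and aligned, it uses simple position-wise identity divided by the longer length, and the names `percent_identity`, `identity_breakdown`, and `REGIONS` as well as the toy sequence fragments are invented for the example.

```python
# Regions compared in this sketch (hypothetical key names).
REGIONS = ("variable", "cdr1", "cdr2", "cdr3")


def percent_identity(a: str, b: str) -> float:
    """Position-wise identity of two pre-aligned sequences,
    as a percentage of the longer sequence's length."""
    if not a or not b:
        return 0.0
    matches = sum(x == y for x, y in zip(a, b))
    return 100.0 * matches / max(len(a), len(b))


def identity_breakdown(query: dict, hit: dict) -> dict:
    """Return one identity value per region for a query/hit pair."""
    return {region: percent_identity(query[region], hit[region]) for region in REGIONS}


if __name__ == "__main__":
    # Toy fragments, shortened for readability; not real database records.
    query = {"variable": "EVQLVESGGG", "cdr1": "GFTFSSYA", "cdr2": "ISGSGGST", "cdr3": "ARDYW"}
    hit = {"variable": "EVQLVQSGGG", "cdr1": "GFTFSSYG", "cdr2": "ISGSGGST", "cdr3": "ARDFW"}
    for region, value in identity_breakdown(query, hit).items():
        print(f"{region}: {value:.1f}%")
```

Reporting the regions separately matters because two antibodies can be close over the full variable region while still differing in CDR-3, or the other way around.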

You can pick a database to see the details and get a list of sequences similar to yours, with the option to filter by identity metrics.
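Filtering a result list by identity metrics amounts to applying a minimum threshold per metric. The sketch below shows one possible way to express that in Python. The record layout, the `filter_hits` function, the identifiers, and all numeric values are hypothetical and only serve to illustrate the idea.

```python
# Hypothetical search results: one record per similar sequence,
# with identity values in percent for each metric.
HITS = [
    {"id": "lit-001", "source": "Literature", "variable": 92.0, "cdr1": 100.0, "cdr2": 87.5, "cdr3": 80.0},
    {"id": "gb-042", "source": "GenBank", "variable": 88.0, "cdr1": 87.5, "cdr2": 100.0, "cdr3": 60.0},
    {"id": "thx-007", "source": "Therapeutics", "variable": 95.0, "cdr1": 100.0, "cdr2": 100.0, "cdr3": 100.0},
]


def filter_hits(hits: list, min_identity: dict) -> list:
    """Keep hits that meet every minimum threshold, then sort them
    by full variable region identity, highest first."""
    kept = [
        hit for hit in hits
        if all(hit[metric] >= threshold for metric, threshold in min_identity.items())
    ]
    return sorted(kept, key=lambda hit: hit["variable"], reverse=True)


if __name__ == "__main__":
    # Require at least 85% identity overall and at least 80% in CDR-3.
    for hit in filter_hits(HITS, {"variable": 85.0, "cdr3": 80.0}):
        print(hit["id"], hit["source"], hit["variable"], hit["cdr3"])
```

Thresholds are passed as a dictionary so that any combination of metrics can be constrained without changing the function.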

By clicking on Details, you can see the metadata for each sequence. The metadata shown may differ depending on the database you picked.

