First-time user guide

How to use OpenAntigens reports

OpenAntigens builds structure-aware antigen-design reports for human cell-surface and secreted protein targets. Use this guide to search the portal, interpret report sections, prioritize constructs, and export sequences for cloning or expression planning.

Quick start

  1. Search for a gene, UniProt entry, protein name, alias, family, topology track, or disease on the Browse page.
  2. Scan the topology, disease, AlphaFold, PDB, construct-count, and cross-reactivity columns to identify reports with the evidence needed for review.
  3. Open a target report by clicking the UniProt entry name. Public release snapshots are static and read-only; pending or failed rows show release-state metadata in the browser.
  4. Start with the Interactive Construct Builder to inspect sequence, cysteine warnings, furin warnings, ortholog-equivalent regions, and structure/pLDDT/PAE panels when compatible local structure assets exist.
  5. Review the ordered construct sections, copy TSV or FASTA outputs, and use the raw JSON/Markdown links for reproducible downstream analysis.

Search effectively

The main search box scans gene symbol, UniProt entry, protein name, aliases, family name, topology track, and embedded disease names. The disease-focused search box filters rows by Open Targets disease associations.

Example searches: EGFR, ITGAV, DKK1, OPRM, cadherin, breast cancer.

Filter and sort

The Browse page defaults to all rows sorted by PubTator hits from highest to lowest. Search terms narrow the table; Reset clears both search boxes and restores PubTator-hit sorting.

Click table headers to sort by entry, gene, protein name, family, disease score, topology track, PubTator hits, design region, AlphaFold availability, PDB availability, construct count, or cross-reactivity count.

The rows-per-page selector controls table density. Use 10 or 25 for detailed review and 50 or 100 for broad triage.

Understand topology tracks

Secreted, GPI-anchored, and single-pass proteins usually emphasize soluble antigen constructs from the mature secreted protein or extracellular region. Multipass proteins include a membrane-expression track, and mixed cases with large extracellular regions also include soluble extracellular-region constructs.

For multipass targets, do not interpret the soluble construct list as a complete expression strategy. Review the membrane-expression suggestions and full-length context.

Static release snapshots

Public OpenAntigens releases are static HTML snapshots. They can be browsed locally or hosted as ordinary static files, and the visible pages represent the data available at the recorded release date.

Browser-based refresh controls are not part of the public release interface. Updated annotations are incorporated in the next scheduled release.

Report section map

SectionWhat it tells youHow to use it
Target / Design RegionResolved UniProt target, organism, sequence length, topology-derived design scope, links, PubTator count, and source identifiers.Confirm target resolution and compare the extracellular, secreted, or membrane scope with the planned design.
Interactive Construct BuilderLinked sequence, cysteine/furin warnings, copy-ready sequence exports, and structure/pLDDT/PAE panels when compatible local structure assets exist.Adjust boundaries and export human, mouse, and cynomolgus monkey-equivalent constructs.
Open Targets Disease AssociationsIndirect and direct disease associations and scores from Open Targets.Treat scores as disease-association context; therapeutic validation requires separate evidence.
HomologyHuman-to-mouse and human-to-cynomolgus monkey equivalent regions when reference mappings are available.Review conservation, boundary transfer, and cross-species reagent risk before animal screening or validation.
Cross-reactivityLocal BLAST hits ranked by bit score, with identity, coverage, E-value, coordinates, and alignment text.Inspect high-scoring non-self hits before prioritizing antibody-discovery antigens.
Cysteines / PTMs / Furin sitesPotential expression liabilities, curated UniProt PTM or processing annotations, and optional mutation guidance.Review warnings before ordering constructs. Optional Cys-to-Ser and furin-site edits in the builder update names and sequences in real time.
Family ContextCanonical family/paralog context and directional identity matrix when available.Identify close paralogs that may create specificity risks or useful comparison antigens.
Interactions / AssemblyKnown interaction or complex context, including possible obligatory partners when evidence supports it.Flag proteins that may require co-expression, partner chains, or extra caution when designing isolated soluble constructs.
Construct Summary / ConstructsPrecomputed construct classes, boundaries, homolog-equivalent sequences, images, and sequence exports.Review construct classes in display order, then refine boundaries in the builder when needed.

Recommended construct-review order

Reports list constructs in a fixed priority order, and reviewing them in that order works well: full design region, then PDB-backed, domain annotated, strict calculated, and lenient calculated constructs. For multipass proteins, review membrane-expression constructs separately from soluble extracellular-region constructs. The Pre-generated Constructs page explains why each class sits where it does and how to choose between them.

Good antigen-design signals

Favorable soluble constructs usually have strong pLDDT across the region, low internal PAE, boundaries near curated domain edges or PDB precedent, no transmembrane/cytoplasmic contamination, no unexpected unpaired cysteines, and manageable cross-reactivity to paralogs.

Warning signs

Be cautious with constructs dominated by low pLDDT, high internal PAE, long disordered linkers, unresolved topology, dense basic furin-like motifs, unpaired cysteines, very high similarity to unwanted paralogs, or missing ortholog mappings for planned animal work.

pLDDT in one sentence

pLDDT tells you whether AlphaFold is locally confident at each residue. High pLDDT supports local fold confidence, but it does not prove that two domains have a fixed relative orientation.

PAE in one sentence

PAE reports AlphaFold's estimated uncertainty in relative placement between residues or regions. Low PAE within a block supports a cohesive structural unit; high PAE between blocks often marks flexible domain motion.

Common workflows

GoalRecommended workflow
Find antibody-discovery antigen candidatesSearch the gene or disease, filter for structure availability, inspect full design-region and PDB-backed constructs, then review cross-reactivity and homolog-equivalent sequences.
Design a compact domain antigenOpen the report, start with domain annotated and strict calculated constructs, inspect pLDDT/PAE boundaries, and export TSV/FASTA from the builder.
Prioritize cross-species screening constructsCheck human, mouse, and cynomolgus monkey sequences in the Live Selected Region, then prioritize constructs with high identity and conserved boundaries.
Avoid paralog cross-reactivityUse Family Context and BLAST Cross-reactivity together. The family matrix gives paralog-level context; BLAST alignments show local high-similarity regions.
Handle a multipass proteinLook for a large extracellular region if present, then separately review membrane-expression constructs, full-length AlphaFold context, full-length BLAST, and GPCR/membrane engineering suggestions.
Review release-state gapsIf a report is pending, failed, or missing a structure, treat that as a property of the current release snapshot and check future releases for updates.

Example: EGFR

Search EGFR, open EGFR_HUMAN, compare full extracellular-region and domain constructs, inspect PDB-backed boundaries, and check ERBB family context before choosing an antigen for antibody discovery.

Example: ITGAV

Search ITGAV, review interaction and assembly context carefully, and treat isolated integrin-alpha constructs cautiously because integrins often require beta-chain context for native assembly.

Example: DKK1

Search DKK1 to see how secreted proteins are handled. Focus on mature extracellular sequence, cysteine-rich domains, disulfide context, and compact domain constructs.

Example: OPRM

Search OPRM for a multipass example. Prioritize full-length membrane-expression context and treat short loops differently from soluble extracellular-region antigens.

Troubleshooting

ProblemLikely reasonWhat to do
Report is pending or failedThe target could not be completed for the current release because of source-data limitations, structure mismatch, or processing failure.Inspect the error column and available raw JSON. The target may be resolved in a later release.
AlphaFold is missingNo canonical AlphaFold model was found or the available model did not match the canonical UniProt sequence.Use PDB/domain evidence if available, or refresh the target after source-reference updates.
Homolog sequence is missingThe ortholog reference table lacks a usable canonical sequence or mapping for that species.Use human-only design until the ortholog table is updated, or inspect source identifiers manually.
Family matrix is absentNo canonical family was identified, the family exceeded the configured size limit, or precomputed paralog data are unavailable.Review BLAST local similarity and InterPro domain annotations for specificity context.
Cross-reactivity list is longThe query region is highly conserved, contains common domains, or the full-length track captured broad family similarity.Sort by bit score, inspect alignments, and focus on high-identity hits overlapping the selected construct.
Construct looks biologically wrongAutomated heuristics can miss ligand sites, partner requirements, topology edge cases, or literature-specific constraints.Use the builder to adjust boundaries and check Methods for the exact evidence sources and limitations.

Where to go next

Use the Builder guide for detailed instructions on sequence/structure selection, pLDDT, PAE, Cys-to-Ser edits, and FASTA/TSV exports.

Methods

Use the Methods page for source databases, construct heuristics, BLAST setup, family/paralog logic, update cadence, and limitations.

Construct methodology

Use the Constructs page for the dedicated explanation of pre-generated construct classes, structural diagnostics, homolog transfer, PTM-aware review, and limitations.

Downloads

Use the Downloads page for the portal index, manifest, and flat files that support reproducible downstream analysis.

Terms

Use the Terms page for software license, generated annotation license, source-data terms, and contact information.