Simple text searches at rcsb.org are now easier and more accurate. Text searching from the top query bar has been redesigned: it is now powered by the open source Apache Solr platform and is based on an index of PDBx/mmCIF data.
This new functionality is accessed by entering a search term or terms in the top bar of any RCSB PDB page and hitting ‘GO’. The text search supports searches for multiple words (for example, insulin receptor) as well as queries for adjacent words by enclosing the search term in double quotation marks (for example, “insulin receptor”). These two types of searches may return different results. The first search finds results where the search words appear anywhere in the entry, whereas the second search returns results where the search terms appear exactly as ordered in the query.
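The difference between the two query types can be illustrated with a small, self-contained Python sketch. This is not the Solr implementation behind the site; it is a toy model that assumes an unquoted query requires every word to occur somewhere in the text, while a quoted query requires the words to be adjacent and in order. The function names and the sample entry descriptions are hypothetical.

```python
import re

def tokenize(text):
    """Lowercase the text and split it into alphanumeric word tokens."""
    return re.findall(r"[a-z0-9]+", text.lower())

def matches_words(query, text):
    """Unquoted query: every query word occurs somewhere in the text.

    Requiring all words (AND semantics) is an assumption of this sketch.
    """
    words = set(tokenize(text))
    return all(w in words for w in tokenize(query))

def matches_phrase(query, text):
    """Quoted query: the query words occur adjacent and in the given order."""
    q = tokenize(query)
    t = tokenize(text)
    n = len(q)
    return n > 0 and any(t[i:i + n] == q for i in range(len(t) - n + 1))

# Hypothetical entry descriptions, invented for illustration only.
entries = {
    "A": "Insulin receptor kinase domain",
    "B": "Receptor-bound insulin analogue",
    "C": "Hemoglobin",
}

words_hits = [k for k, v in entries.items() if matches_words("insulin receptor", v)]
phrase_hits = [k for k, v in entries.items() if matches_phrase("insulin receptor", v)]

print(words_hits)   # entries containing both words, in any position
print(phrase_hits)  # entries containing the exact phrase
```

In this toy data set the unquoted query matches entries A and B, while the quoted query matches only A, which mirrors why the two searches may return different numbers of results.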
Search results are assigned “Match Scores” that indicate the relevance of each result. The scores can be used to sort structures from “Higher to Lower” matches and vice versa. The figure below shows a search for the name Perutz.
When a search term appears in one of the following categories, the corresponding PDBx/mmCIF tokens are highlighted to help users gauge their level of interest in particular entries.
The figure below shows the results for an entry found with the search query "insulin receptor". Note the highlighting indicating the matching fields:
This figure shows the results for an entry found with the search query insulin receptor (without quotes). More results are returned than in the previous example. Note the highlighted terms insulin, receptor, and insulin receptor:
If a query match is found only in other tokens of a data file, results will be returned without highlighting and with the note “matching fields are not prominent.” The figure below shows a search for the term “model peptide”. In entry 3OTP, the term appears only in the _entity.details token in the entry’s data file.
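The highlighting behavior can be sketched roughly as follows. This is a toy model only: the set of highlighted tokens used here (`PROMINENT_FIELDS`) is an assumption made for illustration, the record contents are invented, and the function name `summarize_match` is hypothetical. It is not how the site implements the feature.

```python
# Assumed set of "prominent" tokens; the actual list used by the site may differ.
PROMINENT_FIELDS = {"_struct.title", "_struct_keywords.text"}

NOT_PROMINENT_NOTE = "matching fields are not prominent"

def summarize_match(term, record):
    """Return (highlighted_fields, note) for a search term and one record.

    highlighted_fields lists the prominent tokens whose text contains the term.
    note is set only when the term matches, but in non-prominent tokens alone.
    """
    needle = term.lower()
    matched = [field for field, value in record.items() if needle in value.lower()]
    highlighted = sorted(field for field in matched if field in PROMINENT_FIELDS)
    if highlighted:
        return highlighted, None
    if matched:
        return [], NOT_PROMINENT_NOTE
    return [], None

# Hypothetical record, invented for illustration only.
record = {
    "_struct.title": "Crystal structure of a protease complex",
    "_entity.details": "bound to a model peptide",
}

print(summarize_match("protease", record))
print(summarize_match("model peptide", record))
```

Here the term "protease" is reported with a highlighted `_struct.title`, whereas "model peptide" matches only `_entity.details` and is therefore returned with the note instead of highlighting.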
RCSB PDB is managed by two members of the Research Collaboratory for Structural Bioinformatics: Rutgers and UCSD/SDSC.
The RCSB PDB is funded by a grant (DBI-1338415) from the National Science Foundation, the National Institutes of Health, and the US Department of Energy.