Query filters
Entity
A filter to match an entity by its “EntityID”. Utilize the methods provided in Knowledge Graph to identify entities/topics/sources of interest and use the obtained IDs to build queries.
Example:
If you don’t know the EntityID, you can use the autosuggest feature to find it, and use the returned entity to build the query:
The search
object is of type Search
, and
the individual items returned by the search are instances of Document
.
See the Document
and the page Search results to see available attributes and methods. Also, see
reference_entities
for further details
on each specific entity type.
Topic
A filter to match content containing macroeconomic, geopolitical, and business events. Just like in the cases before, you can use the TopicID if it’s known:
Or use the autosuggest feature to find the Topic object:
Watchlist
A filter to match content that contains the entities and topics defined in a specific watchlist.:
Source
Bigdata’s ecosystem comprises key high-quality content sources, including web content, premium news, press wires, call transcripts, and regulatory filings. Filter out your search results by the target source in your query.
Example:
Keyword
Type and search, matching a specific keyword. Note that there is stemming applied to the keyword which means that the search will also match similar words. For example, searching for “resignation” will also match results containing the word “resignations”.
Example:
Similarity
Search for sentences after transforming them into embeddings.
Example:
- The OR operator (
|
) is not supported for Similarity. If you want to search for multiple sentences, you must use AND (&
) to combine them. - Querying by a Watchlist and Similarity is not supported. We advise creating a query per Entity ID and Similarity filter.
SentimentRange
With Sentiment Ranges you can filter out document chunks by specifying a sentiment score range between -1.00 and +1.00. This score reflects the sentiment of each chunk based on the language used in every sentence. A score closer to -1.00 indicates negative sentiment, while a score closer to +1.00 indicates positive sentiment.
Document
By providing a document ID, you can retrieve all the chunks within that document, or all the chunks that meet the criteria of your query statements.
Example:
TranscriptTypes
At this point, you’re already familiar with the various components of a query and how to filter by specific types of content. Now, let’s delve into how to perform queries that allow you to discover transcripts with greater precision:
TranscriptTypes
: This filter enables querying by the document type of the transcript. ADocumentChunk
will be defined by a single document type at a time, with the possible values being:ANALYST_INVESTOR_SHAREHOLDER_MEETING
: Analyst, Investor and Shareholder meeting.CONFERENCE_CALL
: General Conference Call.Coming Soon
GENERAL_PRESENTATION
: General Presentation.EARNINGS_CALL
: Earnings Call.EARNINGS_RELEASE
: Earnings Release.Coming Soon
GUIDANCE_CALL
: Guidance Call.SALES_REVENUE_CALL
: Sales and Revenue Call.SALES_REVENUE_RELEASE
: Sales and Revenue Release.Coming Soon
SPECIAL_SITUATION_MA
: Special Situation, M&A and Other.SHAREHOLDERS_MEETING
: Shareholders Meeting.Coming Soon
MANAGEMENT_PLAN_ANNOUNCEMENT
: Management Plan Announcement.Coming Soon
INVESTOR_CONFERENCE_CALL
: Investor Conference Call.Coming Soon
Example:
SectionMetadata
: This filter allows querying for segments inside transcript documents. ADocumentChunk
will be defined by one or more sections, always within its hierarchical structure:QA
: question and answer section. This section can be decomposed on:QUESTION
: a question made during the session to a speaker.ANSWER
: an answer from a speaker of the event.
MANAGEMENT_DISCUSSION
: Management Discussion Section.
Example:
FilingTypes
You can also query a specific Filing type:
FilingTypes
: This filter enables querying by a filing type. ADocumentChunk
will be defined by a single document type at a time, with the possible values being:SEC_10_K
: Annual report filing regarding a company’s financial performance submitted to the Securities and Exchange Commission (SEC).SEC_10_Q
: Quarterly report filing regarding a company’s financial performance submitted to SEC.SEC_8_K
: Report filed whenever a significant corporate event takes place that triggers a disclosure submitted to SEC.SEC_20_F
: Annual report filing for non-U.S. and non-Canadian companies that have securities trading in the U.S.SEC_S_1
: Filing needed to register the securities of companies that wish to go public with the U.S.SEC_S_3
: Filing utilized when a company wishes to raise capital.SEC_6_K
: Report of foreign private issuer pursuant to rules 13a-16 and 15d-16.
Example:
Reporting details
When querying TranscriptTypes
or FilingTypes
, you can also filter by
reporting details like:
FiscalYear
: Integer representing the annual reporting period.FiscalQuarter
: Integer representing the fiscal quarter covered.ReportingEntity
: This field allows searching by the reporting company.
Example:
FileTag
You can also add a tag to your query to filter by private documents that include that tag.
Example:
Query operators
Bigdata also allows you to perform very complex queries in a very
expressive way. This is done by combining different query filters with
&
(AND) |
(OR) and ~
(NOT) operators. For example:
This should be sufficient for most use cases, but sometimes the query is built from an external list of entities, keywords, topics, etc. For example, provided a list of entity ids you could do:
This is a bit cumbersome, so we provide two helper function to make this
easier: All
and Any
. The first one is used to combine a list of
entities, keywords, topics, etc. with the AND operator, and the second
one is used to combine them with the OR operator. With the help from
Any
the previous example would be rewritten as:
Document Version
Search by Document Version.
Example:
See class DocumentVersion
for further details.