Easily find the most relevant information from trusted sources and your own data. Use it to power agents that give accurate, real-time answers.
Before getting started, choose your preferred integration method π
We recommend using the API for this guide, but you can also follow along with the Python SDK.
It will only take you 5 minutes and will guide you through:β
Authenticate to bigdata.com
β
Query bigdata.com
β
Examine search results
Authenticate to bigdata.com
Enter the X-API-KEY in the header of the request to authenticate to the API.X-API-KEY: <your-api-key>Query bigdata.com
Bigdata.com processes millions of documents detecting entities
(companies, people, organizations, products, places, topics, etc) and
the events those entities play a role in. You can query all that data to
get direct insights about what you care about. You will be able to
create workflows that will optimize your day-to-day activities.Query example: As an example, I want to look for positive news about
the company that powered bigdata.com, RavenPack, in the month of June
2024.curl -X POST 'https://api.bigdata.com/v1/search' \
-H 'Content-Type: application/json' \
-H 'X-API-KEY: <your-api-key>' \
--data '{
"query": {
"filters": {
"timestamp": {
"start": "2024-05-31T22:00:00.000Z",
"end": "2024-06-30T21:59:59.999Z"
},
"entity": {
"any_of": [
"2BE1DC"
]
},
"sentiment": {
"values": [
"positive"
]
}
},
"max_chunks": 2
}
}'
Matching documents:Document headline: Asia Awards 2024: Best alternative data provider-RavenPack
Document headline: Fixed Income Portfolio Manager Jobs
Examine search results
Congratulations! π You have successfully analyzed millions of documents
and found two with the information that you are interested in.Letβs examine the output more closely. The results object contains the document headline, the publication date, the source of the document, the URL of the document, and the chunks that matched your query."results": [
{
"id": "5D16DA066F26D8D58D375B88F8B95245",
"headline": "Asia Awards 2024: Best alternative data provider-RavenPack",
"timestamp": "2024-06-20T07:20:53",
"source": {
"id": "E85BB4",
"name": "WatersTechnology",
"rank": "RANK_1"
},
"url": "https://www.waterstechnology.com/awards-rankings/7951868/asia-awards-2024-best-alternative-data-provider-ravenpack",
"chunks": [
{
"cnum": 3,
"text": "The solution ;\nThe RavenPack Factor Library removes this hurdle by offering streamlined access to actionable sentiment and market-moving indicators. Derived from unstructured data including news and transcripts, they deliver daily insights ranging from market perception, negative news, and trends for over 100,000 listed companies, to business cycle and macro insights like inflation and growth nowcasts. Even quant traders can benefit from signal components they can quickly test, thanks to this simple extract, transform and load (ETL) tool.",
"relevance": 0.09806532640442188,
"sentiment": 0.24
},
{
"cnum": 2,
"text": "RavenPack Factors augment the information available to discretionary and quantamental investors with quantitative insights without the need for a dedicated data infrastructure and team. This new offering helps investors streamline their workflows, shorten strategies' time to market, simplify risk management and capture opportunities to improve their performance ; Peter Hafez, chief data scientist, RavenPack",
"relevance": 0.09728399704675467,
"sentiment": 0.19
}
]
}
]
Summary
You have experienced using the Bigdata API to extract
insights from millions of unstructured data that bigdata.com has
processed.We recommend exploring the following key concepts to empower your
searches. It will only take you 5 minutes and will guide you through:β
Install bigdata-client package
β
Authenticate to bigdata.com
β
Query bigdata.com
β
Examine search results
Ready to get started? Letβs dive in!
Install bigdata-client package
First of all, letβs install bigdata-client package in a python
virtual environment.Open the terminal and create a virtual environment with the following
command:$ python3 -m venv bigdata_venv
Activate the virtual environment, every time you want to use it:$ source ./bigdata_venv/bin/activate
And install the bigdata-client within the environment bigdata_venv.(bigdata_venv) $ pip install bigdata-client
Authenticate to bigdata.com
Enter the python interpreter with the following commandNow you can import the Bigdata object from the bigdata_client package,>>> from bigdata_client import Bigdata
And initiate it with your bigdata.com personal credentials.>>> bigdata = Bigdata("YOUR_USERNAME", "YOUR_PASSWORD")
Query bigdata.com
Bigdata.com processes millions of documents detecting entities
(companies, people, organizations, products, places, topics, etc) and
the events those entities play a role in. You can query all that data to
get direct insights about what you care about. You will be able to
create workflows that will optimize your day-to-day activities.Query example: As an example, I want to look for positive news about
the company that powered bigdata.com, RavenPack, in the month of June
2024.from bigdata_client.query import Entity, SentimentRange
from bigdata_client.daterange import AbsoluteDateRange
# RavenPack ID: bigdata.com is powered by RavenPack. In another tutorial
# we will see how to look for entities IDs (Companies, People, etc).
RAVENPACK_ENTIY_ID="2BE1DC"
# Query positive news about RavenPack
query = Entity(RAVENPACK_ENTIY_ID) & SentimentRange([0, 1])
# Define a date range for the month of June 2024
in_june = AbsoluteDateRange("2024-06-01T08:00:00", "2024-07-01T00:00:00")
# Create a bigdata.com search
search = bigdata.search.new(query, date_range=in_june)
# Run the search and retrieve the two top documents
documents = search.run(2)
for doc in documents:
print(f"\nDocument headline: {doc.headline}")
Output:Document headline: Asia Awards 2024: Best alternative data provider-RavenPack
Document headline: Fixed Income Portfolio Manager Jobs
Examine search results
Congratulations! π You have successfully analyzed millions of documents
and found two with the information that you are interested in.Letβs examine the results more closely. The following command will
print the document headline, number of chunks that matched your query,
and chunk details.# Read all retrieved documents and print some details
for doc in documents:
print(f"\nDocument headline: {doc.headline}")
print(f"Number of chunks: {len(doc.chunks)}")
for chunk in doc.chunks:
# Print the sentiment detected in the text and the text itself
print(f" Chunk sentiment [-1, 1]: {chunk.sentiment}")
print(f" Chunk text: {chunk.text}")
Summary
You have experienced using the bigdata-client package to extract
insights from millions of unstructured data that bigdata.com has
processed.We recommend exploring the following key concepts to empower your
searches.
- Knowledge Graph: It helps
you find Entity IDs to query.
- Query filters: It describes all the
possible filters you can use in a query.
- Search Results: It describes the
Document and Chunk structure with many other parameters.
- Upload your own content: You can also upload
your private data. Bigdata.com will process them and consult them to
answer only your queries.