Why It Matters
Thematic investing requires systematic identification of companies aligned with structural trends, but manually tracking exposure across thousands of documents is inefficient and inconsistent. As mega-trends like AI and decarbonization reshape markets, investors need scalable ways to quantify which companies are genuinely positioned to benefit.What It Does
TheThematicScreener
class in the bigdata-research-tools
package helps solve this problem. Designed for analysts, PMs, and strategists managing thematic portfolios or scouting new ideas, it systematically connects companies to investment themes using unstructured data from news, earnings calls, and filings.
How It Works
TheThematicScreener
combines LLM-powered theme taxonomies, semantic content retrieval, and structured scoring methodologies to deliver:
- Automated theme breakdown into specific, measurable sub-categories
- Systematic positioning analysis to identify how companies align with key themes
- Cross-sector exposure comparison enabling portfolio-level thematic assessment
- Qualitative-to-quantitative transformation that turns narrative signals into structured, actionable insights
A Real-World Use Case
This cookbook walks through a full workflow, from defining a theme to quantifying company exposure, using “Supply Chain Reshaping” analysis across Top US 100 companies as a practical example. Ready to get started? Let’s dive in!Prerequisites
To run the Thematic Screener workflow, you can choose between three options:-
▶️ Colab cookbook
- Use this if you prefer running the workflow in a cloud environment.
- Follow the instructions written directly inside the cookbook.
- API keys must be configured as described within the Colab file itself.
-
💻 GitHub cookbook
- Use this if you prefer working locally or in a custom environment.
- Follow the setup and execution instructions in the
README.md
. - API keys are required:
- Option 1: Follow the key setup process described in the
README.md
- Option 2: Refer to this guide: How to initialise environment variables
- ❗ When using this method, you must manually add the OpenAI API key:
- ❗ When using this method, you must manually add the OpenAI API key:
- Option 1: Follow the key setup process described in the
-
🐳 Docker Installation
- Docker installation is available for containerized deployment.
- Provides an alternative setup method with containerized deployment, simplifying the environment configuration for those preferring Docker-based solutions.
Setup and Imports
Below is the Python code required for setting up our environment and importing necessary libraries.Defining your Screening Parameters
- Main Theme (
main_theme
): The central concept to explore - Company Universe (
companies
): The set of companies to screen - Time Period (
start_date
andend_date
): The date range over which to run the search - Document Type (
document_type
): Specify which documents to search over (transcripts, filings, news) - Sources (
sources
): Specify set of sources within a document type, for example which news outlets (available via Bigdata API) you wish to search over - Fiscal Year (
fiscal_year
): If the document type is transcripts or filings, fiscal year needs to be specified - Model Selection (
llm_model
): The LLM model used to mindmap the theme and label the search result chunks - Rerank Threshold (
rerank_threshold
): By setting this value, you’re enabling the cross-encoder which reranks the results and selects those whose relevance is above the percentile you specify (0.7 being the 70th percentile). More information on the re-ranker can be found here. - Focus (
focus
): Specify a focus within the main theme. This will then be used in building the LLM generated mindmapper
Mindmap a Theme Taxonomy with Bigdata Research Tools
You can leverage Bigdata Research Tools to generate a comprehensive theme taxonomy with an LLM, breaking down a megatrend into smaller, well-defined concepts for more targeted analysis.Retrieve Content
With the theme taxonomy and screening parameters, you can leverage the Bigdata API to run a search on company transcripts. We need to define 3 more parameters for searching:- Frequency (
freq
): The frequency of the date ranges to search over. Supported values:Y
: Yearly intervals.M
: Monthly intervals.W
: Weekly intervals.D
: Daily intervals. Defaults to3M
.
- Document Limit (
document_limit
): The maximum number of documents to return per query to Bigdata API. - Batch Size (
batch_size
): The number of entities to include in a single batched query.
Preview the DataFrame
Preview the DataFrame
# | Timestamp UTC | RP Document ID | Sentence ID | Headline | Entity Name | Text | Masked Text |
---|---|---|---|---|---|---|---|
0 | 2024-03-21 09:00:00+00:00 | 381E04DC8007FBF551D807540B70FAEA | 381E04DC8007FBF551D807540B70FAEA-0 | ABB: 2024 Annual General Meeting on Mar 21, 2024 - Transcript | ABB Ltd. | But the climate crisis also opens up a plethora of opportunities for APB. After all, we have the innovative technologies, products and solutions that make it possible to stem the tide of climate change. Robotics and automation also play a crucial role here as they enable enterprises to utilize precious resources more economically. This starts with raw materials and continues all the way to recycling. ABB’s digitalization and automation solutions also help our customers work and manufacture their products more effectively and efficiently to support the critical transition to the circular economy and ultimately improve their competitive edge. | But the climate crisis also opens up a plethora of opportunities for APB. After all, Target Company have the innovative technologies, products and solutions that make it possible to stem the tide of climate change. Robotics and automation also play a crucial role here as they enable enterprises to utilize precious resources more economically. This starts with raw materials and continues all the way to recycling. Target Company’s digitalization and automation solutions also help Target Company customers work and manufacture their products more effectively and efficiently to support the critical transition to the circular economy and ultimately improve their competitive edge. |
1 | 2024-03-21 09:00:00+00:00 | 381E04DC8007FBF551D807540B70FAEA | 381E04DC8007FBF551D807540B70FAEA-2 | ABB: 2024 Annual General Meeting on Mar 21, 2024 - Transcript | ABB Ltd. | The significance of AI extends beyond merely enhancing work processes or executing tasks with greater speed, precision and autonomy through machine vision. Generative AI is democratizing access to automation and robotics, not just for technologists or software engineers, but for everyone. As long as people think clearly and logically, they can instruct the system on their desired outcomes, thereby unlocking the full potential of AI for every manager and worker alike. Clearly, all of this necessitates the establishment of clear standards to ensure that both we and our customers can leverage the power of AI safely and responsibly. And I want to state unequivocally that our society must enact regulations governing the use of AI. | The significance of AI extends beyond merely enhancing work processes or executing tasks with greater speed, precision and autonomy through machine vision. Generative AI is democratizing access to automation and robotics, not just for technologists or software engineers, but for everyone. As long as people think clearly and logically, they can instruct the system on their desired outcomes, thereby unlocking the full potential of AI for every manager and worker alike. Clearly, all of this necessitates the establishment of clear standards to ensure that both Target Company and Target Company customers can leverage the power of AI safely and responsibly. And I want to state unequivocally that Target Company society must enact regulations governing the use of AI. |
2 | 2024-03-21 09:00:00+00:00 | 381E04DC8007FBF551D807540B70FAEA | 381E04DC8007FBF551D807540B70FAEA-3 | ABB: 2024 Annual General Meeting on Mar 21, 2024 - Transcript | ABB Ltd. | The situation in the Middle East is deeply concerning. Vital shipping routes have been disrupted and geopolitical tensions show no sign of easing. In short, the macroeconomic and political environment is volatile. It was in the last year and it is in this year and it is profoundly uncertain. In times like these, it is essential for businesses to ensure that they are well positioned strategically Rather than allowing uncertainty to undermine our ambitions, companies must become resilient to crises and well as more agile and sustainable. | The situation in the Middle East is deeply concerning. Vital shipping routes have been disrupted and geopolitical tensions show no sign of easing. In short, the macroeconomic and political environment is volatile. It was in the last year and it is in this year and it is profoundly uncertain. In times like these, it is essential for businesses to ensure that they are well positioned strategically Rather than allowing uncertainty to undermine Target Company ambitions, companies must become resilient to crises and well as more agile and sustainable. |
3 | 2024-03-21 09:00:00+00:00 | 381E04DC8007FBF551D807540B70FAEA | 381E04DC8007FBF551D807540B70FAEA-1 | ABB: 2024 Annual General Meeting on Mar 21, 2024 - Transcript | ABB Ltd. | The ABB way also plays a vital role here, for example, with our local for local strategy. ABB may be a global corporation, but we still need to reduce our dependencies on certain geographies. This is why in the USA, we now produce around 75% of products there. In China, it’s approximately 85 percent and in Europe, about 95%. The result is more flexibility and resilience. And it’s no different with what is likely to be the greatest challenge facing humanity today, climate change. Let me make one thing abundantly clear. The energy transition, the expansion of renewable energy sources and decarbonization of our economy are not negotiable. We must tackle climate change. We owe it to ourselves and to future generations. | The Target Company way also plays a vital role here, for example, with Target Company local for local strategy. Target Company may be a global corporation, but Target Company still need to reduce Target Company dependencies on certain geographies. This is why in the USA, Target Company now produce around 75% of products there. In China, it’s approximately 85 percent and in Europe, about 95%. The result is more flexibility and resilience. And it’s no different with what is likely to be the greatest challenge facing humanity today, climate change. Let me make one thing abundantly clear. The energy transition, the expansion of renewable energy sources and decarbonization of Target Company economy are not negotiable. Target Company must tackle climate change. Target Company owe it to ourselves and to future generations. |
4 | 2024-03-21 09:00:00+00:00 | 381E04DC8007FBF551D807540B70FAEA | 381E04DC8007FBF551D807540B70FAEA-1 | ABB: 2024 Annual General Meeting on Mar 21, 2024 - Transcript | ABB Ltd. | Distinguished shareholders, Bjorn did a fantastic job for ABB. Please give him a round of applause. I’m also proud that we’ve found a competent successor from our own ranks in Morten Virod as he’s actually here as well in one of the first rows. As you know, Bjorn will be handing over the management of the company to Morten in the course of the year. Morten is very familiar with ABB and has a deep understanding of our decentralized business model, which he will continue to strengthen. This will allow us to guarantee continuity and ensure that ABB can continue on its successful trajectory. All the best, and I look forward to working with you, Morton. I spoke earlier about the need for us to be resilient in the face of crises. This also means maintaining short supply chains and direct proximity to our customers. | Distinguished shareholders, Bjorn did a fantastic job for Target Company. Please give him a round of applause. I’m also proud that Target Company’ve found a competent successor from Target Company own ranks in Morten Virod as he’s actually here as well in one of the first rows. As you know, Bjorn will be handing over the management of Target Company to Morten in the course of the year. Morten is very familiar with Target Company and has a deep understanding of Target Company decentralized business model, which he will continue to strengthen. This will allow us to guarantee continuity and ensure that Target Company can continue on its successful trajectory. All the best, and I look forward to working with you, Morton. I spoke earlier about the need for us to be resilient in the face of crises. This also means maintaining short supply chains and direct proximity to Target Company customers. |
Label the Results
Use an LLM to analyze each text chunk and determine its relevance to the sub-themes. Any chunks which aren’t explicitly linked to supply chain reshaping will be filtered out.Assess Thematic Exposure
We’ll look at the top 10 most exposed companies to supply chain reshaping. The functionget_scored_df
will calculate the composite thematic score, summing up the scores across the sub-themes for each company (df_company
) or industry (df_industry
).





Extract Key Insights
The visualizations reveal key insights about how companies are positioning themselves within the supply chain reshaping theme:AI and Machine Learning Emerges as the Core Enabler
With the highest cumulative score across all companies, AI and Machine Learning is the most dominant theme, highlighting its foundational role in predictive analytics, automation, and optimization within modern supply chains.
Circular Economy and Automation as Structural Shifts
The strong presence of Circular Economy Practices and Automation & Robotics indicates a structural shift toward sustainable and efficient supply chain models—companies are not just digitizing but rethinking operational design.
Tech-Centric Players Lead the Pack
Siemens AG, Infineon Technologies AG, and Qualcomm Inc. are the frontrunners in thematic exposure, underscoring that companies at the intersection of industrial technology and digital infrastructure are best positioned to drive—and benefit from—supply chain transformation.
IoT Integration as a Bridge Between Physical and Digital
IoT’s high ranking shows its critical role in connecting assets, enabling real-time visibility, and facilitating advanced automation, especially for manufacturers and hardware-driven firms.
Industry Polarisation
Sector Engagement
- Semiconductors and Computer Services industries show the strongest average exposure, reflecting their integral role in enabling supply chain tech (e.g., sensors, connectivity, software).
- Traditional Sectors like Diversified Industrials show broader but shallower engagement, suggesting they are still in earlier phases of thematic adoption.
Strategic Focus
Concentration vs. Diversification in Exposure Most companies exhibit thematic concentration, focusing efforts on a few high-impact areas rather than spreading across all themes—likely reflecting strategic prioritization rather than lack of alignment.Export the Results
Export the data as Excel files for further analysis or to share with the team.Conclusion
The Thematic Screener provides a powerful way to identify companies that are most aligned with or exposed to specific investment themes. By leveraging BigData’s search capabilities and applying LLM-based classification, you can:- Discover thematic leaders - Find companies with the strongest strategic alignment to emerging trends
- Compare across industries - Identify which sectors are most proactive in addressing thematic challenges and opportunities
- Identify investment opportunities - Spot companies that may be undervalued relative to their thematic positioning
- Monitor thematic evolution - Track how themes gain or lose prominence across your investment universe over time