- Cerebral Valley
- Posts
- Brightwave - your autonomous investment research system đ
Brightwave - your autonomous investment research system đ
Plus: Founder Mike on leveraging a structured approach to agentic AI...
CV Deep Dive
Today, weâre talking with Mike Conover, co-founder and CEO of Brightwave.
Brightwave offers an autonomous investment research system, helping active asset managers in public and private markets generate and act on investment ideas with greater speed and comprehensiveness. Leveraging a structured approach to agentic AI, Brightwave processes and synthesizes vast amounts of dataâfrom SEC filings, earnings call transcripts and real-time news to high-signal, long-tail content from the public internetâallowing users to explore complex ideas quickly and with far less effort than a traditional investment research workflow. The platform differentiates itself by synthesizing a complete, interactive research report. Far from a question-answer style tool, Brightwave produces living documents that provide details on demand and allow users to dive deep into any topic.
Mike has over 15 years of experience in AI and machine learning, with a PhD in complex systems. His career spans work at LinkedIn, SkipFlag (acquired by Workday), and Databricks, where he created Dolly, the first LLM to demonstrate that truly open-source models can exhibit the kinds of instruction-following behavior weâve come to expect from ChatGPT.
In June 2024, Brightwave announced their $6M seed round led by Decibel Partners, with participation from Point72 Ventures, Moonfire Ventures, and angel investors, including leaders from OpenAI, Databricks, Uber, and LinkedIn.
In this conversation, Mike shares how Brightwave combines document retrieval with synthesis to generate deeper insights, discusses the toughest technical challenges in building the platform, and explains why ownership is central to Brightwaveâs team culture.
Letâs dive in âĄď¸
Read time: 8 mins
Our Chat with Mike đŹ
Thanks for joining us Mike â Letâs start with some background on yourself and what led you to found Brightwave.
Sure! Iâve been building AI and machine learning technologies for over 15 years, and Iâve worked on a wide range of machine learning teams, including tours of duty at LinkedIn, SkipFlag (a knowledge graph construction startup acquired by Workday), and most recently at Databricks. While at Databricks, I developed a language model called Dolly, which was the first to demonstrate that truly open-source technologies could exhibit the kinds of behaviors we see in LLMs like ChatGPT. Although it seems obvious now, back in April 2023 it was not clear this should be true, and our work played a major role in advancing the discussion around the democratization of AI technologies.
My co-founder has an equally deep backgroundâhe was the CTO of LedgerX, a federally-regulated derivatives exchange and clearinghouse, and filed his first deep learning patent back in 2018. We met while he was leading a large-scale semantic search effort at Workday, and we share an abiding interest in machine learning and finance.
We see the markets as the ultimate manifestation of how humans allocate and negotiate value, which obviously is attractive from a commercial standpoint, but also means weâre solving an incredibly interesting problem. If you think about the job of an active asset manager, itâs to understand something about the world nobody else has seen to identify a mispriced asset. In the context of the totality of information relevant to an investment decision, it seems clear that language models can expand the limits of human cognition, and, fundamentally, transform how humans understand complex systems like the global economy or the financial markets.
Whatâs the one-liner on Brightwave for the uninitiated?
Brightwave is building an AI-powered investment research assistant that helps financial analysts understand investment theses in greater depth and with greater speed. We have an agentic system that hardens and improves its understanding of a topic based on interactive input from an analyst, allowing them to go deep on ideas with a fraction of the effort it would have historically required.
Today we're announcing the $6M seed round for @brightwaveio led by @DecibelVC and with participation from @p72vc and @Moonfire_VC.
We are building an AI research assistant that generates insightful, trustworthy financial analysis on any subject and have customers with assets⌠x.com/i/web/status/1âŚ
â Mike Conover (@vagabondjack)
1:57 PM ⢠Jun 11, 2024
Who exactly are your users? Is Brightwave primarily built for independent investors or for institutional investors?
Weâre seeing broad-based demand across the financial services sector, from buy-side teams at long-short hedge funds to mutual funds, private credit groups, private equity, and venture capital. Especially for users focused on public equities, our value proposition is very straightforward, as we operate over SEC filings, earnings call transcripts, real-time breaking news, and content from the public web to drive insights across synthesized research findings. As we continually grow our product feature set, weâre also finding that teams in consulting, corporate strategy, corporate development, and investor relations can all benefit from Brightwave.
Let's dive a bit deeper into how the product itself actually works. What data are you processing, and how do you handle it?
We treat this as a deep technology problem, not just a wraparound for commercial LLM providers. Weâre able to leverage the Brightwave teamâs extensive experience in AI and machine learning to train our own models, alongside serving and orchestration layers that specifically address the needs of a pretty sophisticated user base. Weâre quite practical about the limitations of language models, and our approach to agentic behavior is focused on building âlane followingâ before full self-driving. We donât believe the current foundation models can fully self-supervise autonomous research, so we take a more constrained approach that allows the user to direct the attention of the system interactively.
Think of it like a Directed Acyclic Graph (DAG) or a sequence of behaviors where non-deterministic judgments are made at each step. We circumscribe the workflowâs activities, allowing the model to make decisions itâs equipped for but without leaving everything to the model. With Brightwave, the user directs the systemâs attention. For example, in our reports, users can highlight any passage in a Brightwave research report and invoke a library of LLM-powered analyses, directing the system to drill into areas of interest.
In terms of Retrieval-Augmented Generation (RAG), we treat search as a first-class citizen. Our approach focuses on search infrastructure that leverages semantically-aware representations of documents. For instance, we recognize that the executive preamble in an earnings call differs from the dialogue between an analyst and an executive, so we go beyond segmenting paragraphs to truly understand the meaning of documents. This approach leads to a higher quality of analysis and thought, which our customers have consistently told us is far superior to anything else on the market.
Users can also ingest their own content into the platform, and we will reason over that content with the same level of quality and meticulousness. Additionally, we pull material from the public Internet, as well as proprietary sources we've purchased access to. So, itâs a combination of their own data, public data, and our proprietary datasets.
Could you give us a better understanding of the competitive landscape youâre working in? And then, what sets Brightwave apart from an end user perspective?
There are a handful of companies in this space, and our users tell us weâre way out front on the basis of depth of thought and product experience. When I think of the value proposition for buyers, I think of it like Maslowâs hierarchy of needs. At the base, you have simple fact-finding and question-answering, essentially needle-in-a-haystack retrieval. For example, you can put a document into a large context window and ask a direct question, and the model will retrieve the answerâprovided it doesn't require complex thought or calculation.
The next layer up would be summarization, where given a document, Brightwave helps the user understand the gist of what it contains. We do all of that, but we see these as features vectoring towards commoditization, not standalone products. The real value comes in synthesisâconnecting fact patterns across multiple heterogeneous documents. Itâs about taking all the components like question-answering, summarization, and information retrieval and then tying them together to help users understand the complete story across many documents. Our approach introduces a lot of very interesting R&D challengesâthe UX problem weâre solving is how to reveal the thought process of a system that has considered thousands of pages of material and present that to a human in a meaningful wayâitâs highly non-trivial.
Recognizing you may not have time to pore over this 31-page whopper, we ran the bearish Goldman Sachs reports on the future of AI through @brightwaveio's analysis engine.
Needless to say, we're unconvinced there's not much to this AI thing.
â Mike Conover (@vagabondjack)
8:21 PM ⢠Jul 11, 2024
So, if synthesis makes connections that might otherwise go unnoticed, the next level of the pyramid is idea generationâidentifying the most salient fact patterns and suggesting what kinds of opportunities or nuances they might entail. Finally, at the top of the pyramid is fully autonomous research, where the model not only retrieves and synthesizes information but observes its own work process and makes decisions about which threads to pull. Brightwave operates at the higher levels of this stack because basic fact-finding and retrieval are just table stakesâtheyâre not that complicated. The synthesis and self-directed organization of the system's own work product are what set Brightwave apart.
What are your customersâ use cases that you find particularly illustrative of Brightwaveâs effectiveness?
These are the top three areas where Brightwave shines: first, it helps users make investment decisions faster and with more conviction by making sure they arenât overlooking anything crucial. Second, it enables teams to reason about massive volumes of content that would otherwise be impossible for a human to meticulously and effectively process. Third, it helps users solve the âblank pageâ problem, enabling users to generate ideas faster and evaluate factors they might not have otherwise considered.
A concrete example from our own work: I started with a broad question about how NestlĂŠ is positioned to capitalize on the secular trend towards more plant-based diets. Brightwave retrieved and analyzed thousands of pages of relevant material and highlighted that Southeast Asia has strong cultural inclinations toward plant-based meat alternatives. In terms of idea generation, Brightwave identified that Singapore is the only municipality with regulations permitting the retail sale of lab-grown meat. Pulling on that thread further, Brightwave found that JUST (the company behind plant-based mayo and egg substitutes) had recently secured something like a quarter million liters of bioreactor capacity in Singapore to produce lab-grown meat and that NestlĂŠ may face production headwinds in this very niche sub-sector of a sub-sector.
From a high-level, macro-thematic starting point, the system, with very limited supervision on my part, drilled down to super-granular insights on bioreactor capacity in Singapore. I like how this illustrates how Brightwave supports both breadth (processing large volumes of content) and depth (exploring specific avenues based on the userâs inquiries), and is representative of how our users put the system to work.
"You talk to the sharpest analysts, and they're extremely sophisticated people," said Mike Conover, CEO and co-founder of @brightwaveio. "I think it would be foolish to say that you're going to replace the sharpest, hardest working, most channel checking investment research teams⌠x.com/i/web/status/1âŚ
â Mike Conover (@vagabondjack)
9:46 PM ⢠Jul 18, 2024
You mentioned that this was a deeply technical problem. What was the hardest technical challenge in building Brightwave?
The hardest challenge has been dealing with the cumulative effects of methodological complexity in a system like ours. Imagine a physical system with a single joint that has just a little bit of "play" in it. On its own, you might not notice much, but when you have eight joints, all those tiny imperfections add up to create significant travel in the system. Brightwave involves multiple machine learning and generative subsystemsâthink of a Directed Acyclic Graph (DAG) with many different LLM calls, retrieval events, and self-critiques happening simultaneously. The challenge is optimizing all these elements jointly: how models critique their own work, how they process documents, what retrieval models are doing, and how fine-tunes affect the system. All these components must be measured and co-optimized. We canât perform gradient descent on the entire system, and so weâve had to develop new methods for measuring and improving system performance holistically. This is where the teamsâ experience on consumer-web machine learning teams comes into play. In those environments, machine learning subsystems often consume the outputs of other models. While each model might be differentiable on its own, the entire system isn't, making optimization at scale incredibly difficult.
An important infoviz note for those of you building radar plots for LLM evaluation benchmarking - organize your spokes semantically so that area inside the hull is a meaningful quantity, otherwise it's just a bad histogram.
â Mike Conover (@vagabondjack)
2:19 AM ⢠Dec 11, 2023
The core of the challenge is measurement and evaluation. It's about designing a wraparound "compass" that consistently points to true north, even though each system has its own nuances. It involves synthesizing evaluation data, human annotators, using LLM supervision to determine if a set of results is satisfactory, and, of course, classical statistical methods for measuring model performance for individual components of the system. Essentially, it's solving an integration testing problem, where every change must be evaluated not only at the atomic level but also at the composite level. You push a battery of examples through the system to ensure that, in the aggregate, it still behaves as expected. This is why so many other companies are breaking on these rocksâitâs a deeply complex, multi-layered engineering and research challenge that requires careful, continuous measurement at every level.
Are there any particular trends in AI that excite you, and how do those trends influence the way Brightwave will evolve over the next year?
Yes, there are several trends in AI that excite me. One area I'm particularly interested in is reinforcement learning-based planning algorithms that treat LLMs as just one step in a sequence of actions. Right now, agentic workflows tend to rely on something like greedy search, and I think the next wave of capabilities will come from models that jointly co-optimize planning and generative inference alongside the orchestration systems and serving unit economics that allow for speculative execution of many sub-plans.
It's also unclear to me whether we're approaching saturation on model quality yet. I do believe we'll see several more rounds of improvement, but it's fascinating that so many independent labs have reached similar benchmark scores without anyone having an extraordinary breakthrough, despite the massive economic incentives for doing so. That remains the billion-dollar question.
At Brightwave, we're skating to where the puck is going. We're building a system that's at the razor's edge of what the current state of the art is currently capable of doing, and Brightwave will improve as foundation models improve. However, we're not betting the farm on a massive breakthrough in model quality. Instead, we're focused on solving real problems today, with the expectation that tools will continue to improve incrementally.
Tell us a little bit about the team at Brightwave. What kind of culture are you building, and what kinds of roles are you hiring for?
We are definitely hiring, and our talent philosophy can be summed up as "operators only." We believe small teams of exceptional, opinionated people who know what excellent work looks like will outpace much larger groups. Ownership, humility, and craft are what we prize, with ownership mindset defining how we show up every day. We hire people who intrinsically understand, because of their upbringing, life experiences, whatever, that they have an inalienable right to choose how they respond to their environment. Itâs about having an internal locus of control, and recognizing that your ability to take an active role in navigating uncharted waters is the key determinant of the outcomes you experience in life. Thatâs the hardest thing to hire for, but weâre lucky to work with a group of people who embody these principles to a tee. They have extraordinary professional experiences, whether itâs publishing in NeurIPS or tours of duty at places like Meta, Goldman Sachs and UBS, but the thing thatâs special about Brightwave is the culture. We've hired exceptional peopleâinteresting, empathetic, intense. People want to work here. We've got folks turning down offers from the top companies in AI to join us. We've built something special.
As for roles, we fit the job to the person, not the other way around. If youâre an operator, weâll find a place for you to do the best work of your life. Weâre hiring search engineers, AI/ML engineers, systems engineers and designers to build an intelligent system like nothing else thatâs ever existed before.
Conclusion
To stay up to date on the latest with Brightwave, learn more about them here.
Read our past few Deep Dives below:
If you would like us to âDeep Diveâ a founder, team or product launch, please reply to this email ([email protected]) or DM us on Twitter or LinkedIn.
Join Slack | All Events | Jobs
Event Spotlight
Also, if youâre in San Francisco this weekend, join us for SHACK15 Conversations / 2040: A Silicon Valley Satire Author Pedro Domingos!
Join us as we welcome Pedro Domingos â a renowned AI researcher and tech industry insider â in conversation with SHACK15âs founder Jorn Lyseggen about the development of AI. The jumping off point will be Domingosâs fiction debut â 2040: A Silicon Valley Satire â a book that imagines a presidential election unlike any in U.S .history. The year is 2040 and an AI named PresiBot is facing off against a fake Native American chief seeking to abolish the United States. PresiBot is hallucination prone and buggy, KumbAI's brash CEO Ethan Burnswagger has lost the override button, and the fate of Americanâ democracy hangs in the balance, but there are bigger fish to fry because the startup is almost out of funding.
Piercing the zeitgeist around AI and set in a dystopian San Francisco in a near future we can all too easily anticipate, it features characters, entities, and incidents whose resemblance to actual ones may or may not be purely coincidental. The book raises important questions about what AI really is, how the tech industry works, where our deepening polarization might lead us, andâmost importantâhow to break out of this cycle. Copies of 2040 are available at Book Passage in the Ferry Building.