Dec. 3, 2023
LLMs are at the forefront of a computing revolution, seamlessly integrating into various industries. Our fascination with these advancements grows as we learn more about the diverse applications of LLMs from innovators in the AI community. One such innovator we recently got to know is Othmane Zoheir. His journey, starting from high-frequency trading on Wall Street to founding rumorz.io, an AI-powered financial intelligence and analytics platform, is very interesting.
In this post, Othmane shares the motivation and purpose of building Rumorz, and how LLMs are helping achieve that.
My name is Othmane, and I am a quant trader and builder of AI, trading and fintech applications. Trading, technology and AI have been my main interests since high school.
My first job out of school after graduating from the University of Michigan undergrad in 2015 was in high frequency trading on Wall Street, where I traded all day and built profitable FX market making algos for a top bank. I was having fun trading, coding and actively participating in global markets.
After 2 good years, I was even more fascinated by automation and intelligence so I went to back grad school in 2018 to study machine learning and NLP.
I’ve been working independently since then, building tech and trading applications for hedge funds and clients, and recently founded rumorz.io, an AI powered financial intelligence and analytics platform.
Though I’ve been using NLP and transformer based models since 2018, I went deep into generative AI early 2023. We’re clearly at the beginning of a new software paradigm so I thought now is the time to go all in on building a product!
During my LLM journey, I got a bit fed up with frameworks like langchain and llama-index, so I’ve built an open-source LLM library called tinyllm which I use in every LLM project of mine and in Rumorz! The library is still growing and is powering my apps in prod.
I keep up to date with the latest AI news, papers, models and tools releases, but also focus on building around what is deployable in production today. LLMs need a domain expert to really add value so I try to focus more on the data side than the generative side.
Financial intelligence
My main use case for LLMs is building Rumorz. The main benefit of LLMs for the app is in organizing financial data in a knowledge graph. LLMs excel at NER and semantic understanding so a financial knowledge graph immediately came to mind when LLMs became cheap to use. A Knowledge graph will allow traceable retrieval and Q&A to chat with the real-time graph, but also running models and alpha signals on the knowledge graph.
Stock options assistant
I recently built a Q&A chatbot to answer questions related to Stocks Options data for an Options Trader. The idea was to build an app to allow any Options trader to get fast answers around prices, volumes and metrics from the options market.
Ecommerce and customer support
I got into LLMs through a conversation with a friend who owned a Shopify store and was discussing the benefits on an LLM based chatbot. The same evening, I got into langchain, llama-index, vector stores and such to build an MVP chatbot for the store, available on Github. The chatbot was a langchain RAG with access to the store’s products, inventory and FAQ and it blew her mind. I realized how powerful LLMs were while building this chatbot.
tinyllm
After building 1 to 2 chat applications, I decided do build a leaner Python LLM library than langchain, llama-index etc which come with large unnecessary overheads.
The library is open-source and I’m using it for all my LLM apps! Building it not only helped me simplify my LLM development, but also go deeper into LLM ops and what the infratructure requirements were to build scalable LLM applications.
Like many others, I’ve started exploring RAG applications with langchain and llama-index. Prototyping with those tools was great, but I was often facing issues with code complexity, debugging and lack of documentation.
After months of using different tools, I’ve developed my own open-source Python infrastructure called tinyllm, which includes chains/RAG development, observability/tracing and evaluation. The idea is to keep code simple, easily traceable and scalable. It can integrate with any other LLM libraries like langchain if desired. I use tinyllm to develop Rumorz and it’s been stable for a while now!
In terms of chat models across, my go-to is still OpenAI’s gpt 3.5, and gpt-4 for very complex tasks. With that said, I like Mistral 7b and other smaller open-source models.
For production use, I think lots is doable with open-source, provided you know how to create good prompts or finetune models and have the budget to deploy your own model instance.
For embeddings, I’ve moved away from OpenAI’s da-vinci as it creates an unnecessary bottleneck in a pipeline due to rate limits and costs. I’ve instead for open source embeddings models as they’ve proved to be faster, free, and sometimes even more accurate than OpenAI’s embeddings.
Project 1: Stock options agent
A Stock Options trader needed a chatGPT like Options agent to answer any questions related to Stock options: Pricing, Volumes, Order flow, Greeks, Pricing and Strategies
The problem with navigating options is that access and readability of Options data is complex and not easily implementable. For each stock ticker, there are often 100s of options contracts available for pricing, trading and analyzing (different strike prices, maturity etc…). Today, traders cannot easily price and get live data on any given stock options strategy and flexible/sophisticated options pricing tools are expensive.
Project 2: Rumorz.io - the real-time financial knowledge graph
This is actually a personal project I have been working on the side for years. It started as trading tools and algos I developed on my free time for my personal trading account. I’ve recently decided to productize all those tools and ideas and make it an app, rumorz.io
Over my 10 years working in institutional trading and investments, I’ve spent hundreds of hours per year reading news, compiling summaries and extracting narratives and insights from financial news. Either for trading decisions, client reports or investment thesis development. Specifically, the business problem with financial news is 2 fold:
The solution is a real-time financial knowledge graph coupled with an AI copilot to access, understand and analyze financial markets in real-time.
The knowledge graph: each node represents a financial asset, person, company, country etc, and every edge between nodes represents a semantic relationship between the 2 entities. When a user asks a question, the RAG pipeline will find the relevant subgraph as context to answer
The rumorz.io app: rumorz extracts sentiment, emotion and trending themes from the financial knowledge graph in real-time.
We offer AI, chatbot and ML application development across all industries: finance, marketing, Ecommerce, HR, real estate etc, for startups, SMB or larger corporates.
We start from the client’s problem and current solution to the problem, and from there, reverse engineer a solution tailored specifically to the client’s existing tech stack, operations, UI/UX and level of tech proficiency. This typically includes includes development, training and education for user adoption.
Below is my portfolio and social links!