IBM’s generative AI solution has achieved a significant milestone by securing the top spot on the BIRD benchmark, which evaluates the performance of AI systems in handling complex database queries, according to IBM Research.
IBM’s Innovation in Data Management
With the exponential growth of data from various sources such as website clicks and sales reports, organizations face challenges in efficiently retrieving and utilizing this information. IBM’s latest innovation aims to simplify this process through the use of large language models (LLMs) to write SQL, the dominant language for database interaction.
IBM’s text-to-SQL generator, named ExSL+granite-20b-code, leverages an extractive schema-linking technique to identify database organization and retrieve relevant data tables and columns. The solution has outperformed other AI systems on the BIRD leaderboard, demonstrating its ability to parse natural language questions and translate them into SQL queries effectively.
Performance and Future Prospects
Despite being the top performer on the BIRD benchmark, IBM’s solution correctly answered 68% of questions, compared to 93% by human engineers. However, it showed promising results in code execution speed, scoring 80 out of 100, just below the 90 scored by human engineers. IBM researchers are optimistic about closing the gap between AI and human performance in SQL generation, given the rapid advancements in LLMs.
This achievement is part of IBM’s broader effort to enhance data management tools for enterprises. The company has already integrated LLM-powered components into products like IBM Knowledge Catalog and watsonx.data, which enrich structured data with descriptions and business terminology, making it easier to find and use.
Search-Locate-and-Compare with Generative AI
IBM’s approach to relational databases, which it pioneered in 1970, continues to evolve with the advent of generative AI. The company’s text-to-SQL generator is designed to handle complex queries by transforming natural language questions into precise SQL code. This involves a three-step process: schema linking, content linking, and SQL generation.
In the first step, schema linking, the system matches keywords in the question with data tables and columns. The second step, content linking, involves generating SQL code to compare relevant data columns. Finally, the system generates a series of SQL queries, selecting the most accurate one.
IBM’s solution is distinguished by its extractive schema linking method, which significantly improves speed, and its generative approach to content linking, enhancing accuracy. These innovations have propelled IBM to the forefront of the BIRD benchmark, although the real-world application remains more complex.
Conversational GUI for Enhanced User Interaction
In addition to the text-to-SQL generator, IBM is developing a conversational graphical user interface (CGUI) to facilitate better interaction with structured data. This CGUI combines the personal feel of an AI chat interface with the intuitiveness of a web-based GUI, allowing users to seamlessly interact with and review the AI system’s work.
The CGUI aligns questions and answers in the chat box with visual results in the GUI, providing a coherent user experience. Users can refine their queries, visualize results, and even export data into charts for presentations, enhancing productivity and insight generation.
IBM’s commitment to integrating these generative AI features into its watsonx products underscores its mission to drive AI into the entire data services pipeline, aiming to make data management more efficient and accessible.
What’s Next
IBM researchers are continuously working to improve the text-to-SQL generator and underlying language models. The integration of these innovations into IBM’s watsonx products will further enhance the capabilities of enterprises to manage and utilize their data effectively.
For more information, visit the IBM Research blog.
Image source: Shutterstock