- Konsulenttjenester
Smart Search with ChatGPT: 8-Wk Proof of Concept
The solution aims to harness the power of AI to facilitate intuitive, human-like interactions with a repository of information, ultimately enhancing accessibility and usability.
DESCRIPTION: The system will encompass a knowledge base (KB) containing approximately one hundred documents pertinent to a specific business domain. The Knowledge Base in this solution will primarily consist of PDF documents manually uploaded to Azure Storage with no initial integrations with source systems. It is important to note that this architecture focuses on machine-readable documents, eliminating the need for Optical Character Recognition (OCR) services. Additionally, all documents will be in the English language, excluding the optional translation component shown in the architecture. Azure Form Recognizer will be utilized to extract textual components, such as paragraphs, from the given documents, resulting in structured and readily usable information. This textual information will then be transposed into vector format using embeddings, with the assistance of the dedicated OpenAI service. The vectorized information will be stored in a non-relational database, such as Redis or CosmosDB, creating an effective repository of paragraphs ready for retrieval. A similar process will be applied to user-provided questions in natural language. By employing embeddings, these questions will be transformed into vector format to facilitate search purposes. The vectorized question will be compared with the content of the paragraphs extracted from the documents to identify the most relevant paragraphs and documents to construct a response. Using a "Top-K" logic, the resulting paragraphs will be filtered, and the ChatGPT service will be engaged to create a summary of their content, which directly answers the original user's question. Azure OpenAI services, including ChatGPT, will be harnessed for content retrieval and the generation of responses in natural language. Language support is crucial, with an initial focus on English for both source documentation and user-submitted questions. During the implementation phase, the potential for supporting other languages, at least for user queries, will be assessed. Leveraging Azure features, the system will be capable of responding to specific queries, including requests for summaries of research conducted by a specific institute on a particular topic and precise inquiries related to specific subjects. A user-friendly interface is essential, allowing users to input questions and receive model-generated responses in textual form. This interface will be made accessible through a prototypical web application. Initially, the system will be deployed on the Azure Avvale subscription, with plans for migration to the Azure client subscription when available.
OUTCOMES:
DELIVERABLES:
PLANNING:
AUDIENCE: ICT Dept., Data Scientists, Business users
LANGUAGES: Italian or English