This paper presents Matchit, a novel method for expediting issue investigation and generating actionable insights from textual data. Recognizing the challenges of extracting relevant information from large, unstructured datasets, Matchit leverages Large Language Models (LLMs) to automatically identify and categorize key information into distinct topics, each representing a different aspect of the issue under investigation. This process offers two key functionalities: fully automatic topic extraction based solely on input data, providing a concise overview of the problem and potential solutions, and user-guided extraction, where domain experts can specify the type of information or pre-defined categories to target specific insights. This flexibility allows for both broad exploration and focused analysis of the data.
Matchit's efficacy is demonstrated through its application in the automotive industry, where it successfully extracts repair diagnostics from diverse textual sources like repair records, surveys, and customer service logs. By identifying and categorizing information related to failure modes, symptoms, repair actions, and procedures, Matchit enables efficient identification of similar repairs in new datasets, significantly reducing manual review efforts.
Furthermore, Matchit's adaptability to user-defined categories and domain knowledge ensures the extraction of high-quality, relevant information. Case studies presented demonstrate the tool's effectiveness in achieving accurate matching results. Matchit's versatility extends beyond the automotive domain, offering a powerful solution for any application requiring customized information extraction, categorization, and matching from textual data.