
Add new ChatMemory implementation to be used for stateful data extraction #1067

Open
wants to merge 1 commit into main

Conversation

mariofusco
Contributor

This pull request implements the ChatMemory that I discussed and proposed here. It should be a good fit for stateful data extraction; for instance, the included test case produces the following output:

User: hi
User: my name is Mario Fusco
User: I'm 50
Extracted Customer { firstName = "Mario", lastName = "Fusco", age = 50 }
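
For reference, here is a rough sketch of what such an extractor service could look like, assuming the usual langchain4j AI-service style; the class names, prompt text and wiring below are illustrative and not the actual code of the included test:

```java
import dev.langchain4j.memory.ChatMemory;
import dev.langchain4j.model.chat.ChatLanguageModel;
import dev.langchain4j.service.AiServices;
import dev.langchain4j.service.UserMessage;
import dev.langchain4j.service.V;

class Customer {
    String firstName;
    String lastName;
    Integer age;
}

interface CustomerExtractor {

    @UserMessage("Extract information about a customer from this text '{{text}}'. "
            + "The response must contain only the JSON with customer's data "
            + "and without any other sentence.")
    Customer extractFrom(@V("text") String text);
}

class ExtractionExample {

    static Customer extract(ChatLanguageModel model, ChatMemory statefulChatMemory) {
        // The chat memory proposed in this PR accumulates the 'text' variable
        // across invocations, so each call sees the whole conversation so far.
        CustomerExtractor extractor = AiServices.builder(CustomerExtractor.class)
                .chatLanguageModel(model)
                .chatMemory(statefulChatMemory)
                .build();

        extractor.extractFrom("hi");
        extractor.extractFrom("my name is Mario Fusco");
        return extractor.extractFrom("I'm 50");
    }
}
```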

In essence, this ChatMemory simply concatenates the values of the variables sent by the user at each iteration, recreating at each step the user message from the original prompt template and those concatenated variables. In this way, the user message sent to the LLM at the 3rd prompt of the example above will be something like:

"Extract information about a customer from this text 'hi. my name is Mario Fusco. I'm 50'. The response must contain only the JSON with customer's data and without any other sentence. You must answer strictly in the following JSON format: {\n"firstName": (type: string),\n"lastName": (type: string),\n"age": (type: integer)"

In order to implement this feature I had to add to the UserMessage both the prompt template and the set of variables from which it was created. I believe that carrying this information can be useful beyond the specific needs of this pull request. In fact, it would probably be an even better design if the UserMessage knew how to render itself, using the PromptTemplate internally instead of having its text populated from the outside as it does now. I'm open to implementing this further improvement as well, but for now I just wanted to demonstrate the general idea with the smallest possible set of changes.
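
As a rough illustration of that alternative design (not implemented in this PR), a UserMessage could keep its PromptTemplate and variables and derive its text on demand; the class below is purely a sketch:

```java
import dev.langchain4j.model.input.PromptTemplate;

import java.util.Map;

public class TemplatedUserMessage {

    private final PromptTemplate template;
    private final Map<String, Object> variables;

    public TemplatedUserMessage(PromptTemplate template, Map<String, Object> variables) {
        this.template = template;
        this.variables = variables;
    }

    // The text is rendered internally from the template instead of being
    // populated from the outside.
    public String text() {
        return template.apply(variables).text();
    }

    public PromptTemplate promptTemplate() {
        return template;
    }

    public Map<String, Object> variables() {
        return variables;
    }
}
```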

/cc @sebastienblanc

@langchain4j langchain4j added the P2 High priority label May 10, 2024
@langchain4j
Owner

Hi @mariofusco, thanks a lot! Will try to review it asap

@mariofusco
Contributor Author

Do you have any news about this? I think this pull request could also be relevant in light of broader feature requests related to chat memories. More generally, we may want to introduce some sort of minimal SPI to facilitate the pluggability of custom chat memory implementations, and maybe rewrite the existing memories in terms of this SPI. I could sketch this idea in a different pull request if you're interested, or did you perhaps already have something similar in mind?
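
To make the SPI idea a bit more tangible, one possible shape (purely illustrative, not an existing langchain4j API) could be a small factory interface that custom implementations register, for example via java.util.ServiceLoader:

```java
import dev.langchain4j.memory.ChatMemory;

public interface ChatMemoryFactory {

    // Stable name so that a memory implementation can be selected by configuration.
    String name();

    // Creates a memory instance for the given conversation / user id.
    ChatMemory createChatMemory(Object memoryId);
}
```

The existing window-based memories could then be rewritten as implementations of this SPI, while third parties could plug in their own without touching the core.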

@langchain4j
Owner

@mariofusco sorry, I did not have time to look at it yet; I will try to do it today.

@langchain4j
Owner

langchain4j commented Jun 4, 2024

Hi @mariofusco! If I understand correctly, this use case assumes interactive conversation with the user to collect all the needed details, right? But in the test there is no response from the model (with guidance what information should be provided) because it outputs a Customer object, not a text. How exactly should this be used? Thanks!

@mariofusco
Contributor Author

Hi @mariofusco! If I understand correctly, this use case assumes interactive conversation with the user to collect all the needed details, right? But in the test there is no response from the model (with guidance what information should be provided) because it outputs a Customer object, not a text. How exactly should this be used? Thanks!

That's correct: this chat memory is designed to be used with extractors, so it cannot provide any message for the user. This means that a second AI service needs to be used together with it in order to give some feedback to the user. Here you can see an example of how I used a very similar strategy; in fact, for each state of the conversation I had to use both a [Customer/Flight]Extractor and a [Customer/Flight]ChatService. I have no idea if there's a better way to achieve a similar result, and in particular whether it is possible to avoid the double AI service in this situation. Any advice is welcome.
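
For illustration, the "double AI service" pattern described above could look roughly like this; all names here are made up for the example and not taken from the linked code:

```java
import dev.langchain4j.service.UserMessage;
import dev.langchain4j.service.V;

class Customer {
    String firstName;
    String lastName;
    Integer age;
}

// First service: silently extracts structured data from the conversation so far.
interface CustomerExtractor {

    @UserMessage("Extract information about a customer from this text '{{text}}'. "
            + "The response must contain only the JSON with customer's data.")
    Customer extractFrom(@V("text") String text);
}

// Second service: produces the textual reply shown to the user.
interface CustomerChatService {

    @UserMessage("You are collecting customer data. The user said '{{text}}'. "
            + "Ask for whatever information is still missing, one question at a time.")
    String reply(@V("text") String text);
}

// At each turn both services are invoked on the same user input:
//   Customer partial = extractor.extractFrom(userInput);
//   String answer = chatService.reply(userInput);
```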
