You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Is this specific to vertex? Could this be the default for all models? I have noticed that the langchain integration does not generally use the token counts that come back with a response, and, the token counts in the response are different than what is displayed in the dashboard. Was digging into the python SDK code today to try and debug. It seems like the llamaIndex integration does this already.
More than happy to try and contribute towards this.
When using
generate
theGenerationChunk
object containsusage_metadata
. It'd be useful to capture the token counts from there.Example:
Docs: https://python.langchain.com/v0.1/docs/integrations/llms/google_vertex_ai_palm/
The text was updated successfully, but these errors were encountered: