fix(huggingface): add `stream_usage` support for ChatHuggingFace invoke/stream #32708
base: master
Conversation
CodSpeed WallTime Performance Report: merging #32708 will not alter performance.

CodSpeed Instrumentation Performance Report: merging #32708 will not alter performance.
Would you mind sharing a reproducible snippet or adding a test to demonstrate the functionality?
It looks like token usage is already accessible via streaming when using HF Endpoints:
```python
from langchain_huggingface import ChatHuggingFace, HuggingFaceEndpoint

llm = HuggingFaceEndpoint(
    repo_id="openai/gpt-oss-120b",
    task="conversational",
    provider="fireworks-ai",
)
model = ChatHuggingFace(llm=llm)

full = None
for chunk in model.stream("hello"):
    full = chunk if full is None else full + chunk
full.usage_metadata
```
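For reference, the aggregated `usage_metadata` is a `UsageMetadata` dict from `langchain_core`; a quick way to inspect it (the three keys below are the standard ones, actual counts vary by provider):

```python
# UsageMetadata is a TypedDict; these three keys are the standard ones.
usage = full.usage_metadata
if usage is not None:
    print(usage["input_tokens"], usage["output_tokens"], usage["total_tokens"])
```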
```diff
@@ -492,6 +492,9 @@ class GetPopulation(BaseModel):
     """Modify the likelihood of specified tokens appearing in the completion."""
     streaming: bool = False
     """Whether to stream the results or not."""
+    stream_usage: bool = False
```
Could we make this `stream_usage: Optional[bool] = None`? (langchain-openai mistakenly did not do this.)
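A minimal sketch of why `Optional[bool] = None` is preferable to `bool = False` here — `None` means "unset", so a per-call kwarg or a future library default can still win. The class and resolver names are hypothetical, not the actual langchain-huggingface code:

```python
from typing import Optional


class UsageConfig:
    """Toy model attribute: None means "unset", not "disabled"."""

    stream_usage: Optional[bool] = None

    def resolve_stream_usage(self, per_call: Optional[bool] = None) -> bool:
        # Precedence: explicit per-call kwarg > model attribute > library default.
        if per_call is not None:
            return per_call
        if self.stream_usage is not None:
            return self.stream_usage
        return False


cfg = UsageConfig()
assert cfg.resolve_stream_usage() is False               # nothing set: default off
assert cfg.resolve_stream_usage(per_call=True) is True   # call site can override
```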
```python
def _stream(
    self,
    messages: list[BaseMessage],
    stop: Optional[list[str]] = None,
    run_manager: Optional[CallbackManagerForLLMRun] = None,
    *,
    stream_usage: Optional[bool] = True,
```
Could we implement this on `_astream` as well?
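A hedged sketch of what that could look like, mirroring the `_stream` signature above. The `_should_stream_usage` helper and the `stream_options` plumbing are assumptions borrowed from the OpenAI-style pattern, not the actual langchain-huggingface internals:

```python
from collections.abc import AsyncIterator
from typing import Any, Optional

from langchain_core.callbacks import AsyncCallbackManagerForLLMRun
from langchain_core.messages import BaseMessage
from langchain_core.outputs import ChatGenerationChunk


async def _astream(
    self,
    messages: list[BaseMessage],
    stop: Optional[list[str]] = None,
    run_manager: Optional[AsyncCallbackManagerForLLMRun] = None,
    *,
    stream_usage: Optional[bool] = None,
    **kwargs: Any,
) -> AsyncIterator[ChatGenerationChunk]:
    # Hypothetical helper: resolve the per-call flag against the model attribute.
    if self._should_stream_usage(stream_usage, **kwargs):
        # OpenAI-compatible request for a trailing usage chunk; whether the HF
        # client forwards stream_options is an assumption of this sketch.
        kwargs["stream_options"] = {"include_usage": True}
    # Iterate the async client stream (details elided), yielding
    # ChatGenerationChunk objects and attaching usage_metadata to the chunk
    # that carries the server-reported usage.
    ...
```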
Description:
This PR fixes an issue where `stream_usage` metadata was not being returned during `invoke` or `stream` calls for HuggingFace chat models. I updated `ChatHuggingFace` (via `ChatHuggingFaceWithUsage`) to align with `BaseChatOpenAI` behavior, ensuring usage information is properly included in streaming outputs.

Issue: N/A (but addresses missing usage metadata in the HuggingFace integration).

Dependencies: None

Twitter handle: None
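A sketch of the intended end-state behavior described above, assuming the PR's `stream_usage` flag is wired through as proposed (token counts are provider-dependent):

```python
from langchain_huggingface import ChatHuggingFace, HuggingFaceEndpoint

llm = HuggingFaceEndpoint(
    repo_id="openai/gpt-oss-120b",
    task="conversational",
    provider="fireworks-ai",
)
model = ChatHuggingFace(llm=llm, stream_usage=True)

# invoke: usage should come back on the message itself.
msg = model.invoke("hello")
print(msg.usage_metadata)

# stream: usage should appear on the aggregated chunks.
full = None
for chunk in model.stream("hello"):
    full = chunk if full is None else full + chunk
print(full.usage_metadata)
```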