Conversation

@seunggil1 seunggil1 commented Sep 2, 2025

Why are these changes needed?

1. OpenAIChatCompletionClient.count_tokens (from autogen_ext.models.openai) ignores several JSON Schema fields in tool definitions (e.g., anyOf, default, title), printing "Not supported field ..." warnings and producing a consistent gap between the pre-send estimate and usage.prompt_tokens.

AutoGen tool definition example

from typing import Annotated, Literal, Optional

from autogen_core.tools import FunctionTool

def tool3(
    test1: Annotated[Optional[str], "example"] = None,
    test2: Literal["1", "2"] = "2",
) -> str:
    return str(test1) + str(test2)

tools = [FunctionTool(tool3, description="example tool 3")]
client.count_tokens(messages, tools=tools)

Printed warning log

Not supported field anyOf
Not supported field default
Not supported field title
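To make the warnings concrete, here is a hand-written sketch of the parameters schema a signature like tool3's produces, and which of its fields a counter that only knows the basic keywords would skip. Both the schema and the supported set are illustrative assumptions, not the library's actual output:

```python
# Hand-written approximation of the JSON Schema derived from tool3's signature.
parameters_schema = {
    "type": "object",
    "properties": {
        "test1": {
            "anyOf": [{"type": "string"}, {"type": "null"}],
            "default": None,
            "title": "Test1",
            "description": "example",
        },
        "test2": {
            "enum": ["1", "2"],
            "default": "2",
            "title": "Test2",
            "type": "string",
        },
    },
    "required": [],
}

# Illustrative set of fields the counter handles; anything else triggers
# a "Not supported field" warning.
supported = {"type", "description", "enum", "items", "properties", "required"}
unsupported = sorted(
    {
        field
        for prop in parameters_schema["properties"].values()
        for field in prop
        if field not in supported
    }
)
for field in unsupported:
    print(f"Not supported field {field}")
```

Running this prints exactly the three warnings logged above: anyOf, default, and title are the only property-level fields outside the supported set.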

Changes

  • Add handling for the anyOf, default, and title fields in count_tokens_openai
elif field == "anyOf":
    tool_tokens -= 3
    for o in v["anyOf"]:
        tool_tokens += 3
        tool_tokens += len(encoding.encode(o["type"]))
elif field == "default":
    tool_tokens += 2
    tool_tokens += len(encoding.encode(json.dumps(v["default"])))
elif field == "title":
    tool_tokens += 2
    tool_tokens += len(encoding.encode(v["title"]))
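The new branches can be exercised in isolation. The sketch below lifts them into a standalone function with a trivial whitespace "encoder" standing in for tiktoken; the function name and stub encoder are illustrative, not part of the library:

```python
import json

class StubEncoding:
    """Whitespace tokenizer standing in for a tiktoken encoding."""

    def encode(self, text: str) -> list[str]:
        return text.split()

def count_field_tokens(field: str, v: dict, encoding=StubEncoding()) -> int:
    """Token cost of one schema field, mirroring the branches added above."""
    tool_tokens = 0
    if field == "anyOf":
        tool_tokens -= 3
        for o in v["anyOf"]:
            tool_tokens += 3
            tool_tokens += len(encoding.encode(o["type"]))
    elif field == "default":
        tool_tokens += 2
        tool_tokens += len(encoding.encode(json.dumps(v["default"])))
    elif field == "title":
        tool_tokens += 2
        tool_tokens += len(encoding.encode(v["title"]))
    return tool_tokens

prop = {"anyOf": [{"type": "string"}, {"type": "null"}], "default": None, "title": "Test1"}
print(count_field_tokens("anyOf", prop))    # -3 + (3 + 1) + (3 + 1) = 5
print(count_field_tokens("default", prop))  # 2 + len(encode("null")) = 3
print(count_field_tokens("title", prop))    # 2 + len(encode("Test1")) = 3
```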

Limitations

  • This change reduces—but does not eliminate—the discrepancy between estimated token counts and actual usage.
  • I don't know the exact token-counting logic OpenAI applies to tool schemas, so the new branches simply mirror the handling of the existing fields in count_tokens_openai.

2. In actual requests, AutoGen omits the tools parameter entirely when no tools are provided, but the current implementation adds a fixed +12 tokens unconditionally, which overcounts for tool-less calls.

# num_tokens += 12  # before
if oai_tools:       # after: count the fixed overhead only when tools are sent
    num_tokens += 12
return num_tokens
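Read as a standalone helper, the change amounts to the following (the function name is illustrative; in the PR the logic sits inline at the end of count_tokens_openai):

```python
def finalize_token_count(num_tokens: int, oai_tools: list) -> int:
    # The fixed 12-token tools overhead only applies when the request
    # actually carries a tools parameter; tool-less calls skip it.
    if oai_tools:
        num_tokens += 12
    return num_tokens

print(finalize_token_count(16, []))                      # 16: no tools, no overhead
print(finalize_token_count(16, [{"type": "function"}]))  # 28: overhead added
```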
  • Difference in results before and after the change
from autogen_core.models import UserMessage
from autogen_ext.models.openai import OpenAIChatCompletionClient

messages = [UserMessage(content="What is the current time in Seoul?", source="user")]
model_client = OpenAIChatCompletionClient(model="gpt-4o")
token_estimate = model_client.count_tokens(messages=messages)

create_result = await model_client.create(
    messages=messages,
    cancellation_token=ctx.cancellation_token,
)
token_usage = create_result.usage.prompt_tokens

if token_usage != token_estimate:
    print(f"Token usage mismatch: estimated {token_estimate}, actual {token_usage}")

Before the change:
Token usage mismatch: estimated 29, actual 16
After the change:
Token usage mismatch: estimated 17, actual 16

Related issue number

Closes #6980

Checks

@seunggil1
Author

@microsoft-github-policy-service agree

Successfully merging this pull request may close these issues.

OpenAIChatCompletionClient.count_tokens undercounts due to missing Tool-schema fields and emits Not supported field warnings