docs: update detection core with tips for using Gemini integration #1925

tberends · 2025-08-02T11:57:12Z

Description

At the request of @SkalskiP in PR https://github.com/roboflow/notebooks/pull/384.

This PR improves the documentation regarding the ordering of content in requests that combine images with text prompts. Following Google's Gemini API best practices, text prompts are now placed after image parts in the contents array when using a single image with text.

Type of change

This change requires a documentation update

How has this change been tested, please provide a testcase or example of how you tested the change?

According to the Gemini API documentation on image prompts, when using a single image with text, the recommended approach is to place the text prompt after the image part in the contents array. In our experience, this ordering produces noticeably better results.

In our testing with Piping and Instrumentation Diagrams (P&IDs) using object detection, this reordering led to markedly better accuracy in bounding box positioning. The object labels were already accurate; what improved was the spatial precision of the detected elements.
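For reviewers who want to see the ordering concretely, here is a minimal sketch of a request body with the image part placed before the text part. It is not the code from this PR: the helper name `build_request_body`, the placeholder image bytes, and the prompt string are all hypothetical, and the `contents` / `parts` / `inline_data` layout is assumed to follow the Gemini REST `generateContent` JSON shape. No request is sent.

```python
import base64


def build_request_body(image_bytes: bytes, mime_type: str, prompt: str) -> dict:
    """Build a generateContent-style JSON body with the image part first.

    Hypothetical helper for illustration; the field names are assumed to
    match the Gemini REST request format.
    """
    image_part = {
        "inline_data": {
            "mime_type": mime_type,
            # Binary image data is base64-encoded so it can travel in JSON.
            "data": base64.b64encode(image_bytes).decode("ascii"),
        }
    }
    text_part = {"text": prompt}
    # Single image + text: image part first, text prompt second.
    return {"contents": [{"parts": [image_part, text_part]}]}


# Placeholder bytes stand in for a real image file (assumption for the demo).
body = build_request_body(
    b"\x89PNG placeholder bytes",
    "image/png",
    "Detect all valves and return their bounding boxes.",
)

# List the kind of each part, in order, to confirm the layout.
part_kinds = [next(iter(part)) for part in body["contents"][0]["parts"]]
print(part_kinds)  # ['inline_data', 'text']
```

Building the parts list in one place keeps the ordering decision explicit, so swapping the two entries to compare results only needs a one-line change.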

Docs

Docs updated? What were the changes: updated the tips for prompt engineering

docs: update detection core with tips for using Gemini integration

70c2865

tberends requested a review from SkalskiP as a code owner August 2, 2025 11:57

fix(pre_commit): 🎨 auto format pre-commit hooks

f6c34af
