add A2 cpu inference #110

MarcelWilnicki · 2025-08-25T13:30:15Z

Hi,

A proposal of A2 blueprint. The blueprint is similar to this blueprint: https://github.com/oracle-quickstart/oci-ai-blueprints/blob/main/docs/sample_blueprints/model_serving/cpu-inference/cpu-inference-mistral-vm.json

and aimed at reaching similar performance for mistral:7b-instruct-q8_0 and llama3.1:8b-instruct-q8_0 models.

from our internal benchmarks we got such numbers for A2:

mistral:7b-instruct-q8_0
b=1, t=8, tg_throughput=6.27, pp_throughput=446.95, ttft=0.15

llama3.1:8b-instruct-q8_0
b=1, t=8, tg_throughput=5.63, pp_throughput=419.10, ttft=0.20

and such numbers for E4:

mistral:7b-instruct-q8_0
b=1, t=4, tg_throughput=6.91, pp_throughput=470.62, ttft=0.15

llama3.1:8b-instruct-q8_0
b=1, t=4, tg_throughput=6.54, pp_throughput=427.20, ttft=0.17

The main difference in the blueprint is 8 cores for A2 vs 4 cores for E4.

oracle-contributor-agreement · 2025-08-25T13:30:20Z

Thank you for your pull request and welcome to our community! To contribute, please sign the Oracle Contributor Agreement (OCA).
The following contributors of this PR have not signed the OCA:

PR author: MarcelWilnicki
marcel.wilnicki@gmail.com (@MarcelWilnicki)

To sign the OCA, please create an Oracle account and sign the OCA in Oracle's Contributor Agreement Application.

When signing the OCA, please provide your GitHub username. After signing the OCA and getting an OCA approval from Oracle, this PR will be automatically updated.

If you are an Oracle employee, please make sure that you are a member of the main Oracle GitHub organization, and your membership in this organization is public.

MarcelWilnicki added 2 commits August 22, 2025 16:25

add a2 blueprint

7769e19

change deployment name

dd7f6ca

oracle-contributor-agreement bot added the OCA Required At least one contributor does not have an approved Oracle Contributor Agreement. label Aug 25, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

add A2 cpu inference #110

add A2 cpu inference #110

Uh oh!

MarcelWilnicki commented Aug 25, 2025

Uh oh!

oracle-contributor-agreement bot commented Aug 25, 2025

Uh oh!

Uh oh!

add A2 cpu inference #110

Are you sure you want to change the base?

add A2 cpu inference #110

Uh oh!

Conversation

MarcelWilnicki commented Aug 25, 2025

Uh oh!

oracle-contributor-agreement bot commented Aug 25, 2025

Uh oh!

Uh oh!