llama-stack-mirror/tests/integration/recordings/responses
Matthew Farrellee ce7a3b4dff
feat: update Cerebras inference provider to support dynamic model listing (#3481)
# What does this PR do?

- update Cerebras to use OpenAIMixin
- enable openai completions tests
- enable openai chat completions tests
- disable with n > 1 tests
- add recording for --setup cerebras --subdirs inference --pattern
openai


## Test Plan

`./scripts/integration-tests.sh --stack-config server:ci-tests --setup
cerebras --subdirs inference --pattern openai`

```
tests/integration/inference/test_openai_completion.py::test_openai_completion_non_streaming[txt=cerebras/llama-3.3-70b-inference:completion:sanity] 
instantiating llama_stack_client
Port 8321 is already in use, assuming server is already running...
llama_stack_client instantiated in 0.053s
PASSED                                                                                            [  2%]
tests/integration/inference/test_openai_completion.py::test_openai_completion_non_streaming_suffix[txt=cerebras/llama-3.3-70b-inference:completion:suffix] SKIPPED (Suffix is not supported for the model: cerebras/llama-3.3-70b.)                   [  4%]
tests/integration/inference/test_openai_completion.py::test_openai_completion_streaming[txt=cerebras/llama-3.3-70b-inference:completion:sanity] PASSED                                                                                                [  6%]
tests/integration/inference/test_openai_completion.py::test_openai_completion_prompt_logprobs[txt=cerebras/llama-3.3-70b-1] SKIPPED (Model cerebras/llama-3.3-70b hosted by remote::cerebras doesn't support vllm extra_body parameters.)             [  8%]
tests/integration/inference/test_openai_completion.py::test_openai_completion_guided_choice[txt=cerebras/llama-3.3-70b] SKIPPED (Model cerebras/llama-3.3-70b hosted by remote::cerebras doesn't support vllm extra_body parameters.)                 [ 10%]
tests/integration/inference/test_openai_completion.py::test_openai_chat_completion_non_streaming[openai_client-txt=cerebras/llama-3.3-70b-inference:chat_completion:non_streaming_01] PASSED                                                          [ 12%]
tests/integration/inference/test_openai_completion.py::test_openai_chat_completion_streaming[openai_client-txt=cerebras/llama-3.3-70b-inference:chat_completion:streaming_01] PASSED                                                                  [ 14%]
tests/integration/inference/test_openai_completion.py::test_openai_chat_completion_streaming_with_n[openai_client-txt=cerebras/llama-3.3-70b-inference:chat_completion:streaming_01] SKIPPED (Model cerebras/llama-3.3-70b hosted by remote::cere...) [ 17%]
tests/integration/inference/test_openai_completion.py::test_inference_store[openai_client-txt=cerebras/llama-3.3-70b-True] PASSED                                                                                                                     [ 19%]
tests/integration/inference/test_openai_completion.py::test_inference_store_tool_calls[openai_client-txt=cerebras/llama-3.3-70b-True] PASSED                                                                                                          [ 21%]
tests/integration/inference/test_openai_completion.py::test_openai_chat_completion_non_streaming_with_file[txt=cerebras/llama-3.3-70b] SKIPPED (Model cerebras/llama-3.3-70b hosted by remote::cerebras doesn't support chat completion calls wit...) [ 23%]
tests/integration/inference/test_openai_embeddings.py::test_openai_embeddings_single_string[openai_client-cerebras/llama-3.3-70b-None-None-None-384] SKIPPED (embedding_model_id empty - skipping test)                                               [ 25%]
tests/integration/inference/test_openai_embeddings.py::test_openai_embeddings_multiple_strings[openai_client-cerebras/llama-3.3-70b-None-None-None-384] SKIPPED (embedding_model_id empty - skipping test)                                            [ 27%]
tests/integration/inference/test_openai_embeddings.py::test_openai_embeddings_with_encoding_format_float[openai_client-cerebras/llama-3.3-70b-None-None-None-384] SKIPPED (embedding_model_id empty - skipping test)                                  [ 29%]
tests/integration/inference/test_openai_embeddings.py::test_openai_embeddings_with_dimensions[openai_client-cerebras/llama-3.3-70b-None-None-None-384] SKIPPED (embedding_model_id empty - skipping test)                                             [ 31%]
tests/integration/inference/test_openai_embeddings.py::test_openai_embeddings_with_user_parameter[openai_client-cerebras/llama-3.3-70b-None-None-None-384] SKIPPED (embedding_model_id empty - skipping test)                                         [ 34%]
tests/integration/inference/test_openai_embeddings.py::test_openai_embeddings_empty_list_error[openai_client-cerebras/llama-3.3-70b-None-None-None-384] SKIPPED (embedding_model_id empty - skipping test)                                            [ 36%]
tests/integration/inference/test_openai_embeddings.py::test_openai_embeddings_invalid_model_error[openai_client-cerebras/llama-3.3-70b-None-None-None-384] SKIPPED (embedding_model_id empty - skipping test)                                         [ 38%]
tests/integration/inference/test_openai_embeddings.py::test_openai_embeddings_different_inputs_different_outputs[openai_client-cerebras/llama-3.3-70b-None-None-None-384] SKIPPED (embedding_model_id empty - skipping test)                          [ 40%]
tests/integration/inference/test_openai_embeddings.py::test_openai_embeddings_with_encoding_format_base64[openai_client-cerebras/llama-3.3-70b-None-None-None-384] SKIPPED (embedding_model_id empty - skipping test)                                 [ 42%]
tests/integration/inference/test_openai_embeddings.py::test_openai_embeddings_base64_batch_processing[openai_client-cerebras/llama-3.3-70b-None-None-None-384] SKIPPED (embedding_model_id empty - skipping test)                                     [ 44%]
tests/integration/inference/test_openai_completion.py::test_openai_completion_prompt_logprobs[txt=cerebras/llama-3.3-70b-0] SKIPPED (Model cerebras/llama-3.3-70b hosted by remote::cerebras doesn't support vllm extra_body parameters.)             [ 46%]
tests/integration/inference/test_openai_completion.py::test_openai_chat_completion_non_streaming[openai_client-txt=cerebras/llama-3.3-70b-inference:chat_completion:non_streaming_02] PASSED                                                          [ 48%]
tests/integration/inference/test_openai_completion.py::test_openai_chat_completion_streaming[openai_client-txt=cerebras/llama-3.3-70b-inference:chat_completion:streaming_02] PASSED                                                                  [ 51%]
tests/integration/inference/test_openai_completion.py::test_openai_chat_completion_streaming_with_n[openai_client-txt=cerebras/llama-3.3-70b-inference:chat_completion:streaming_02] SKIPPED (Model cerebras/llama-3.3-70b hosted by remote::cere...) [ 53%]
tests/integration/inference/test_openai_completion.py::test_inference_store[openai_client-txt=cerebras/llama-3.3-70b-False] PASSED                                                                                                                    [ 55%]
tests/integration/inference/test_openai_completion.py::test_inference_store_tool_calls[openai_client-txt=cerebras/llama-3.3-70b-False] PASSED                                                                                                         [ 57%]
tests/integration/inference/test_openai_embeddings.py::test_openai_embeddings_single_string[llama_stack_client-cerebras/llama-3.3-70b-None-None-None-384] SKIPPED (embedding_model_id empty - skipping test)                                          [ 59%]
tests/integration/inference/test_openai_embeddings.py::test_openai_embeddings_multiple_strings[llama_stack_client-cerebras/llama-3.3-70b-None-None-None-384] SKIPPED (embedding_model_id empty - skipping test)                                       [ 61%]
tests/integration/inference/test_openai_embeddings.py::test_openai_embeddings_with_encoding_format_float[llama_stack_client-cerebras/llama-3.3-70b-None-None-None-384] SKIPPED (embedding_model_id empty - skipping test)                             [ 63%]
tests/integration/inference/test_openai_embeddings.py::test_openai_embeddings_with_dimensions[llama_stack_client-cerebras/llama-3.3-70b-None-None-None-384] SKIPPED (embedding_model_id empty - skipping test)                                        [ 65%]
tests/integration/inference/test_openai_embeddings.py::test_openai_embeddings_with_user_parameter[llama_stack_client-cerebras/llama-3.3-70b-None-None-None-384] SKIPPED (embedding_model_id empty - skipping test)                                    [ 68%]
tests/integration/inference/test_openai_embeddings.py::test_openai_embeddings_empty_list_error[llama_stack_client-cerebras/llama-3.3-70b-None-None-None-384] SKIPPED (embedding_model_id empty - skipping test)                                       [ 70%]
tests/integration/inference/test_openai_embeddings.py::test_openai_embeddings_invalid_model_error[llama_stack_client-cerebras/llama-3.3-70b-None-None-None-384] SKIPPED (embedding_model_id empty - skipping test)                                    [ 72%]
tests/integration/inference/test_openai_embeddings.py::test_openai_embeddings_different_inputs_different_outputs[llama_stack_client-cerebras/llama-3.3-70b-None-None-None-384] SKIPPED (embedding_model_id empty - skipping test)                     [ 74%]
tests/integration/inference/test_openai_embeddings.py::test_openai_embeddings_with_encoding_format_base64[llama_stack_client-cerebras/llama-3.3-70b-None-None-None-384] SKIPPED (embedding_model_id empty - skipping test)                            [ 76%]
tests/integration/inference/test_openai_embeddings.py::test_openai_embeddings_base64_batch_processing[llama_stack_client-cerebras/llama-3.3-70b-None-None-None-384] SKIPPED (embedding_model_id empty - skipping test)                                [ 78%]
tests/integration/inference/test_openai_completion.py::test_openai_chat_completion_non_streaming[client_with_models-txt=cerebras/llama-3.3-70b-inference:chat_completion:non_streaming_01] PASSED                                                     [ 80%]
tests/integration/inference/test_openai_completion.py::test_openai_chat_completion_streaming[client_with_models-txt=cerebras/llama-3.3-70b-inference:chat_completion:streaming_01] PASSED                                                             [ 82%]
tests/integration/inference/test_openai_completion.py::test_openai_chat_completion_streaming_with_n[client_with_models-txt=cerebras/llama-3.3-70b-inference:chat_completion:streaming_01] SKIPPED (Model cerebras/llama-3.3-70b hosted by remote:...) [ 85%]
tests/integration/inference/test_openai_completion.py::test_inference_store[client_with_models-txt=cerebras/llama-3.3-70b-True] PASSED                                                                                                                [ 87%]
tests/integration/inference/test_openai_completion.py::test_inference_store_tool_calls[client_with_models-txt=cerebras/llama-3.3-70b-True] PASSED                                                                                                     [ 89%]
tests/integration/inference/test_openai_completion.py::test_openai_chat_completion_non_streaming[client_with_models-txt=cerebras/llama-3.3-70b-inference:chat_completion:non_streaming_02] PASSED                                                     [ 91%]
tests/integration/inference/test_openai_completion.py::test_openai_chat_completion_streaming[client_with_models-txt=cerebras/llama-3.3-70b-inference:chat_completion:streaming_02] PASSED                                                             [ 93%]
tests/integration/inference/test_openai_completion.py::test_openai_chat_completion_streaming_with_n[client_with_models-txt=cerebras/llama-3.3-70b-inference:chat_completion:streaming_02] SKIPPED (Model cerebras/llama-3.3-70b hosted by remote:...) [ 95%]
tests/integration/inference/test_openai_completion.py::test_inference_store[client_with_models-txt=cerebras/llama-3.3-70b-False] PASSED                                                                                                               [ 97%]
tests/integration/inference/test_openai_completion.py::test_inference_store_tool_calls[client_with_models-txt=cerebras/llama-3.3-70b-False] PASSED                                                                                                    [100%]

=================================================================================================================== slowest 10 durations ====================================================================================================================
0.37s call     tests/integration/inference/test_openai_completion.py::test_openai_chat_completion_non_streaming[openai_client-txt=cerebras/llama-3.3-70b-inference:chat_completion:non_streaming_01]
0.34s call     tests/integration/inference/test_openai_completion.py::test_inference_store[openai_client-txt=cerebras/llama-3.3-70b-False]
0.18s call     tests/integration/inference/test_openai_completion.py::test_inference_store[client_with_models-txt=cerebras/llama-3.3-70b-True]
0.17s setup    tests/integration/inference/test_openai_completion.py::test_openai_completion_non_streaming[txt=cerebras/llama-3.3-70b-inference:completion:sanity]
0.15s call     tests/integration/inference/test_openai_completion.py::test_inference_store_tool_calls[client_with_models-txt=cerebras/llama-3.3-70b-True]
0.13s call     tests/integration/inference/test_openai_completion.py::test_inference_store_tool_calls[openai_client-txt=cerebras/llama-3.3-70b-True]
0.12s call     tests/integration/inference/test_openai_completion.py::test_inference_store_tool_calls[client_with_models-txt=cerebras/llama-3.3-70b-False]
0.12s call     tests/integration/inference/test_openai_completion.py::test_inference_store[openai_client-txt=cerebras/llama-3.3-70b-True]
0.12s call     tests/integration/inference/test_openai_completion.py::test_inference_store_tool_calls[openai_client-txt=cerebras/llama-3.3-70b-False]
0.08s call     tests/integration/inference/test_openai_completion.py::test_openai_chat_completion_streaming[client_with_models-txt=cerebras/llama-3.3-70b-inference:chat_completion:streaming_02]
================================================================================================================== short test summary info ==================================================================================================================
SKIPPED [1] tests/integration/inference/test_openai_completion.py:75: Suffix is not supported for the model: cerebras/llama-3.3-70b.
SKIPPED [3] tests/integration/inference/test_openai_completion.py:123: Model cerebras/llama-3.3-70b hosted by remote::cerebras doesn't support vllm extra_body parameters.
SKIPPED [4] tests/integration/inference/test_openai_completion.py:103: Model cerebras/llama-3.3-70b hosted by remote::cerebras doesn't support n param.
SKIPPED [1] tests/integration/inference/test_openai_completion.py:129: Model cerebras/llama-3.3-70b hosted by remote::cerebras doesn't support chat completion calls with base64 encoded files.
SKIPPED [2] tests/integration/inference/test_openai_embeddings.py:90: embedding_model_id empty - skipping test
SKIPPED [2] tests/integration/inference/test_openai_embeddings.py:112: embedding_model_id empty - skipping test
SKIPPED [2] tests/integration/inference/test_openai_embeddings.py:136: embedding_model_id empty - skipping test
SKIPPED [2] tests/integration/inference/test_openai_embeddings.py:154: embedding_model_id empty - skipping test
SKIPPED [2] tests/integration/inference/test_openai_embeddings.py:175: embedding_model_id empty - skipping test
SKIPPED [2] tests/integration/inference/test_openai_embeddings.py:195: embedding_model_id empty - skipping test
SKIPPED [2] tests/integration/inference/test_openai_embeddings.py:206: embedding_model_id empty - skipping test
SKIPPED [2] tests/integration/inference/test_openai_embeddings.py:217: embedding_model_id empty - skipping test
SKIPPED [2] tests/integration/inference/test_openai_embeddings.py:244: embedding_model_id empty - skipping test
SKIPPED [2] tests/integration/inference/test_openai_embeddings.py:278: embedding_model_id empty - skipping test
================================================================================================= 18 passed, 29 skipped, 50 deselected, 4 warnings in 3.02s =================================================================================================
```
2025-09-23 16:26:00 -04:00
..
0b3f2e4754ff.json feat(tests): auto-merge all model list responses and unify recordings (#3320) 2025-09-03 11:33:03 -07:00
0b27fd737699.json feat(tests): auto-merge all model list responses and unify recordings (#3320) 2025-09-03 11:33:03 -07:00
00ba04f74a96.json feat(tests): auto-merge all model list responses and unify recordings (#3320) 2025-09-03 11:33:03 -07:00
0c1f45455d3b.json feat: add embedding and dynamic model support to Together inference adapter (#3458) 2025-09-16 11:53:41 -07:00
0d3290adae1d.json feat: update Cerebras inference provider to support dynamic model listing (#3481) 2025-09-23 16:26:00 -04:00
0e8f2b001dd9.json feat(tests): auto-merge all model list responses and unify recordings (#3320) 2025-09-03 11:33:03 -07:00
0fda25b9241c.json feat: add Azure OpenAI inference provider support (#3396) 2025-09-11 13:48:38 +02:00
0ff78129bb3a.json feat(ci): use replay mode, setup ollama if specific label exists on PR (#2955) 2025-07-29 16:50:26 -07:00
1adfaa0e062e.json feat(tests): auto-merge all model list responses and unify recordings (#3320) 2025-09-03 11:33:03 -07:00
1b92be674e2a.json feat(tests): auto-merge all model list responses and unify recordings (#3320) 2025-09-03 11:33:03 -07:00
1b8394f90636.json feat(tests): auto-merge all model list responses and unify recordings (#3320) 2025-09-03 11:33:03 -07:00
1b45391880c6.json feat(ci): add support for running vision inference tests (#2972) 2025-07-31 11:50:42 -07:00
1e11c2b20ff8.json feat(tests): auto-merge all model list responses and unify recordings (#3320) 2025-09-03 11:33:03 -07:00
1f48f4b2ae33.json feat(ci): add support for running vision inference tests (#2972) 2025-07-31 11:50:42 -07:00
2afe3b38ca01.json feat(tests): auto-merge all model list responses and unify recordings (#3320) 2025-09-03 11:33:03 -07:00
2b2ad549510d.json feat: add Azure OpenAI inference provider support (#3396) 2025-09-11 13:48:38 +02:00
2d187a11704c.json feat(tests): auto-merge all model list responses and unify recordings (#3320) 2025-09-03 11:33:03 -07:00
3a81146f2afa.json chore: update the ollama inference impl to use OpenAIMixin for openai-compat functions (#3395) 2025-09-18 13:09:57 +02:00
3c0bf9ba81b2.json feat(tests): auto-merge all model list responses and unify recordings (#3320) 2025-09-03 11:33:03 -07:00
3c3f13cb7794.json feat(tests): auto-merge all model list responses and unify recordings (#3320) 2025-09-03 11:33:03 -07:00
3ca695048bee.json feat(tests): auto-merge all model list responses and unify recordings (#3320) 2025-09-03 11:33:03 -07:00
3cdb5cab6ce6.json feat: update Cerebras inference provider to support dynamic model listing (#3481) 2025-09-23 16:26:00 -04:00
3dff18060ebc.json feat(tests): auto-merge all model list responses and unify recordings (#3320) 2025-09-03 11:33:03 -07:00
3ef0f9aab128.json feat: (re-)enable Databricks inference adapter (#3500) 2025-09-23 15:37:23 -04:00
4a3a4447b16b.json fix(ci, tests): ensure uv environments in CI are kosher, record tests (#3193) 2025-08-18 17:02:24 -07:00
4ca6152a0eb8.json feat: add embedding and dynamic model support to Together inference adapter (#3458) 2025-09-16 11:53:41 -07:00
4d4440c8641b.json feat: add dynamic model registration support to TGI inference (#3417) 2025-09-15 15:52:40 -04:00
4de6877d86fa.json feat: implement query_metrics (#3074) 2025-08-22 14:19:24 -07:00
5db0c44c83a4.json feat: implement query_metrics (#3074) 2025-08-22 14:19:24 -07:00
5f5d16afadb4.json feat(tests): auto-merge all model list responses and unify recordings (#3320) 2025-09-03 11:33:03 -07:00
5fa0e98f3d84.json feat: (re-)enable Databricks inference adapter (#3500) 2025-09-23 15:37:23 -04:00
6c4e2e207e8a.json feat: add embedding and dynamic model support to Together inference adapter (#3458) 2025-09-16 11:53:41 -07:00
6cb0285a7638.json feat: implement query_metrics (#3074) 2025-08-22 14:19:24 -07:00
6cc063bbd7d3.json feat(tests): auto-merge all model list responses and unify recordings (#3320) 2025-09-03 11:33:03 -07:00
6d35c91287e2.json feat(tests): auto-merge all model list responses and unify recordings (#3320) 2025-09-03 11:33:03 -07:00
6d937e5e9233.json feat: (re-)enable Databricks inference adapter (#3500) 2025-09-23 15:37:23 -04:00
6f96090aa955.json feat(tests): auto-merge all model list responses and unify recordings (#3320) 2025-09-03 11:33:03 -07:00
6fbea1abca7c.json feat(tests): auto-merge all model list responses and unify recordings (#3320) 2025-09-03 11:33:03 -07:00
6fe1d4fedf12.json feat(tests): auto-merge all model list responses and unify recordings (#3320) 2025-09-03 11:33:03 -07:00
7b25b702ea18.json feat(tests): auto-merge all model list responses and unify recordings (#3320) 2025-09-03 11:33:03 -07:00
7b4815aba6c5.json feat(tests): auto-merge all model list responses and unify recordings (#3320) 2025-09-03 11:33:03 -07:00
7bcb0f86c91b.json feat: implement query_metrics (#3074) 2025-08-22 14:19:24 -07:00
07c5fa34d9ca.json feat: add embedding and dynamic model support to Together inference adapter (#3458) 2025-09-16 11:53:41 -07:00
7e6806cba34a.json feat(tests): auto-merge all model list responses and unify recordings (#3320) 2025-09-03 11:33:03 -07:00
7ef63231b9f8.json feat: add dynamic model registration support to TGI inference (#3417) 2025-09-15 15:52:40 -04:00
7f53b458dad9.json feat: (re-)enable Databricks inference adapter (#3500) 2025-09-23 15:37:23 -04:00
8bba71367e87.json feat(ci): add support for running vision inference tests (#2972) 2025-07-31 11:50:42 -07:00
9b9e8cf39b15.json feat: (re-)enable Databricks inference adapter (#3500) 2025-09-23 15:37:23 -04:00
9b812cbcb88d.json fix(recording): endpoint resolution (#3013) 2025-08-01 16:23:54 -07:00
9c4bc9c3e7ac.json feat(ci): add support for running vision inference tests (#2972) 2025-07-31 11:50:42 -07:00
9c007f300365.json fix: telemetry fixes (inference and core telemetry) (#2733) 2025-08-06 13:37:40 -07:00
9c28ec9ac338.json feat(tests): auto-merge all model list responses and unify recordings (#3320) 2025-09-03 11:33:03 -07:00
9c140a29ae09.json feat(tests): auto-merge all model list responses and unify recordings (#3320) 2025-09-03 11:33:03 -07:00
9e7a83d3d596.json feat(tests): auto-merge all model list responses and unify recordings (#3320) 2025-09-03 11:33:03 -07:00
9e651e5fcfe2.json feat(tests): auto-merge all model list responses and unify recordings (#3320) 2025-09-03 11:33:03 -07:00
9f3d749cc1c8.json feat: add Azure OpenAI inference provider support (#3396) 2025-09-11 13:48:38 +02:00
9fadf5a3d68f.json feat(tests): auto-merge all model list responses and unify recordings (#3320) 2025-09-03 11:33:03 -07:00
10eea8c15ddc.json feat(tests): auto-merge all model list responses and unify recordings (#3320) 2025-09-03 11:33:03 -07:00
011f70e24ce4.json feat(ci): add support for running vision inference tests (#2972) 2025-07-31 11:50:42 -07:00
35db283fef1d.json feat(ci): add support for running vision inference tests (#2972) 2025-07-31 11:50:42 -07:00
40f524d1934a.json fix(recording): endpoint resolution (#3013) 2025-08-01 16:23:54 -07:00
41e27b9b5d09.json feat(batches, completions): add /v1/completions support to /v1/batches (#3309) 2025-09-05 11:59:57 -07:00
43e106de6736.json feat(ci): add support for running vision inference tests (#2972) 2025-07-31 11:50:42 -07:00
44a1d9de0602.json feat(tests): auto-merge all model list responses and unify recordings (#3320) 2025-09-03 11:33:03 -07:00
44fb9cf5875f.json feat(tests): auto-merge all model list responses and unify recordings (#3320) 2025-09-03 11:33:03 -07:00
48d2fb183a2a.json feat(tests): auto-merge all model list responses and unify recordings (#3320) 2025-09-03 11:33:03 -07:00
50a8dc5b8ece.json feat: update Cerebras inference provider to support dynamic model listing (#3481) 2025-09-23 16:26:00 -04:00
56ac6a7c6df0.json feat(ci): add support for running vision inference tests (#2972) 2025-07-31 11:50:42 -07:00
57b67d1b1a36.json feat: add Azure OpenAI inference provider support (#3396) 2025-09-11 13:48:38 +02:00
61be36ad8ccd.json feat(ci): add support for running vision inference tests (#2972) 2025-07-31 11:50:42 -07:00
62aa454ea5f9.json feat(tests): auto-merge all model list responses and unify recordings (#3320) 2025-09-03 11:33:03 -07:00
70adef2c30c4.json feat(tests): auto-merge all model list responses and unify recordings (#3320) 2025-09-03 11:33:03 -07:00
72c1126ff2f9.json feat(tests): auto-merge all model list responses and unify recordings (#3320) 2025-09-03 11:33:03 -07:00
72e075bf28e8.json feat: add embedding and dynamic model support to Together inference adapter (#3458) 2025-09-16 11:53:41 -07:00
75d0dd9d0fa3.json feat(tests): auto-merge all model list responses and unify recordings (#3320) 2025-09-03 11:33:03 -07:00
76b89a84cd6f.json feat(ci): add support for running vision inference tests (#2972) 2025-07-31 11:50:42 -07:00
80e4404d8987.json feat(tests): auto-merge all model list responses and unify recordings (#3320) 2025-09-03 11:33:03 -07:00
80f09f27dd61.json feat(ci): add support for running vision inference tests (#2972) 2025-07-31 11:50:42 -07:00
81a91f79c51d.json fix(recording): endpoint resolution (#3013) 2025-08-01 16:23:54 -07:00
83c2ffb72daa.json feat(ci): add support for running vision inference tests (#2972) 2025-07-31 11:50:42 -07:00
84cab42e1f5c.json feat(tests): auto-merge all model list responses and unify recordings (#3320) 2025-09-03 11:33:03 -07:00
89b141855b81.json feat: add dynamic model registration support to TGI inference (#3417) 2025-09-15 15:52:40 -04:00
90fec951fdb9.json feat(ci): add support for running vision inference tests (#2972) 2025-07-31 11:50:42 -07:00
94d11daee205.json feat: add Azure OpenAI inference provider support (#3396) 2025-09-11 13:48:38 +02:00
97d3812bfccb.json feat(tests): auto-merge all model list responses and unify recordings (#3320) 2025-09-03 11:33:03 -07:00
97e259c0d3e5.json feat(tests): auto-merge all model list responses and unify recordings (#3320) 2025-09-03 11:33:03 -07:00
121a72d1c4cf.json feat: (re-)enable Databricks inference adapter (#3500) 2025-09-23 15:37:23 -04:00
173ecb3aab28.json feat(tests): auto-merge all model list responses and unify recordings (#3320) 2025-09-03 11:33:03 -07:00
177ba8517262.json feat(ci): add support for running vision inference tests (#2972) 2025-07-31 11:50:42 -07:00
198ef7208389.json feat(tests): auto-merge all model list responses and unify recordings (#3320) 2025-09-03 11:33:03 -07:00
211b1562d4e6.json feat(tests): auto-merge all model list responses and unify recordings (#3320) 2025-09-03 11:33:03 -07:00
249b7f0ddde6.json feat: (re-)enable Databricks inference adapter (#3500) 2025-09-23 15:37:23 -04:00
325a72db5755.json feat(tests): auto-merge all model list responses and unify recordings (#3320) 2025-09-03 11:33:03 -07:00
382c2f22274c.json feat(tests): auto-merge all model list responses and unify recordings (#3320) 2025-09-03 11:33:03 -07:00
390f0c7dac96.json feat: implement query_metrics (#3074) 2025-08-22 14:19:24 -07:00
432a346b2ed8.json feat: add embedding and dynamic model support to Together inference adapter (#3458) 2025-09-16 11:53:41 -07:00
441e2832387f.json feat: (re-)enable Databricks inference adapter (#3500) 2025-09-23 15:37:23 -04:00
477f8946bf7d.json feat(ci): add support for running vision inference tests (#2972) 2025-07-31 11:50:42 -07:00
511eb1b92e34.json feat: add embedding and dynamic model support to Together inference adapter (#3458) 2025-09-16 11:53:41 -07:00
545d86510a80.json feat(tests): auto-merge all model list responses and unify recordings (#3320) 2025-09-03 11:33:03 -07:00
0547d0909f24.json feat: update Cerebras inference provider to support dynamic model listing (#3481) 2025-09-23 16:26:00 -04:00
554de3cd986f.json feat(tests): auto-merge all model list responses and unify recordings (#3320) 2025-09-03 11:33:03 -07:00
563b994bb7d1.json feat(tests): auto-merge all model list responses and unify recordings (#3320) 2025-09-03 11:33:03 -07:00
565b1072cb9d.json feat: add embedding and dynamic model support to Together inference adapter (#3458) 2025-09-16 11:53:41 -07:00
802f60021837.json feat(tests): auto-merge all model list responses and unify recordings (#3320) 2025-09-03 11:33:03 -07:00
830a1fe14938.json fix: telemetry fixes (inference and core telemetry) (#2733) 2025-08-06 13:37:40 -07:00
836f51dfb3c5.json feat(tests): auto-merge all model list responses and unify recordings (#3320) 2025-09-03 11:33:03 -07:00
840fbb380b73.json feat(tests): auto-merge all model list responses and unify recordings (#3320) 2025-09-03 11:33:03 -07:00
894fdacb1cfa.json feat: add embedding and dynamic model support to Together inference adapter (#3458) 2025-09-16 11:53:41 -07:00
2983cc1d79f0.json feat: update Cerebras inference provider to support dynamic model listing (#3481) 2025-09-23 16:26:00 -04:00
3877ecf1bc62.json chore: move recordings one directory upwards 2025-07-29 12:46:19 -07:00
4014dd44c15f.json feat(ci): add support for running vision inference tests (#2972) 2025-07-31 11:50:42 -07:00
6730dcde0b73.json feat: add embedding and dynamic model support to Together inference adapter (#3458) 2025-09-16 11:53:41 -07:00
6857b19d3f0a.json feat: add embedding and dynamic model support to Together inference adapter (#3458) 2025-09-16 11:53:41 -07:00
6906a6e71988.json feat(tests): auto-merge all model list responses and unify recordings (#3320) 2025-09-03 11:33:03 -07:00
7354ec181984.json feat(tests): auto-merge all model list responses and unify recordings (#3320) 2025-09-03 11:33:03 -07:00
12913f20f6ac.json feat(ci): add support for running vision inference tests (#2972) 2025-07-31 11:50:42 -07:00
17030e75309f.json feat: add embedding and dynamic model support to Together inference adapter (#3458) 2025-09-16 11:53:41 -07:00
17253d7cc667.json feat(tests): auto-merge all model list responses and unify recordings (#3320) 2025-09-03 11:33:03 -07:00
23506e73bb9e.json feat(tests): auto-merge all model list responses and unify recordings (#3320) 2025-09-03 11:33:03 -07:00
28648cf8d421.json feat: (re-)enable Databricks inference adapter (#3500) 2025-09-23 15:37:23 -04:00
29585e055e6f.json feat: (re-)enable Databricks inference adapter (#3500) 2025-09-23 15:37:23 -04:00
31407e035752.json feat(ci): add support for running vision inference tests (#2972) 2025-07-31 11:50:42 -07:00
50340cd4d253.json feat(tests): auto-merge all model list responses and unify recordings (#3320) 2025-09-03 11:33:03 -07:00
67198cbad48f.json fix: telemetry fixes (inference and core telemetry) (#2733) 2025-08-06 13:37:40 -07:00
85594a69d74a.json feat(tests): auto-merge all model list responses and unify recordings (#3320) 2025-09-03 11:33:03 -07:00
140187e305dc.json feat(tests): auto-merge all model list responses and unify recordings (#3320) 2025-09-03 11:33:03 -07:00
174458ad71b2.json feat(tests): auto-merge all model list responses and unify recordings (#3320) 2025-09-03 11:33:03 -07:00
178016edef0e.json feat(tests): auto-merge all model list responses and unify recordings (#3320) 2025-09-03 11:33:03 -07:00
197228e26971.json feat(tests): auto-merge all model list responses and unify recordings (#3320) 2025-09-03 11:33:03 -07:00
561746e1c8de.json feat(tests): auto-merge all model list responses and unify recordings (#3320) 2025-09-03 11:33:03 -07:00
0648374e43e7.json feat: update Cerebras inference provider to support dynamic model listing (#3481) 2025-09-23 16:26:00 -04:00
731824c54461.json fix(ci, tests): ensure uv environments in CI are kosher, record tests (#3193) 2025-08-18 17:02:24 -07:00
875323ed9913.json feat: (re-)enable Databricks inference adapter (#3500) 2025-09-23 15:37:23 -04:00
4096743baf8e.json fix: telemetry fixes (inference and core telemetry) (#2733) 2025-08-06 13:37:40 -07:00
04172112ffbb.json feat(tests): auto-merge all model list responses and unify recordings (#3320) 2025-09-03 11:33:03 -07:00
4597743bcd2a.json feat(ci): use replay mode, setup ollama if specific label exists on PR (#2955) 2025-07-29 16:50:26 -07:00
8295382a8e7c.json feat(tests): auto-merge all model list responses and unify recordings (#3320) 2025-09-03 11:33:03 -07:00
8752115f8d0c.json feat: add Azure OpenAI inference provider support (#3396) 2025-09-11 13:48:38 +02:00
27463384d1a3.json feat: add dynamic model registration support to TGI inference (#3417) 2025-09-15 15:52:40 -04:00
4420515208a8.json feat(tests): auto-merge all model list responses and unify recordings (#3320) 2025-09-03 11:33:03 -07:00
5370751803dc.json feat(tests): auto-merge all model list responses and unify recordings (#3320) 2025-09-03 11:33:03 -07:00
6412295819a1.json chore: update the ollama inference impl to use OpenAIMixin for openai-compat functions (#3395) 2025-09-18 13:09:57 +02:00
417020320684.json feat(tests): auto-merge all model list responses and unify recordings (#3320) 2025-09-03 11:33:03 -07:00
a0c4df33879f.json feat(tests): auto-merge all model list responses and unify recordings (#3320) 2025-09-03 11:33:03 -07:00
a1c5bf09ea53.json feat: update Cerebras inference provider to support dynamic model listing (#3481) 2025-09-23 16:26:00 -04:00
a4c8d19bb1eb.json feat(tests): auto-merge all model list responses and unify recordings (#3320) 2025-09-03 11:33:03 -07:00
a59d0d7c1485.json feat(tests): auto-merge all model list responses and unify recordings (#3320) 2025-09-03 11:33:03 -07:00
a98eecadddc8.json feat: add dynamic model registration support to TGI inference (#3417) 2025-09-15 15:52:40 -04:00
a410d4840402.json feat(ci): add support for running vision inference tests (#2972) 2025-07-31 11:50:42 -07:00
a5187d9d5057.json feat(tests): auto-merge all model list responses and unify recordings (#3320) 2025-09-03 11:33:03 -07:00
a6810c23eda8.json feat(tests): auto-merge all model list responses and unify recordings (#3320) 2025-09-03 11:33:03 -07:00
a97477559b10.json feat(ci): add support for running vision inference tests (#2972) 2025-07-31 11:50:42 -07:00
aacf9abc51d4.json feat: add dynamic model registration support to TGI inference (#3417) 2025-09-15 15:52:40 -04:00
ae1c22f18ecc.json feat(tests): auto-merge all model list responses and unify recordings (#3320) 2025-09-03 11:33:03 -07:00
ae6835cfe70e.json feat(tests): auto-merge all model list responses and unify recordings (#3320) 2025-09-03 11:33:03 -07:00
af6ca03dcbc3.json feat: update Cerebras inference provider to support dynamic model listing (#3481) 2025-09-23 16:26:00 -04:00
afb33182f365.json feat(ci): add support for running vision inference tests (#2972) 2025-07-31 11:50:42 -07:00
b4cda53cd04f.json feat(ci): add support for running vision inference tests (#2972) 2025-07-31 11:50:42 -07:00
b5e3ed420986.json feat(tests): auto-merge all model list responses and unify recordings (#3320) 2025-09-03 11:33:03 -07:00
b9f6e724ae06.json feat: add dynamic model registration support to TGI inference (#3417) 2025-09-15 15:52:40 -04:00
b14ff438ca99.json feat(tests): auto-merge all model list responses and unify recordings (#3320) 2025-09-03 11:33:03 -07:00
b44cc7a7afc8.json chore(misc): make tests and starter faster (#3042) 2025-08-05 14:55:05 -07:00
b91f1fb4aedb.json fix(recording): endpoint resolution (#3013) 2025-08-01 16:23:54 -07:00
b459f403a5ae.json feat: update Cerebras inference provider to support dynamic model listing (#3481) 2025-09-23 16:26:00 -04:00
b612debbd3bf.json feat(tests): auto-merge all model list responses and unify recordings (#3320) 2025-09-03 11:33:03 -07:00
b24590574a85.json feat(ci): add support for running vision inference tests (#2972) 2025-07-31 11:50:42 -07:00
bbd0637dce16.json fix(recording): endpoint resolution (#3013) 2025-08-01 16:23:54 -07:00
bce560cbf1c6.json feat: add embedding and dynamic model support to Together inference adapter (#3458) 2025-09-16 11:53:41 -07:00
bd356b27a085.json feat(tests): auto-merge all model list responses and unify recordings (#3320) 2025-09-03 11:33:03 -07:00
bf79a89cc37f.json feat: implement query_metrics (#3074) 2025-08-22 14:19:24 -07:00
c9cba6f3ee38.json feat(tests): auto-merge all model list responses and unify recordings (#3320) 2025-09-03 11:33:03 -07:00
c31a86ea6c58.json feat: implement query_metrics (#3074) 2025-08-22 14:19:24 -07:00
c2199d6064db.json feat(tests): auto-merge all model list responses and unify recordings (#3320) 2025-09-03 11:33:03 -07:00
c7582fa7c2c4.json feat(ci): use replay mode, setup ollama if specific label exists on PR (#2955) 2025-07-29 16:50:26 -07:00
c791119e6359.json feat: add Azure OpenAI inference provider support (#3396) 2025-09-11 13:48:38 +02:00
c9667519ad7c.json fix: telemetry fixes (inference and core telemetry) (#2733) 2025-08-06 13:37:40 -07:00
cb3df2a1dc22.json feat(tests): auto-merge all model list responses and unify recordings (#3320) 2025-09-03 11:33:03 -07:00
cd094caaf1c0.json feat(tests): auto-merge all model list responses and unify recordings (#3320) 2025-09-03 11:33:03 -07:00
cf55f983d1ff.json feat: add dynamic model registration support to TGI inference (#3417) 2025-09-15 15:52:40 -04:00
d0ac68cbde69.json feat(tests): auto-merge all model list responses and unify recordings (#3320) 2025-09-03 11:33:03 -07:00
d3e27b7234e2.json feat: add Azure OpenAI inference provider support (#3396) 2025-09-11 13:48:38 +02:00
d4c86ac355fb.json feat(tests): auto-merge all model list responses and unify recordings (#3320) 2025-09-03 11:33:03 -07:00
d4f56d7d1996.json fix: telemetry fixes (inference and core telemetry) (#2733) 2025-08-06 13:37:40 -07:00
d64ffaa0de6f.json feat: (re-)enable Databricks inference adapter (#3500) 2025-09-23 15:37:23 -04:00
d86d4fc1eaca.json feat(tests): auto-merge all model list responses and unify recordings (#3320) 2025-09-03 11:33:03 -07:00
d85689907fec.json feat: add embedding and dynamic model support to Together inference adapter (#3458) 2025-09-16 11:53:41 -07:00
da531c71e64f.json chore(misc): make tests and starter faster (#3042) 2025-08-05 14:55:05 -07:00
dac7a32e5db9.json feat(tests): auto-merge all model list responses and unify recordings (#3320) 2025-09-03 11:33:03 -07:00
dbc41d2417e1.json chore(misc): make tests and starter faster (#3042) 2025-08-05 14:55:05 -07:00
dc8120cf0774.json feat: implement query_metrics (#3074) 2025-08-22 14:19:24 -07:00
dd9e7d5913e9.json feat(tests): auto-merge all model list responses and unify recordings (#3320) 2025-09-03 11:33:03 -07:00
dd226d71f844.json feat(tests): auto-merge all model list responses and unify recordings (#3320) 2025-09-03 11:33:03 -07:00
decfd950646c.json feat(tests): auto-merge all model list responses and unify recordings (#3320) 2025-09-03 11:33:03 -07:00
e0a6dce1d94b.json feat(tests): auto-merge all model list responses and unify recordings (#3320) 2025-09-03 11:33:03 -07:00
e2c9b07709fe.json feat(tests): auto-merge all model list responses and unify recordings (#3320) 2025-09-03 11:33:03 -07:00
e08e01e5652a.json feat: add dynamic model registration support to TGI inference (#3417) 2025-09-15 15:52:40 -04:00
e9c8a0e4f0e0.json feat(tests): auto-merge all model list responses and unify recordings (#3320) 2025-09-03 11:33:03 -07:00
e08848bfcd28.json feat(ci): add support for running vision inference tests (#2972) 2025-07-31 11:50:42 -07:00
e509387fc329.json feat: (re-)enable Databricks inference adapter (#3500) 2025-09-23 15:37:23 -04:00
e29300494763.json feat(ci): add support for running vision inference tests (#2972) 2025-07-31 11:50:42 -07:00
e96152610712.json feat(tests): auto-merge all model list responses and unify recordings (#3320) 2025-09-03 11:33:03 -07:00
ecae140151d1.json chore: update the ollama inference impl to use OpenAIMixin for openai-compat functions (#3395) 2025-09-18 13:09:57 +02:00
ecf6f0c51485.json feat: (re-)enable Databricks inference adapter (#3500) 2025-09-23 15:37:23 -04:00
ed9e9b34008d.json feat(tests): auto-merge all model list responses and unify recordings (#3320) 2025-09-03 11:33:03 -07:00
eee6a163b837.json feat(ci): add support for running vision inference tests (#2972) 2025-07-31 11:50:42 -07:00
eee47930e3ae.json feat(tests): auto-merge all model list responses and unify recordings (#3320) 2025-09-03 11:33:03 -07:00
eefb4206a4a9.json feat: update Cerebras inference provider to support dynamic model listing (#3481) 2025-09-23 16:26:00 -04:00
ef59cbff54d0.json feat(tests): auto-merge all model list responses and unify recordings (#3320) 2025-09-03 11:33:03 -07:00
ef757a75ed08.json feat(tests): auto-merge all model list responses and unify recordings (#3320) 2025-09-03 11:33:03 -07:00
f0bbea34c5cc.json feat: add embedding and dynamic model support to Together inference adapter (#3458) 2025-09-16 11:53:41 -07:00
f1ea938b0b0d.json chore(misc): make tests and starter faster (#3042) 2025-08-05 14:55:05 -07:00
f3c3afbd9b7e.json feat(ci): use replay mode, setup ollama if specific label exists on PR (#2955) 2025-07-29 16:50:26 -07:00
f6d655e91ac3.json feat(tests): auto-merge all model list responses and unify recordings (#3320) 2025-09-03 11:33:03 -07:00
f70f30f54211.json feat(tests): auto-merge all model list responses and unify recordings (#3320) 2025-09-03 11:33:03 -07:00
f80b99430f7e.json feat: implement query_metrics (#3074) 2025-08-22 14:19:24 -07:00
f477c2fe1332.json feat(tests): auto-merge all model list responses and unify recordings (#3320) 2025-09-03 11:33:03 -07:00
f518ea4fde7d.json feat: add dynamic model registration support to TGI inference (#3417) 2025-09-15 15:52:40 -04:00
f1592dee71e5.json feat(tests): auto-merge all model list responses and unify recordings (#3320) 2025-09-03 11:33:03 -07:00
f6857bcea729.json feat: implement query_metrics (#3074) 2025-08-22 14:19:24 -07:00
fb785db7fafd.json feat: add Azure OpenAI inference provider support (#3396) 2025-09-11 13:48:38 +02:00
fcdef245da95.json feat(tests): auto-merge all model list responses and unify recordings (#3320) 2025-09-03 11:33:03 -07:00
fe140befeba4.json feat(ci): add support for running vision inference tests (#2972) 2025-07-31 11:50:42 -07:00
ff7db0102b28.json feat(tests): auto-merge all model list responses and unify recordings (#3320) 2025-09-03 11:33:03 -07:00
ff3271401fb4.json feat: add Azure OpenAI inference provider support (#3396) 2025-09-11 13:48:38 +02:00
ffd7b58fded8.json feat: (re-)enable Databricks inference adapter (#3500) 2025-09-23 15:37:23 -04:00
models-4a3a4447b16b-3057338f.json feat(tests): auto-merge all model list responses and unify recordings (#3320) 2025-09-03 11:33:03 -07:00
models-bd032f995f2a-af43cc69.json feat: update Cerebras inference provider to support dynamic model listing (#3481) 2025-09-23 16:26:00 -04:00