llama-stack-mirror

mirror of https://github.com/meta-llama/llama-stack.git synced 2025-12-03 18:00:36 +00:00

History

Matthew Farrellee ce7a3b4dff feat: update Cerebras inference provider to support dynamic model listing (#3481 ) # What does this PR do? - update Cerebras to use OpenAIMixin - enable openai completions tests - enable openai chat completions tests - disable with n > 1 tests - add recording for --setup cerebras --subdirs inference --pattern openai ## Test Plan `./scripts/integration-tests.sh --stack-config server:ci-tests --setup cerebras --subdirs inference --pattern openai` ``` tests/integration/inference/test_openai_completion.py::test_openai_completion_non_streaming[txt=cerebras/llama-3.3-70b-inference:completion:sanity] instantiating llama_stack_client Port 8321 is already in use, assuming server is already running... llama_stack_client instantiated in 0.053s PASSED [ 2%] tests/integration/inference/test_openai_completion.py::test_openai_completion_non_streaming_suffix[txt=cerebras/llama-3.3-70b-inference:completion:suffix] SKIPPED (Suffix is not supported for the model: cerebras/llama-3.3-70b.) [ 4%] tests/integration/inference/test_openai_completion.py::test_openai_completion_streaming[txt=cerebras/llama-3.3-70b-inference:completion:sanity] PASSED [ 6%] tests/integration/inference/test_openai_completion.py::test_openai_completion_prompt_logprobs[txt=cerebras/llama-3.3-70b-1] SKIPPED (Model cerebras/llama-3.3-70b hosted by remote::cerebras doesn't support vllm extra_body parameters.) [ 8%] tests/integration/inference/test_openai_completion.py::test_openai_completion_guided_choice[txt=cerebras/llama-3.3-70b] SKIPPED (Model cerebras/llama-3.3-70b hosted by remote::cerebras doesn't support vllm extra_body parameters.) [ 10%] tests/integration/inference/test_openai_completion.py::test_openai_chat_completion_non_streaming[openai_client-txt=cerebras/llama-3.3-70b-inference:chat_completion:non_streaming_01] PASSED [ 12%] tests/integration/inference/test_openai_completion.py::test_openai_chat_completion_streaming[openai_client-txt=cerebras/llama-3.3-70b-inference:chat_completion:streaming_01] PASSED [ 14%] tests/integration/inference/test_openai_completion.py::test_openai_chat_completion_streaming_with_n[openai_client-txt=cerebras/llama-3.3-70b-inference:chat_completion:streaming_01] SKIPPED (Model cerebras/llama-3.3-70b hosted by remote::cere...) [ 17%] tests/integration/inference/test_openai_completion.py::test_inference_store[openai_client-txt=cerebras/llama-3.3-70b-True] PASSED [ 19%] tests/integration/inference/test_openai_completion.py::test_inference_store_tool_calls[openai_client-txt=cerebras/llama-3.3-70b-True] PASSED [ 21%] tests/integration/inference/test_openai_completion.py::test_openai_chat_completion_non_streaming_with_file[txt=cerebras/llama-3.3-70b] SKIPPED (Model cerebras/llama-3.3-70b hosted by remote::cerebras doesn't support chat completion calls wit...) [ 23%] tests/integration/inference/test_openai_embeddings.py::test_openai_embeddings_single_string[openai_client-cerebras/llama-3.3-70b-None-None-None-384] SKIPPED (embedding_model_id empty - skipping test) [ 25%] tests/integration/inference/test_openai_embeddings.py::test_openai_embeddings_multiple_strings[openai_client-cerebras/llama-3.3-70b-None-None-None-384] SKIPPED (embedding_model_id empty - skipping test) [ 27%] tests/integration/inference/test_openai_embeddings.py::test_openai_embeddings_with_encoding_format_float[openai_client-cerebras/llama-3.3-70b-None-None-None-384] SKIPPED (embedding_model_id empty - skipping test) [ 29%] tests/integration/inference/test_openai_embeddings.py::test_openai_embeddings_with_dimensions[openai_client-cerebras/llama-3.3-70b-None-None-None-384] SKIPPED (embedding_model_id empty - skipping test) [ 31%] tests/integration/inference/test_openai_embeddings.py::test_openai_embeddings_with_user_parameter[openai_client-cerebras/llama-3.3-70b-None-None-None-384] SKIPPED (embedding_model_id empty - skipping test) [ 34%] tests/integration/inference/test_openai_embeddings.py::test_openai_embeddings_empty_list_error[openai_client-cerebras/llama-3.3-70b-None-None-None-384] SKIPPED (embedding_model_id empty - skipping test) [ 36%] tests/integration/inference/test_openai_embeddings.py::test_openai_embeddings_invalid_model_error[openai_client-cerebras/llama-3.3-70b-None-None-None-384] SKIPPED (embedding_model_id empty - skipping test) [ 38%] tests/integration/inference/test_openai_embeddings.py::test_openai_embeddings_different_inputs_different_outputs[openai_client-cerebras/llama-3.3-70b-None-None-None-384] SKIPPED (embedding_model_id empty - skipping test) [ 40%] tests/integration/inference/test_openai_embeddings.py::test_openai_embeddings_with_encoding_format_base64[openai_client-cerebras/llama-3.3-70b-None-None-None-384] SKIPPED (embedding_model_id empty - skipping test) [ 42%] tests/integration/inference/test_openai_embeddings.py::test_openai_embeddings_base64_batch_processing[openai_client-cerebras/llama-3.3-70b-None-None-None-384] SKIPPED (embedding_model_id empty - skipping test) [ 44%] tests/integration/inference/test_openai_completion.py::test_openai_completion_prompt_logprobs[txt=cerebras/llama-3.3-70b-0] SKIPPED (Model cerebras/llama-3.3-70b hosted by remote::cerebras doesn't support vllm extra_body parameters.) [ 46%] tests/integration/inference/test_openai_completion.py::test_openai_chat_completion_non_streaming[openai_client-txt=cerebras/llama-3.3-70b-inference:chat_completion:non_streaming_02] PASSED [ 48%] tests/integration/inference/test_openai_completion.py::test_openai_chat_completion_streaming[openai_client-txt=cerebras/llama-3.3-70b-inference:chat_completion:streaming_02] PASSED [ 51%] tests/integration/inference/test_openai_completion.py::test_openai_chat_completion_streaming_with_n[openai_client-txt=cerebras/llama-3.3-70b-inference:chat_completion:streaming_02] SKIPPED (Model cerebras/llama-3.3-70b hosted by remote::cere...) [ 53%] tests/integration/inference/test_openai_completion.py::test_inference_store[openai_client-txt=cerebras/llama-3.3-70b-False] PASSED [ 55%] tests/integration/inference/test_openai_completion.py::test_inference_store_tool_calls[openai_client-txt=cerebras/llama-3.3-70b-False] PASSED [ 57%] tests/integration/inference/test_openai_embeddings.py::test_openai_embeddings_single_string[llama_stack_client-cerebras/llama-3.3-70b-None-None-None-384] SKIPPED (embedding_model_id empty - skipping test) [ 59%] tests/integration/inference/test_openai_embeddings.py::test_openai_embeddings_multiple_strings[llama_stack_client-cerebras/llama-3.3-70b-None-None-None-384] SKIPPED (embedding_model_id empty - skipping test) [ 61%] tests/integration/inference/test_openai_embeddings.py::test_openai_embeddings_with_encoding_format_float[llama_stack_client-cerebras/llama-3.3-70b-None-None-None-384] SKIPPED (embedding_model_id empty - skipping test) [ 63%] tests/integration/inference/test_openai_embeddings.py::test_openai_embeddings_with_dimensions[llama_stack_client-cerebras/llama-3.3-70b-None-None-None-384] SKIPPED (embedding_model_id empty - skipping test) [ 65%] tests/integration/inference/test_openai_embeddings.py::test_openai_embeddings_with_user_parameter[llama_stack_client-cerebras/llama-3.3-70b-None-None-None-384] SKIPPED (embedding_model_id empty - skipping test) [ 68%] tests/integration/inference/test_openai_embeddings.py::test_openai_embeddings_empty_list_error[llama_stack_client-cerebras/llama-3.3-70b-None-None-None-384] SKIPPED (embedding_model_id empty - skipping test) [ 70%] tests/integration/inference/test_openai_embeddings.py::test_openai_embeddings_invalid_model_error[llama_stack_client-cerebras/llama-3.3-70b-None-None-None-384] SKIPPED (embedding_model_id empty - skipping test) [ 72%] tests/integration/inference/test_openai_embeddings.py::test_openai_embeddings_different_inputs_different_outputs[llama_stack_client-cerebras/llama-3.3-70b-None-None-None-384] SKIPPED (embedding_model_id empty - skipping test) [ 74%] tests/integration/inference/test_openai_embeddings.py::test_openai_embeddings_with_encoding_format_base64[llama_stack_client-cerebras/llama-3.3-70b-None-None-None-384] SKIPPED (embedding_model_id empty - skipping test) [ 76%] tests/integration/inference/test_openai_embeddings.py::test_openai_embeddings_base64_batch_processing[llama_stack_client-cerebras/llama-3.3-70b-None-None-None-384] SKIPPED (embedding_model_id empty - skipping test) [ 78%] tests/integration/inference/test_openai_completion.py::test_openai_chat_completion_non_streaming[client_with_models-txt=cerebras/llama-3.3-70b-inference:chat_completion:non_streaming_01] PASSED [ 80%] tests/integration/inference/test_openai_completion.py::test_openai_chat_completion_streaming[client_with_models-txt=cerebras/llama-3.3-70b-inference:chat_completion:streaming_01] PASSED [ 82%] tests/integration/inference/test_openai_completion.py::test_openai_chat_completion_streaming_with_n[client_with_models-txt=cerebras/llama-3.3-70b-inference:chat_completion:streaming_01] SKIPPED (Model cerebras/llama-3.3-70b hosted by remote:...) [ 85%] tests/integration/inference/test_openai_completion.py::test_inference_store[client_with_models-txt=cerebras/llama-3.3-70b-True] PASSED [ 87%] tests/integration/inference/test_openai_completion.py::test_inference_store_tool_calls[client_with_models-txt=cerebras/llama-3.3-70b-True] PASSED [ 89%] tests/integration/inference/test_openai_completion.py::test_openai_chat_completion_non_streaming[client_with_models-txt=cerebras/llama-3.3-70b-inference:chat_completion:non_streaming_02] PASSED [ 91%] tests/integration/inference/test_openai_completion.py::test_openai_chat_completion_streaming[client_with_models-txt=cerebras/llama-3.3-70b-inference:chat_completion:streaming_02] PASSED [ 93%] tests/integration/inference/test_openai_completion.py::test_openai_chat_completion_streaming_with_n[client_with_models-txt=cerebras/llama-3.3-70b-inference:chat_completion:streaming_02] SKIPPED (Model cerebras/llama-3.3-70b hosted by remote:...) [ 95%] tests/integration/inference/test_openai_completion.py::test_inference_store[client_with_models-txt=cerebras/llama-3.3-70b-False] PASSED [ 97%] tests/integration/inference/test_openai_completion.py::test_inference_store_tool_calls[client_with_models-txt=cerebras/llama-3.3-70b-False] PASSED [100%] =================================================================================================================== slowest 10 durations ==================================================================================================================== 0.37s call tests/integration/inference/test_openai_completion.py::test_openai_chat_completion_non_streaming[openai_client-txt=cerebras/llama-3.3-70b-inference:chat_completion:non_streaming_01] 0.34s call tests/integration/inference/test_openai_completion.py::test_inference_store[openai_client-txt=cerebras/llama-3.3-70b-False] 0.18s call tests/integration/inference/test_openai_completion.py::test_inference_store[client_with_models-txt=cerebras/llama-3.3-70b-True] 0.17s setup tests/integration/inference/test_openai_completion.py::test_openai_completion_non_streaming[txt=cerebras/llama-3.3-70b-inference:completion:sanity] 0.15s call tests/integration/inference/test_openai_completion.py::test_inference_store_tool_calls[client_with_models-txt=cerebras/llama-3.3-70b-True] 0.13s call tests/integration/inference/test_openai_completion.py::test_inference_store_tool_calls[openai_client-txt=cerebras/llama-3.3-70b-True] 0.12s call tests/integration/inference/test_openai_completion.py::test_inference_store_tool_calls[client_with_models-txt=cerebras/llama-3.3-70b-False] 0.12s call tests/integration/inference/test_openai_completion.py::test_inference_store[openai_client-txt=cerebras/llama-3.3-70b-True] 0.12s call tests/integration/inference/test_openai_completion.py::test_inference_store_tool_calls[openai_client-txt=cerebras/llama-3.3-70b-False] 0.08s call tests/integration/inference/test_openai_completion.py::test_openai_chat_completion_streaming[client_with_models-txt=cerebras/llama-3.3-70b-inference:chat_completion:streaming_02] ================================================================================================================== short test summary info ================================================================================================================== SKIPPED [1] tests/integration/inference/test_openai_completion.py:75: Suffix is not supported for the model: cerebras/llama-3.3-70b. SKIPPED [3] tests/integration/inference/test_openai_completion.py:123: Model cerebras/llama-3.3-70b hosted by remote::cerebras doesn't support vllm extra_body parameters. SKIPPED [4] tests/integration/inference/test_openai_completion.py:103: Model cerebras/llama-3.3-70b hosted by remote::cerebras doesn't support n param. SKIPPED [1] tests/integration/inference/test_openai_completion.py:129: Model cerebras/llama-3.3-70b hosted by remote::cerebras doesn't support chat completion calls with base64 encoded files. SKIPPED [2] tests/integration/inference/test_openai_embeddings.py:90: embedding_model_id empty - skipping test SKIPPED [2] tests/integration/inference/test_openai_embeddings.py:112: embedding_model_id empty - skipping test SKIPPED [2] tests/integration/inference/test_openai_embeddings.py:136: embedding_model_id empty - skipping test SKIPPED [2] tests/integration/inference/test_openai_embeddings.py:154: embedding_model_id empty - skipping test SKIPPED [2] tests/integration/inference/test_openai_embeddings.py:175: embedding_model_id empty - skipping test SKIPPED [2] tests/integration/inference/test_openai_embeddings.py:195: embedding_model_id empty - skipping test SKIPPED [2] tests/integration/inference/test_openai_embeddings.py:206: embedding_model_id empty - skipping test SKIPPED [2] tests/integration/inference/test_openai_embeddings.py:217: embedding_model_id empty - skipping test SKIPPED [2] tests/integration/inference/test_openai_embeddings.py:244: embedding_model_id empty - skipping test SKIPPED [2] tests/integration/inference/test_openai_embeddings.py:278: embedding_model_id empty - skipping test ================================================================================================= 18 passed, 29 skipped, 50 deselected, 4 warnings in 3.02s ================================================================================================= ```		2025-09-23 16:26:00 -04:00
..
0b3f2e4754ff.json	feat(tests): auto-merge all model list responses and unify recordings (#3320 )	2025-09-03 11:33:03 -07:00
0b27fd737699.json	feat(tests): auto-merge all model list responses and unify recordings (#3320 )	2025-09-03 11:33:03 -07:00
00ba04f74a96.json	feat(tests): auto-merge all model list responses and unify recordings (#3320 )	2025-09-03 11:33:03 -07:00
0c1f45455d3b.json	feat: add embedding and dynamic model support to Together inference adapter (#3458 )	2025-09-16 11:53:41 -07:00
0d3290adae1d.json	feat: update Cerebras inference provider to support dynamic model listing (#3481 )	2025-09-23 16:26:00 -04:00
0e8f2b001dd9.json	feat(tests): auto-merge all model list responses and unify recordings (#3320 )	2025-09-03 11:33:03 -07:00
0fda25b9241c.json	feat: add Azure OpenAI inference provider support (#3396 )	2025-09-11 13:48:38 +02:00
0ff78129bb3a.json	feat(ci): use replay mode, setup ollama if specific label exists on PR (#2955 )	2025-07-29 16:50:26 -07:00
1adfaa0e062e.json	feat(tests): auto-merge all model list responses and unify recordings (#3320 )	2025-09-03 11:33:03 -07:00
1b92be674e2a.json	feat(tests): auto-merge all model list responses and unify recordings (#3320 )	2025-09-03 11:33:03 -07:00
1b8394f90636.json	feat(tests): auto-merge all model list responses and unify recordings (#3320 )	2025-09-03 11:33:03 -07:00
1b45391880c6.json	feat(ci): add support for running vision inference tests (#2972 )	2025-07-31 11:50:42 -07:00
1e11c2b20ff8.json	feat(tests): auto-merge all model list responses and unify recordings (#3320 )	2025-09-03 11:33:03 -07:00
1f48f4b2ae33.json	feat(ci): add support for running vision inference tests (#2972 )	2025-07-31 11:50:42 -07:00
2afe3b38ca01.json	feat(tests): auto-merge all model list responses and unify recordings (#3320 )	2025-09-03 11:33:03 -07:00
2b2ad549510d.json	feat: add Azure OpenAI inference provider support (#3396 )	2025-09-11 13:48:38 +02:00
2d187a11704c.json	feat(tests): auto-merge all model list responses and unify recordings (#3320 )	2025-09-03 11:33:03 -07:00
3a81146f2afa.json	chore: update the ollama inference impl to use OpenAIMixin for openai-compat functions (#3395 )	2025-09-18 13:09:57 +02:00
3c0bf9ba81b2.json	feat(tests): auto-merge all model list responses and unify recordings (#3320 )	2025-09-03 11:33:03 -07:00
3c3f13cb7794.json	feat(tests): auto-merge all model list responses and unify recordings (#3320 )	2025-09-03 11:33:03 -07:00
3ca695048bee.json	feat(tests): auto-merge all model list responses and unify recordings (#3320 )	2025-09-03 11:33:03 -07:00
3cdb5cab6ce6.json	feat: update Cerebras inference provider to support dynamic model listing (#3481 )	2025-09-23 16:26:00 -04:00
3dff18060ebc.json	feat(tests): auto-merge all model list responses and unify recordings (#3320 )	2025-09-03 11:33:03 -07:00
3ef0f9aab128.json	feat: (re-)enable Databricks inference adapter (#3500 )	2025-09-23 15:37:23 -04:00
4a3a4447b16b.json	fix(ci, tests): ensure uv environments in CI are kosher, record tests (#3193 )	2025-08-18 17:02:24 -07:00
4ca6152a0eb8.json	feat: add embedding and dynamic model support to Together inference adapter (#3458 )	2025-09-16 11:53:41 -07:00
4d4440c8641b.json	feat: add dynamic model registration support to TGI inference (#3417 )	2025-09-15 15:52:40 -04:00
4de6877d86fa.json	feat: implement query_metrics (#3074 )	2025-08-22 14:19:24 -07:00
5db0c44c83a4.json	feat: implement query_metrics (#3074 )	2025-08-22 14:19:24 -07:00
5f5d16afadb4.json	feat(tests): auto-merge all model list responses and unify recordings (#3320 )	2025-09-03 11:33:03 -07:00
5fa0e98f3d84.json	feat: (re-)enable Databricks inference adapter (#3500 )	2025-09-23 15:37:23 -04:00
6c4e2e207e8a.json	feat: add embedding and dynamic model support to Together inference adapter (#3458 )	2025-09-16 11:53:41 -07:00
6cb0285a7638.json	feat: implement query_metrics (#3074 )	2025-08-22 14:19:24 -07:00
6cc063bbd7d3.json	feat(tests): auto-merge all model list responses and unify recordings (#3320 )	2025-09-03 11:33:03 -07:00
6d35c91287e2.json	feat(tests): auto-merge all model list responses and unify recordings (#3320 )	2025-09-03 11:33:03 -07:00
6d937e5e9233.json	feat: (re-)enable Databricks inference adapter (#3500 )	2025-09-23 15:37:23 -04:00
6f96090aa955.json	feat(tests): auto-merge all model list responses and unify recordings (#3320 )	2025-09-03 11:33:03 -07:00
6fbea1abca7c.json	feat(tests): auto-merge all model list responses and unify recordings (#3320 )	2025-09-03 11:33:03 -07:00
6fe1d4fedf12.json	feat(tests): auto-merge all model list responses and unify recordings (#3320 )	2025-09-03 11:33:03 -07:00
7b25b702ea18.json	feat(tests): auto-merge all model list responses and unify recordings (#3320 )	2025-09-03 11:33:03 -07:00
7b4815aba6c5.json	feat(tests): auto-merge all model list responses and unify recordings (#3320 )	2025-09-03 11:33:03 -07:00
7bcb0f86c91b.json	feat: implement query_metrics (#3074 )	2025-08-22 14:19:24 -07:00
07c5fa34d9ca.json	feat: add embedding and dynamic model support to Together inference adapter (#3458 )	2025-09-16 11:53:41 -07:00
7e6806cba34a.json	feat(tests): auto-merge all model list responses and unify recordings (#3320 )	2025-09-03 11:33:03 -07:00
7ef63231b9f8.json	feat: add dynamic model registration support to TGI inference (#3417 )	2025-09-15 15:52:40 -04:00
7f53b458dad9.json	feat: (re-)enable Databricks inference adapter (#3500 )	2025-09-23 15:37:23 -04:00
8bba71367e87.json	feat(ci): add support for running vision inference tests (#2972 )	2025-07-31 11:50:42 -07:00
9b9e8cf39b15.json	feat: (re-)enable Databricks inference adapter (#3500 )	2025-09-23 15:37:23 -04:00
9b812cbcb88d.json	fix(recording): endpoint resolution (#3013 )	2025-08-01 16:23:54 -07:00
9c4bc9c3e7ac.json	feat(ci): add support for running vision inference tests (#2972 )	2025-07-31 11:50:42 -07:00
9c007f300365.json	fix: telemetry fixes (inference and core telemetry) (#2733 )	2025-08-06 13:37:40 -07:00
9c28ec9ac338.json	feat(tests): auto-merge all model list responses and unify recordings (#3320 )	2025-09-03 11:33:03 -07:00
9c140a29ae09.json	feat(tests): auto-merge all model list responses and unify recordings (#3320 )	2025-09-03 11:33:03 -07:00
9e7a83d3d596.json	feat(tests): auto-merge all model list responses and unify recordings (#3320 )	2025-09-03 11:33:03 -07:00
9e651e5fcfe2.json	feat(tests): auto-merge all model list responses and unify recordings (#3320 )	2025-09-03 11:33:03 -07:00
9f3d749cc1c8.json	feat: add Azure OpenAI inference provider support (#3396 )	2025-09-11 13:48:38 +02:00
9fadf5a3d68f.json	feat(tests): auto-merge all model list responses and unify recordings (#3320 )	2025-09-03 11:33:03 -07:00
10eea8c15ddc.json	feat(tests): auto-merge all model list responses and unify recordings (#3320 )	2025-09-03 11:33:03 -07:00
011f70e24ce4.json	feat(ci): add support for running vision inference tests (#2972 )	2025-07-31 11:50:42 -07:00
35db283fef1d.json	feat(ci): add support for running vision inference tests (#2972 )	2025-07-31 11:50:42 -07:00
40f524d1934a.json	fix(recording): endpoint resolution (#3013 )	2025-08-01 16:23:54 -07:00
41e27b9b5d09.json	feat(batches, completions): add /v1/completions support to /v1/batches (#3309 )	2025-09-05 11:59:57 -07:00
43e106de6736.json	feat(ci): add support for running vision inference tests (#2972 )	2025-07-31 11:50:42 -07:00
44a1d9de0602.json	feat(tests): auto-merge all model list responses and unify recordings (#3320 )	2025-09-03 11:33:03 -07:00
44fb9cf5875f.json	feat(tests): auto-merge all model list responses and unify recordings (#3320 )	2025-09-03 11:33:03 -07:00
48d2fb183a2a.json	feat(tests): auto-merge all model list responses and unify recordings (#3320 )	2025-09-03 11:33:03 -07:00
50a8dc5b8ece.json	feat: update Cerebras inference provider to support dynamic model listing (#3481 )	2025-09-23 16:26:00 -04:00
56ac6a7c6df0.json	feat(ci): add support for running vision inference tests (#2972 )	2025-07-31 11:50:42 -07:00
57b67d1b1a36.json	feat: add Azure OpenAI inference provider support (#3396 )	2025-09-11 13:48:38 +02:00
61be36ad8ccd.json	feat(ci): add support for running vision inference tests (#2972 )	2025-07-31 11:50:42 -07:00
62aa454ea5f9.json	feat(tests): auto-merge all model list responses and unify recordings (#3320 )	2025-09-03 11:33:03 -07:00
70adef2c30c4.json	feat(tests): auto-merge all model list responses and unify recordings (#3320 )	2025-09-03 11:33:03 -07:00
72c1126ff2f9.json	feat(tests): auto-merge all model list responses and unify recordings (#3320 )	2025-09-03 11:33:03 -07:00
72e075bf28e8.json	feat: add embedding and dynamic model support to Together inference adapter (#3458 )	2025-09-16 11:53:41 -07:00
75d0dd9d0fa3.json	feat(tests): auto-merge all model list responses and unify recordings (#3320 )	2025-09-03 11:33:03 -07:00
76b89a84cd6f.json	feat(ci): add support for running vision inference tests (#2972 )	2025-07-31 11:50:42 -07:00
80e4404d8987.json	feat(tests): auto-merge all model list responses and unify recordings (#3320 )	2025-09-03 11:33:03 -07:00
80f09f27dd61.json	feat(ci): add support for running vision inference tests (#2972 )	2025-07-31 11:50:42 -07:00
81a91f79c51d.json	fix(recording): endpoint resolution (#3013 )	2025-08-01 16:23:54 -07:00
83c2ffb72daa.json	feat(ci): add support for running vision inference tests (#2972 )	2025-07-31 11:50:42 -07:00
84cab42e1f5c.json	feat(tests): auto-merge all model list responses and unify recordings (#3320 )	2025-09-03 11:33:03 -07:00
89b141855b81.json	feat: add dynamic model registration support to TGI inference (#3417 )	2025-09-15 15:52:40 -04:00
90fec951fdb9.json	feat(ci): add support for running vision inference tests (#2972 )	2025-07-31 11:50:42 -07:00
94d11daee205.json	feat: add Azure OpenAI inference provider support (#3396 )	2025-09-11 13:48:38 +02:00
97d3812bfccb.json	feat(tests): auto-merge all model list responses and unify recordings (#3320 )	2025-09-03 11:33:03 -07:00
97e259c0d3e5.json	feat(tests): auto-merge all model list responses and unify recordings (#3320 )	2025-09-03 11:33:03 -07:00
121a72d1c4cf.json	feat: (re-)enable Databricks inference adapter (#3500 )	2025-09-23 15:37:23 -04:00
173ecb3aab28.json	feat(tests): auto-merge all model list responses and unify recordings (#3320 )	2025-09-03 11:33:03 -07:00
177ba8517262.json	feat(ci): add support for running vision inference tests (#2972 )	2025-07-31 11:50:42 -07:00
198ef7208389.json	feat(tests): auto-merge all model list responses and unify recordings (#3320 )	2025-09-03 11:33:03 -07:00
211b1562d4e6.json	feat(tests): auto-merge all model list responses and unify recordings (#3320 )	2025-09-03 11:33:03 -07:00
249b7f0ddde6.json	feat: (re-)enable Databricks inference adapter (#3500 )	2025-09-23 15:37:23 -04:00
325a72db5755.json	feat(tests): auto-merge all model list responses and unify recordings (#3320 )	2025-09-03 11:33:03 -07:00
382c2f22274c.json	feat(tests): auto-merge all model list responses and unify recordings (#3320 )	2025-09-03 11:33:03 -07:00
390f0c7dac96.json	feat: implement query_metrics (#3074 )	2025-08-22 14:19:24 -07:00
432a346b2ed8.json	feat: add embedding and dynamic model support to Together inference adapter (#3458 )	2025-09-16 11:53:41 -07:00
441e2832387f.json	feat: (re-)enable Databricks inference adapter (#3500 )	2025-09-23 15:37:23 -04:00
477f8946bf7d.json	feat(ci): add support for running vision inference tests (#2972 )	2025-07-31 11:50:42 -07:00
511eb1b92e34.json	feat: add embedding and dynamic model support to Together inference adapter (#3458 )	2025-09-16 11:53:41 -07:00
545d86510a80.json	feat(tests): auto-merge all model list responses and unify recordings (#3320 )	2025-09-03 11:33:03 -07:00
0547d0909f24.json	feat: update Cerebras inference provider to support dynamic model listing (#3481 )	2025-09-23 16:26:00 -04:00
554de3cd986f.json	feat(tests): auto-merge all model list responses and unify recordings (#3320 )	2025-09-03 11:33:03 -07:00
563b994bb7d1.json	feat(tests): auto-merge all model list responses and unify recordings (#3320 )	2025-09-03 11:33:03 -07:00
565b1072cb9d.json	feat: add embedding and dynamic model support to Together inference adapter (#3458 )	2025-09-16 11:53:41 -07:00
802f60021837.json	feat(tests): auto-merge all model list responses and unify recordings (#3320 )	2025-09-03 11:33:03 -07:00
830a1fe14938.json	fix: telemetry fixes (inference and core telemetry) (#2733 )	2025-08-06 13:37:40 -07:00
836f51dfb3c5.json	feat(tests): auto-merge all model list responses and unify recordings (#3320 )	2025-09-03 11:33:03 -07:00
840fbb380b73.json	feat(tests): auto-merge all model list responses and unify recordings (#3320 )	2025-09-03 11:33:03 -07:00
894fdacb1cfa.json	feat: add embedding and dynamic model support to Together inference adapter (#3458 )	2025-09-16 11:53:41 -07:00
2983cc1d79f0.json	feat: update Cerebras inference provider to support dynamic model listing (#3481 )	2025-09-23 16:26:00 -04:00
3877ecf1bc62.json	chore: move recordings one directory upwards	2025-07-29 12:46:19 -07:00
4014dd44c15f.json	feat(ci): add support for running vision inference tests (#2972 )	2025-07-31 11:50:42 -07:00
6730dcde0b73.json	feat: add embedding and dynamic model support to Together inference adapter (#3458 )	2025-09-16 11:53:41 -07:00
6857b19d3f0a.json	feat: add embedding and dynamic model support to Together inference adapter (#3458 )	2025-09-16 11:53:41 -07:00
6906a6e71988.json	feat(tests): auto-merge all model list responses and unify recordings (#3320 )	2025-09-03 11:33:03 -07:00
7354ec181984.json	feat(tests): auto-merge all model list responses and unify recordings (#3320 )	2025-09-03 11:33:03 -07:00
12913f20f6ac.json	feat(ci): add support for running vision inference tests (#2972 )	2025-07-31 11:50:42 -07:00
17030e75309f.json	feat: add embedding and dynamic model support to Together inference adapter (#3458 )	2025-09-16 11:53:41 -07:00
17253d7cc667.json	feat(tests): auto-merge all model list responses and unify recordings (#3320 )	2025-09-03 11:33:03 -07:00
23506e73bb9e.json	feat(tests): auto-merge all model list responses and unify recordings (#3320 )	2025-09-03 11:33:03 -07:00
28648cf8d421.json	feat: (re-)enable Databricks inference adapter (#3500 )	2025-09-23 15:37:23 -04:00
29585e055e6f.json	feat: (re-)enable Databricks inference adapter (#3500 )	2025-09-23 15:37:23 -04:00
31407e035752.json	feat(ci): add support for running vision inference tests (#2972 )	2025-07-31 11:50:42 -07:00
50340cd4d253.json	feat(tests): auto-merge all model list responses and unify recordings (#3320 )	2025-09-03 11:33:03 -07:00
67198cbad48f.json	fix: telemetry fixes (inference and core telemetry) (#2733 )	2025-08-06 13:37:40 -07:00
85594a69d74a.json	feat(tests): auto-merge all model list responses and unify recordings (#3320 )	2025-09-03 11:33:03 -07:00
140187e305dc.json	feat(tests): auto-merge all model list responses and unify recordings (#3320 )	2025-09-03 11:33:03 -07:00
174458ad71b2.json	feat(tests): auto-merge all model list responses and unify recordings (#3320 )	2025-09-03 11:33:03 -07:00
178016edef0e.json	feat(tests): auto-merge all model list responses and unify recordings (#3320 )	2025-09-03 11:33:03 -07:00
197228e26971.json	feat(tests): auto-merge all model list responses and unify recordings (#3320 )	2025-09-03 11:33:03 -07:00
561746e1c8de.json	feat(tests): auto-merge all model list responses and unify recordings (#3320 )	2025-09-03 11:33:03 -07:00
0648374e43e7.json	feat: update Cerebras inference provider to support dynamic model listing (#3481 )	2025-09-23 16:26:00 -04:00
731824c54461.json	fix(ci, tests): ensure uv environments in CI are kosher, record tests (#3193 )	2025-08-18 17:02:24 -07:00
875323ed9913.json	feat: (re-)enable Databricks inference adapter (#3500 )	2025-09-23 15:37:23 -04:00
4096743baf8e.json	fix: telemetry fixes (inference and core telemetry) (#2733 )	2025-08-06 13:37:40 -07:00
04172112ffbb.json	feat(tests): auto-merge all model list responses and unify recordings (#3320 )	2025-09-03 11:33:03 -07:00
4597743bcd2a.json	feat(ci): use replay mode, setup ollama if specific label exists on PR (#2955 )	2025-07-29 16:50:26 -07:00
8295382a8e7c.json	feat(tests): auto-merge all model list responses and unify recordings (#3320 )	2025-09-03 11:33:03 -07:00
8752115f8d0c.json	feat: add Azure OpenAI inference provider support (#3396 )	2025-09-11 13:48:38 +02:00
27463384d1a3.json	feat: add dynamic model registration support to TGI inference (#3417 )	2025-09-15 15:52:40 -04:00
4420515208a8.json	feat(tests): auto-merge all model list responses and unify recordings (#3320 )	2025-09-03 11:33:03 -07:00
5370751803dc.json	feat(tests): auto-merge all model list responses and unify recordings (#3320 )	2025-09-03 11:33:03 -07:00
6412295819a1.json	chore: update the ollama inference impl to use OpenAIMixin for openai-compat functions (#3395 )	2025-09-18 13:09:57 +02:00
417020320684.json	feat(tests): auto-merge all model list responses and unify recordings (#3320 )	2025-09-03 11:33:03 -07:00
a0c4df33879f.json	feat(tests): auto-merge all model list responses and unify recordings (#3320 )	2025-09-03 11:33:03 -07:00
a1c5bf09ea53.json	feat: update Cerebras inference provider to support dynamic model listing (#3481 )	2025-09-23 16:26:00 -04:00
a4c8d19bb1eb.json	feat(tests): auto-merge all model list responses and unify recordings (#3320 )	2025-09-03 11:33:03 -07:00
a59d0d7c1485.json	feat(tests): auto-merge all model list responses and unify recordings (#3320 )	2025-09-03 11:33:03 -07:00
a98eecadddc8.json	feat: add dynamic model registration support to TGI inference (#3417 )	2025-09-15 15:52:40 -04:00
a410d4840402.json	feat(ci): add support for running vision inference tests (#2972 )	2025-07-31 11:50:42 -07:00
a5187d9d5057.json	feat(tests): auto-merge all model list responses and unify recordings (#3320 )	2025-09-03 11:33:03 -07:00
a6810c23eda8.json	feat(tests): auto-merge all model list responses and unify recordings (#3320 )	2025-09-03 11:33:03 -07:00
a97477559b10.json	feat(ci): add support for running vision inference tests (#2972 )	2025-07-31 11:50:42 -07:00
aacf9abc51d4.json	feat: add dynamic model registration support to TGI inference (#3417 )	2025-09-15 15:52:40 -04:00
ae1c22f18ecc.json	feat(tests): auto-merge all model list responses and unify recordings (#3320 )	2025-09-03 11:33:03 -07:00
ae6835cfe70e.json	feat(tests): auto-merge all model list responses and unify recordings (#3320 )	2025-09-03 11:33:03 -07:00
af6ca03dcbc3.json	feat: update Cerebras inference provider to support dynamic model listing (#3481 )	2025-09-23 16:26:00 -04:00
afb33182f365.json	feat(ci): add support for running vision inference tests (#2972 )	2025-07-31 11:50:42 -07:00
b4cda53cd04f.json	feat(ci): add support for running vision inference tests (#2972 )	2025-07-31 11:50:42 -07:00
b5e3ed420986.json	feat(tests): auto-merge all model list responses and unify recordings (#3320 )	2025-09-03 11:33:03 -07:00
b9f6e724ae06.json	feat: add dynamic model registration support to TGI inference (#3417 )	2025-09-15 15:52:40 -04:00
b14ff438ca99.json	feat(tests): auto-merge all model list responses and unify recordings (#3320 )	2025-09-03 11:33:03 -07:00
b44cc7a7afc8.json	chore(misc): make tests and starter faster (#3042 )	2025-08-05 14:55:05 -07:00
b91f1fb4aedb.json	fix(recording): endpoint resolution (#3013 )	2025-08-01 16:23:54 -07:00
b459f403a5ae.json	feat: update Cerebras inference provider to support dynamic model listing (#3481 )	2025-09-23 16:26:00 -04:00
b612debbd3bf.json	feat(tests): auto-merge all model list responses and unify recordings (#3320 )	2025-09-03 11:33:03 -07:00
b24590574a85.json	feat(ci): add support for running vision inference tests (#2972 )	2025-07-31 11:50:42 -07:00
bbd0637dce16.json	fix(recording): endpoint resolution (#3013 )	2025-08-01 16:23:54 -07:00
bce560cbf1c6.json	feat: add embedding and dynamic model support to Together inference adapter (#3458 )	2025-09-16 11:53:41 -07:00
bd356b27a085.json	feat(tests): auto-merge all model list responses and unify recordings (#3320 )	2025-09-03 11:33:03 -07:00
bf79a89cc37f.json	feat: implement query_metrics (#3074 )	2025-08-22 14:19:24 -07:00
c9cba6f3ee38.json	feat(tests): auto-merge all model list responses and unify recordings (#3320 )	2025-09-03 11:33:03 -07:00
c31a86ea6c58.json	feat: implement query_metrics (#3074 )	2025-08-22 14:19:24 -07:00
c2199d6064db.json	feat(tests): auto-merge all model list responses and unify recordings (#3320 )	2025-09-03 11:33:03 -07:00
c7582fa7c2c4.json	feat(ci): use replay mode, setup ollama if specific label exists on PR (#2955 )	2025-07-29 16:50:26 -07:00
c791119e6359.json	feat: add Azure OpenAI inference provider support (#3396 )	2025-09-11 13:48:38 +02:00
c9667519ad7c.json	fix: telemetry fixes (inference and core telemetry) (#2733 )	2025-08-06 13:37:40 -07:00
cb3df2a1dc22.json	feat(tests): auto-merge all model list responses and unify recordings (#3320 )	2025-09-03 11:33:03 -07:00
cd094caaf1c0.json	feat(tests): auto-merge all model list responses and unify recordings (#3320 )	2025-09-03 11:33:03 -07:00
cf55f983d1ff.json	feat: add dynamic model registration support to TGI inference (#3417 )	2025-09-15 15:52:40 -04:00
d0ac68cbde69.json	feat(tests): auto-merge all model list responses and unify recordings (#3320 )	2025-09-03 11:33:03 -07:00
d3e27b7234e2.json	feat: add Azure OpenAI inference provider support (#3396 )	2025-09-11 13:48:38 +02:00
d4c86ac355fb.json	feat(tests): auto-merge all model list responses and unify recordings (#3320 )	2025-09-03 11:33:03 -07:00
d4f56d7d1996.json	fix: telemetry fixes (inference and core telemetry) (#2733 )	2025-08-06 13:37:40 -07:00
d64ffaa0de6f.json	feat: (re-)enable Databricks inference adapter (#3500 )	2025-09-23 15:37:23 -04:00
d86d4fc1eaca.json	feat(tests): auto-merge all model list responses and unify recordings (#3320 )	2025-09-03 11:33:03 -07:00
d85689907fec.json	feat: add embedding and dynamic model support to Together inference adapter (#3458 )	2025-09-16 11:53:41 -07:00
da531c71e64f.json	chore(misc): make tests and starter faster (#3042 )	2025-08-05 14:55:05 -07:00
dac7a32e5db9.json	feat(tests): auto-merge all model list responses and unify recordings (#3320 )	2025-09-03 11:33:03 -07:00
dbc41d2417e1.json	chore(misc): make tests and starter faster (#3042 )	2025-08-05 14:55:05 -07:00
dc8120cf0774.json	feat: implement query_metrics (#3074 )	2025-08-22 14:19:24 -07:00
dd9e7d5913e9.json	feat(tests): auto-merge all model list responses and unify recordings (#3320 )	2025-09-03 11:33:03 -07:00
dd226d71f844.json	feat(tests): auto-merge all model list responses and unify recordings (#3320 )	2025-09-03 11:33:03 -07:00
decfd950646c.json	feat(tests): auto-merge all model list responses and unify recordings (#3320 )	2025-09-03 11:33:03 -07:00
e0a6dce1d94b.json	feat(tests): auto-merge all model list responses and unify recordings (#3320 )	2025-09-03 11:33:03 -07:00
e2c9b07709fe.json	feat(tests): auto-merge all model list responses and unify recordings (#3320 )	2025-09-03 11:33:03 -07:00
e08e01e5652a.json	feat: add dynamic model registration support to TGI inference (#3417 )	2025-09-15 15:52:40 -04:00
e9c8a0e4f0e0.json	feat(tests): auto-merge all model list responses and unify recordings (#3320 )	2025-09-03 11:33:03 -07:00
e08848bfcd28.json	feat(ci): add support for running vision inference tests (#2972 )	2025-07-31 11:50:42 -07:00
e509387fc329.json	feat: (re-)enable Databricks inference adapter (#3500 )	2025-09-23 15:37:23 -04:00
e29300494763.json	feat(ci): add support for running vision inference tests (#2972 )	2025-07-31 11:50:42 -07:00
e96152610712.json	feat(tests): auto-merge all model list responses and unify recordings (#3320 )	2025-09-03 11:33:03 -07:00
ecae140151d1.json	chore: update the ollama inference impl to use OpenAIMixin for openai-compat functions (#3395 )	2025-09-18 13:09:57 +02:00
ecf6f0c51485.json	feat: (re-)enable Databricks inference adapter (#3500 )	2025-09-23 15:37:23 -04:00
ed9e9b34008d.json	feat(tests): auto-merge all model list responses and unify recordings (#3320 )	2025-09-03 11:33:03 -07:00
eee6a163b837.json	feat(ci): add support for running vision inference tests (#2972 )	2025-07-31 11:50:42 -07:00
eee47930e3ae.json	feat(tests): auto-merge all model list responses and unify recordings (#3320 )	2025-09-03 11:33:03 -07:00
eefb4206a4a9.json	feat: update Cerebras inference provider to support dynamic model listing (#3481 )	2025-09-23 16:26:00 -04:00
ef59cbff54d0.json	feat(tests): auto-merge all model list responses and unify recordings (#3320 )	2025-09-03 11:33:03 -07:00
ef757a75ed08.json	feat(tests): auto-merge all model list responses and unify recordings (#3320 )	2025-09-03 11:33:03 -07:00
f0bbea34c5cc.json	feat: add embedding and dynamic model support to Together inference adapter (#3458 )	2025-09-16 11:53:41 -07:00
f1ea938b0b0d.json	chore(misc): make tests and starter faster (#3042 )	2025-08-05 14:55:05 -07:00
f3c3afbd9b7e.json	feat(ci): use replay mode, setup ollama if specific label exists on PR (#2955 )	2025-07-29 16:50:26 -07:00
f6d655e91ac3.json	feat(tests): auto-merge all model list responses and unify recordings (#3320 )	2025-09-03 11:33:03 -07:00
f70f30f54211.json	feat(tests): auto-merge all model list responses and unify recordings (#3320 )	2025-09-03 11:33:03 -07:00
f80b99430f7e.json	feat: implement query_metrics (#3074 )	2025-08-22 14:19:24 -07:00
f477c2fe1332.json	feat(tests): auto-merge all model list responses and unify recordings (#3320 )	2025-09-03 11:33:03 -07:00
f518ea4fde7d.json	feat: add dynamic model registration support to TGI inference (#3417 )	2025-09-15 15:52:40 -04:00
f1592dee71e5.json	feat(tests): auto-merge all model list responses and unify recordings (#3320 )	2025-09-03 11:33:03 -07:00
f6857bcea729.json	feat: implement query_metrics (#3074 )	2025-08-22 14:19:24 -07:00
fb785db7fafd.json	feat: add Azure OpenAI inference provider support (#3396 )	2025-09-11 13:48:38 +02:00
fcdef245da95.json	feat(tests): auto-merge all model list responses and unify recordings (#3320 )	2025-09-03 11:33:03 -07:00
fe140befeba4.json	feat(ci): add support for running vision inference tests (#2972 )	2025-07-31 11:50:42 -07:00
ff7db0102b28.json	feat(tests): auto-merge all model list responses and unify recordings (#3320 )	2025-09-03 11:33:03 -07:00
ff3271401fb4.json	feat: add Azure OpenAI inference provider support (#3396 )	2025-09-11 13:48:38 +02:00
ffd7b58fded8.json	feat: (re-)enable Databricks inference adapter (#3500 )	2025-09-23 15:37:23 -04:00
models-4a3a4447b16b-3057338f.json	feat(tests): auto-merge all model list responses and unify recordings (#3320 )	2025-09-03 11:33:03 -07:00
models-bd032f995f2a-af43cc69.json	feat: update Cerebras inference provider to support dynamic model listing (#3481 )	2025-09-23 16:26:00 -04:00