llama-stack-mirror

mirror of https://github.com/meta-llama/llama-stack.git synced 2025-12-17 16:12:46 +00:00

History

Matthew Farrellee ae804ed5a8 feat: (re-)enable Databricks inference adapter Databricks inference adapter was broken, would not start, see #3486 - remove deprecated completion / chat_completion endpoints - enable dynamic model listing w/o refresh, listing is not async - use SecretStr instead of str for token - backward incompatible change: for consistency with databricks docs, env DATABRICKS_URL -> DATABRICKS_HOST and DATABRICKS_API_TOKEN -> DATABRICKS_TOKEN - databricks urls are custom per user/org, add special recorder handling for databricks urls - add integration test --setup databricks - enable chat completions tests - enable embeddings tests - disable n > 1 tests - disable embeddings base64 tests - disable embeddings dimensions tests note: reasoning models, e.g. gpt oss, fail because databricks has a custom, incompatible response format test with: ./scripts/integration-tests.sh --stack-config server:ci-tests --setup databricks --subdirs inference --pattern openai note: databricks needs to be manually added to the ci-tests distro for replay testing	2025-09-20 05:05:05 -04:00
..
__init__.py	feat(tests): introduce inference record/replay to increase test reliability (#2941 )	2025-07-29 12:41:31 -07:00
inference_recorder.py	feat: (re-)enable Databricks inference adapter	2025-09-20 05:05:05 -04:00

Matthew Farrellee ae804ed5a8 feat: (re-)enable Databricks inference adapter

Databricks inference adapter was broken, would not start, see #3486

- remove deprecated completion / chat_completion endpoints
- enable dynamic model listing w/o refresh, listing is not async
- use SecretStr instead of str for token
- backward incompatible change: for consistency with databricks docs, env DATABRICKS_URL -> DATABRICKS_HOST and DATABRICKS_API_TOKEN -> DATABRICKS_TOKEN
- databricks urls are custom per user/org, add special recorder handling for databricks urls
- add integration test --setup databricks
- enable chat completions tests
- enable embeddings tests
- disable n > 1 tests
- disable embeddings base64 tests
- disable embeddings dimensions tests

note: reasoning models, e.g. gpt oss, fail because databricks has a custom, incompatible response format

test with: ./scripts/integration-tests.sh --stack-config server:ci-tests --setup databricks --subdirs inference --pattern openai

note: databricks needs to be manually added to the ci-tests distro for replay testing

2025-09-20 05:05:05 -04:00

__init__.py

feat(tests): introduce inference record/replay to increase test reliability (#2941 )

2025-07-29 12:41:31 -07:00

inference_recorder.py

feat: (re-)enable Databricks inference adapter

2025-09-20 05:05:05 -04:00