llama-stack-mirror

mirror of https://github.com/meta-llama/llama-stack.git synced 2025-12-16 19:29:27 +00:00

History

Robert Riley (OCI) 6ad5fb5577 Some checks failed Integration Auth Tests / test-matrix (oauth2_token) (push) Failing after 1s Details SqlStore Integration Tests / test-postgres (3.12) (push) Failing after 0s Details Test External Providers Installed via Module / test-external-providers-from-module (venv) (push) Has been skipped Details Integration Tests (Replay) / generate-matrix (push) Successful in 3s Details API Conformance Tests / check-schema-compatibility (push) Successful in 10s Details SqlStore Integration Tests / test-postgres (3.13) (push) Failing after 11s Details Python Package Build Test / build (3.12) (push) Successful in 15s Details Python Package Build Test / build (3.13) (push) Successful in 18s Details Test External API and Providers / test-external (venv) (push) Failing after 30s Details UI Tests / ui-tests (22) (push) Successful in 56s Details Vector IO Integration Tests / test-matrix (push) Failing after 1m1s Details Unit Tests / unit-tests (3.13) (push) Failing after 1m44s Details Unit Tests / unit-tests (3.12) (push) Failing after 1m48s Details Integration Tests (Replay) / Integration Tests (, , , client=, ) (push) Failing after 3m17s Details Pre-commit / pre-commit (22) (push) Successful in 3m22s Details feat: Adding OCI Embeddings (#4300 ) # What does this PR do? Enabling usage of OCI embedding models. ## Test Plan Testing embedding model: `OCI_COMPARTMENT_OCID="" OCI_REGION="us-chicago-1" OCI_AUTH_TYPE=config_file pytest -sv tests/integration/inference/test_openai_embeddings.py --stack-config oci --embedding-model oci/openai.text-embedding-3-small --inference-mode live` Testing chat model: `OCI_COMPARTMENT_OCID="" OCI_REGION="us-chicago-1" OCI_AUTH_TYPE=config_file pytest -sv tests/integration/inference/ --stack-config oci --text-model oci/openai.gpt-4.1-nano-2025-04-14 --inference-mode live` Testing curl for embeddings: `curl -X POST http://localhost:8321/v1/embeddings -H "Content-Type: application/json" -d '{ "model": "oci/openai.text-embedding-3-small", "input": ["First text", "Second text"], "encoding_format": "float" }'` `{"object":"list","data":[{"object":"embedding","embedding":[-0.017190756...0.025272394],"index":1}],"model":"oci/openai.text-embedding-3-small","usage":{"prompt_tokens":4,"total_tokens":4}}` --------- Co-authored-by: Omar Abdelwahab <omaryashraf10@gmail.com>		2025-12-08 13:05:39 -08:00
..
recordings	feat(responses)!: Add web_search_2025_08_26 to the WebSearchToolTypes (#4103 )	2025-11-07 10:01:12 -08:00
__init__.py	fix: remove ruff N999 (#1388 )	2025-03-07 11:14:04 -08:00
dog.png	refactor: tests/unittests -> tests/unit; tests/api -> tests/integration	2025-03-04 09:57:00 -08:00
test_openai_completion.py	feat: add oci genai service as chat inference provider (#3876 )	2025-11-10 16:16:24 -05:00
test_openai_embeddings.py	feat: Adding OCI Embeddings (#4300 )	2025-12-08 13:05:39 -08:00
test_openai_vision_inference.py	feat(internal): add image_url download feature to OpenAIMixin (#3516 )	2025-09-26 17:32:16 -04:00
test_provider_data_routing.py	feat!: Architect Llama Stack Telemetry Around Automatic Open Telemetry Instrumentation (#4127 )	2025-12-01 10:33:18 -08:00
test_rerank.py	feat: Add rerank API for NVIDIA Inference Provider (#3329 )	2025-10-30 21:42:09 -07:00
test_tools_with_schemas.py	fix: Remove authorization from provider data (#4161 )	2025-11-17 12:16:35 -08:00
test_vision_inference.py	chore(apis): unpublish deprecated /v1/inference apis (#3297 )	2025-09-27 11:20:06 -07:00
vision_test_1.jpg	feat: introduce llama4 support (#1877 )	2025-04-05 11:53:35 -07:00
vision_test_2.jpg	feat: introduce llama4 support (#1877 )	2025-04-05 11:53:35 -07:00
vision_test_3.jpg	feat: introduce llama4 support (#1877 )	2025-04-05 11:53:35 -07:00