forked from phoenix-oss/llama-stack-mirror
		
	chore: move all Llama Stack types from llama-models to llama-stack (#1098)
llama-models should have extremely minimal cruft. Its sole purpose should be didactic -- show the simplest implementation of the llama models and document the prompt formats, etc.

This PR is the complement to https://github.com/meta-llama/llama-models/pull/279

## Test Plan

Ensure all `llama` CLI `model` sub-commands work:

```bash
llama model list
llama model download --model-id ...
llama model prompt-format -m ...
```

Ran tests:

```bash
cd tests/client-sdk
LLAMA_STACK_CONFIG=fireworks pytest -s -v inference/
LLAMA_STACK_CONFIG=fireworks pytest -s -v vector_io/
LLAMA_STACK_CONFIG=fireworks pytest -s -v agents/
```

Create a fresh venv `uv venv && source .venv/bin/activate` and run `llama stack build --template fireworks --image-type venv` followed by `llama stack run together --image-type venv` <-- the server runs

Also checked that the OpenAPI generator can run and there is no change in the generated files as a result.

```bash
cd docs/openapi_generator
sh run_openapi_generator.sh
```
parent c0ee512980
commit 314ee09ae3

138 changed files with 8491 additions and 465 deletions

@@ -7,14 +7,7 @@ import json
 import logging
 from typing import AsyncGenerator, Dict, List, Optional, Union
 
-from llama_models.datatypes import (
-    GreedySamplingStrategy,
-    SamplingParams,
-    TopKSamplingStrategy,
-    TopPSamplingStrategy,
-)
 from llama_models.llama3.api.chat_format import ChatFormat
-from llama_models.llama3.api.datatypes import StopReason, ToolCall
 from openai.types.chat import ChatCompletionMessageToolCall
 from pydantic import BaseModel
 
@@ -37,6 +30,14 @@ from llama_stack.apis.inference import (
     Message,
     TokenLogProbs,
 )
+from llama_stack.models.llama.datatypes import (
+    GreedySamplingStrategy,
+    SamplingParams,
+    StopReason,
+    ToolCall,
+    TopKSamplingStrategy,
+    TopPSamplingStrategy,
+)
 from llama_stack.providers.utils.inference.prompt_adapter import (
     convert_image_content_to_url,
 )
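
For downstream code that still imports these types from `llama-models`, the same rewrite can be sketched as a small text transform. This is a hypothetical helper, not part of the commit: `rewrite_imports`, `MOVED_NAMES`, `OLD_MODULES`, and `NEW_MODULE` are names invented here, and the mapping covers only the six names visible in the hunks above. It assumes any other import should be left untouched.

```python
import re

# Assumption: only the names visible in the diff above are known to have moved.
MOVED_NAMES = {
    "GreedySamplingStrategy",
    "SamplingParams",
    "StopReason",
    "ToolCall",
    "TopKSamplingStrategy",
    "TopPSamplingStrategy",
}
OLD_MODULES = ("llama_models.datatypes", "llama_models.llama3.api.datatypes")
NEW_MODULE = "llama_stack.models.llama.datatypes"

# Matches `from X import a, b` and `from X import (\n a,\n b,\n)`.
_IMPORT_RE = re.compile(
    r"^from (?P<module>[\w.]+) import "
    r"(?:\((?P<multi>[^)]*)\)|(?P<single>[^\n(]+))$",
    re.MULTILINE,
)


def rewrite_imports(source: str) -> str:
    """Point imports of the moved names at the new llama_stack module."""

    def replace(match: re.Match) -> str:
        if match.group("module") not in OLD_MODULES:
            return match.group(0)  # e.g. the ChatFormat import stays as-is
        raw = match.group("multi")
        if raw is None:
            raw = match.group("single")
        names = [n.strip() for n in raw.split(",") if n.strip()]
        # Leave the import alone if it mentions anything not known to have moved
        # (including aliased names such as `ToolCall as TC`).
        if not set(names) <= MOVED_NAMES:
            return match.group(0)
        return f"from {NEW_MODULE} import {', '.join(names)}"

    return _IMPORT_RE.sub(replace, source)
```

The helper rewrites each matching import statement separately rather than merging them into one block as the commit does; a formatter such as isort could merge and sort the result afterwards.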