llama-stack/llama_toolchain/inference
Xi Yan 5712566061
Remove request wrapper migration (#64)
* [1/n] migrate inference/chat_completion

* migrate inference/completion

* inference/completion

* inference regenerate openapi spec

* safety api

* migrate agentic system

* migrate apis without implementations

* re-generate openapi spec

* remove hack from openapi generator

* fix inference

* fix inference

* openapi generator rerun

* Simplified Telemetry API and tying it to logger (#57)

* Simplified Telemetry API and tying it to logger

* small update which adds a METRIC type

* move span events one level down into structured log events

---------

Co-authored-by: Ashwin Bharambe <ashwin@meta.com>

* fix api to work with openapi generator

* fix agentic calling inference

* together adapter inference

* update inference adapters

---------

Co-authored-by: Ashwin Bharambe <ashwin.bharambe@gmail.com>
Co-authored-by: Ashwin Bharambe <ashwin@meta.com>
2024-09-12 15:03:49 -07:00
..
adapters Remove request wrapper migration (#64) 2024-09-12 15:03:49 -07:00
api Remove request wrapper migration (#64) 2024-09-12 15:03:49 -07:00
meta_reference Remove request wrapper migration (#64) 2024-09-12 15:03:49 -07:00
quantization API Updates: fleshing out RAG APIs, introduce "llama stack" CLI command (#51) 2024-09-03 22:39:39 -07:00
__init__.py Initial commit 2024-07-23 08:32:33 -07:00
client.py Remove request wrapper migration (#64) 2024-09-12 15:03:49 -07:00
event_logger.py formatting 2024-08-14 17:03:43 -04:00
prepare_messages.py API Updates: fleshing out RAG APIs, introduce "llama stack" CLI command (#51) 2024-09-03 22:39:39 -07:00
providers.py [Inference] Use huggingface_hub inference client for TGI adapter (#53) 2024-09-12 09:11:35 -07:00