test(verification): overwrite test result instead of creating new ones (#1934)

# What does this PR do? ## Test Plan (myenv) ➜ llama-stack python tests/verifications/generate_report.py --providers fireworks,together,openai --run-tests
2025-12-09 03:19:20 +00:00 · 2025-04-10 16:59:28 -07:00 · 2025-04-10 16:59:28 -07:00 · 2fcb70b789
commit 2fcb70b789
parent a4cc4b7e31
5 changed files with 926 additions and 580 deletions
--- a/tests/verifications/REPORT.md
+++ b/tests/verifications/REPORT.md
@ -1,6 +1,6 @@
 # Test Results Report

-*Generated on: 2025-04-09 22:52:19*
+*Generated on: 2025-04-10 16:48:18*

 *This report was generated by running `python tests/verifications/generate_report.py`*

@ -15,15 +15,15 @@

 | Provider | Pass Rate | Tests Passed | Total Tests |
 | --- | --- | --- | --- |
-| Together | 67.7% | 21 | 31 |
-| Fireworks | 90.3% | 28 | 31 |
-| Openai | 100.0% | 22 | 22 |
+| Together | 64.7% | 22 | 34 |
+| Fireworks | 82.4% | 28 | 34 |
+| Openai | 100.0% | 24 | 24 |



 ## Together

-*Tests run on: 2025-04-09 22:50:58*
+*Tests run on: 2025-04-10 16:46:35*

 ```bash
 # Run all tests for this provider:
@ -56,10 +56,11 @@ pytest tests/verifications/openai_api/test_chat_completion.py --provider=togethe
 | test_chat_streaming_image | ⚪ | ❌ | ❌ |
 | test_chat_streaming_structured_output (calendar) | ✅ | ❌ | ❌ |
 | test_chat_streaming_structured_output (math) | ✅ | ❌ | ❌ |
+| test_chat_streaming_tool_calling | ✅ | ❌ | ❌ |

 ## Fireworks

-*Tests run on: 2025-04-09 22:50:02*
+*Tests run on: 2025-04-10 16:44:44*

 ```bash
 # Run all tests for this provider:
@ -92,10 +93,11 @@ pytest tests/verifications/openai_api/test_chat_completion.py --provider=firewor
 | test_chat_streaming_image | ⚪ | ✅ | ✅ |
 | test_chat_streaming_structured_output (calendar) | ✅ | ✅ | ✅ |
 | test_chat_streaming_structured_output (math) | ✅ | ✅ | ✅ |
+| test_chat_streaming_tool_calling | ❌ | ❌ | ❌ |

 ## Openai

-*Tests run on: 2025-04-09 22:51:44*
+*Tests run on: 2025-04-10 16:47:28*

 ```bash
 # Run all tests for this provider:
@ -127,3 +129,4 @@ pytest tests/verifications/openai_api/test_chat_completion.py --provider=openai
 | test_chat_streaming_image | ✅ | ✅ |
 | test_chat_streaming_structured_output (calendar) | ✅ | ✅ |
 | test_chat_streaming_structured_output (math) | ✅ | ✅ |
+| test_chat_streaming_tool_calling | ✅ | ✅ |