Docs Updates #61
Conversation
Great job!
- The `--target` flag specifies the server hosting the model. In this case, it is a local vLLM server.
- The `--model` flag specifies the model to evaluate. The model name should match the name of the model deployed on the server.
- By default, GuideLLM will run a `sweep` of performance evaluations across different request rates, each lasting 120 seconds. The results will be saved to a local directory.
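For reference, a minimal invocation exercising these flags might look like the sketch below; the server URL is an assumption for illustration, and the model name is the one used elsewhere in these docs:

```bash
# Minimal sketch of a GuideLLM run against a local vLLM server.
# The URL is an assumption; adjust it to wherever your server listens.
guidellm \
  --target "http://localhost:8000/v1" \
  --model "neuralmagic/Meta-Llama-3.1-8B-quantized.w8a8"
```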
- I would rename `flag` to `parameter`, since our CLI supports both parameters and flags: if you specify a flag, there is no value next to it; if you specify a parameter, the value is required.
- In some cases we may get an error if the tokenizer is not specified. I would add another item here. Text is below:
- The `--tokenizer` parameter specifies the tokenizer used to count the number of tokens in the dataset. If you face any issues, try using `--tokenizer neuralmagic/Meta-Llama-3.1-8B-quantized.w8a8`.
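For clarity, here is a sketch of the full command with the tokenizer specified explicitly (the target URL remains an assumption):

```bash
# Sketch: same run with an explicit tokenizer, which can avoid
# token-counting errors when the default tokenizer cannot be resolved.
guidellm \
  --target "http://localhost:8000/v1" \
  --model "neuralmagic/Meta-Llama-3.1-8B-quantized.w8a8" \
  --tokenizer "neuralmagic/Meta-Llama-3.1-8B-quantized.w8a8"
```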
Summary:
This pull request introduces the GuideLLM CLI guide, README enhancements, image uploads, and supported-backends documentation that highlights all the backends that can be used with GuideLLM.
Test Cases:
- The GuideLLM CLI has been tested with various LLM models and backends.
- Unit tests ensure core functionality works as expected.
Documentation:
- Created documentation detailing GuideLLM CLI usage and output metrics.
- Created documentation detailing the OpenAI-compatible API/HTTP pathway for TGI, llama.cpp, and DeepSparse in `supported_backends.md`.
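As a quick illustration of that OpenAI-compatible pathway, the sketch below points GuideLLM at a llama.cpp server; the model file, port, and served model name are all assumptions for illustration:

```bash
# Sketch: benchmarking an OpenAI-compatible backend other than vLLM.
# llama.cpp's bundled HTTP server exposes /v1 endpoints; the model
# file, port, and model name below are illustrative assumptions.
llama-server -m ./models/llama-3.1-8b-instruct.Q4_K_M.gguf --port 8080 &

guidellm \
  --target "http://localhost:8080/v1" \
  --model "llama-3.1-8b-instruct"
```

The same pattern applies to TGI and DeepSparse: start the server, then point `--target` at its OpenAI-compatible base URL.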
Additional Information:
- The pull request includes changes to the `docs/guides` directory for the CLI documentation.
- Binary files containing performance summary visualizations are added to the `docs/assets` directory.
Please review and provide feedback.