
Research: bench of the llamacpp #10405

Open
1 of 5 tasks
jumbo-q opened this issue Nov 19, 2024 · 0 comments

Comments


jumbo-q commented Nov 19, 2024

Research Stage

  • Background Research (Let's try to avoid reinventing the wheel)
  • Hypothesis Formed (How do you think this will work, and what will its effect be?)
  • Strategy / Implementation Forming
  • Analysis of results
  • Debrief / Documentation (So people in the future can learn from us)

Previous existing literature and research

There are two questions:

1. Is the content of the batch input self-defined (as in some other inference frameworks), is it drawn from a specific dataset, or is it produced by some other operation?
2. The output only reports the average and variance of the time per token. How is this time calculated? Is it the mean and variance over multiple runs? Also, what part of the execution is being timed, i.e. from which point to which point is the timing measured? (See the sketch below for the interpretation being asked about.)
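To make the second question concrete, here is a minimal, self-contained sketch of the interpretation being asked about: time N repeated passes and report the mean ± sample standard deviation of tokens per second across them. This is not llama-bench's actual code; `run_once_tokens_per_sec` is a hypothetical helper whose sleep stands in for the real decode work, though the `-r` repetitions flag referenced in the comments is a real llama-bench option.

```cpp
#include <chrono>
#include <cmath>
#include <cstdio>
#include <thread>
#include <vector>

// Hypothetical stand-in for one timed pass (e.g. decoding n_tokens
// one at a time); the sleep simulates work so the sketch runs standalone.
static double run_once_tokens_per_sec(int n_tokens) {
    auto t0 = std::chrono::steady_clock::now();
    std::this_thread::sleep_for(std::chrono::milliseconds(n_tokens)); // ~1 ms per "token"
    auto t1 = std::chrono::steady_clock::now();
    double secs = std::chrono::duration<double>(t1 - t0).count();
    return n_tokens / secs;
}

int main() {
    const int n_reps   = 5;   // analogous to llama-bench's -r (repetitions) flag
    const int n_tokens = 128;

    std::vector<double> tps;
    for (int i = 0; i < n_reps; ++i) {
        tps.push_back(run_once_tokens_per_sec(n_tokens));
    }

    // Mean and sample standard deviation across the repetitions,
    // i.e. the aggregation an "avg ± stddev t/s" column implies.
    double mean = 0.0;
    for (double v : tps) mean += v;
    mean /= tps.size();

    double var = 0.0;
    for (double v : tps) var += (v - mean) * (v - mean);
    var /= (tps.size() - 1);

    printf("%.2f ± %.2f t/s over %d runs\n", mean, std::sqrt(var), n_reps);
    return 0;
}
```

If this is how the numbers are produced, the remaining question is exactly where `t0`/`t1` sit in the real code: around the token-generation loop only, or also covering prompt processing and sampling.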

Hypothesis

No response

Implementation

No response

Analysis

No response

Relevant log output

No response
