Add the num_inference_steps to the AI API for text-to-image endpoint #80
@mikezupper, thanks a lot for this pull request. I reviewed it, and it works perfectly 🚀. I did, however, add one commit to handle an edge case that could crash the pipeline. Additionally, I changed the default `num_inference_steps` from 25 to 50 to match the default set by the diffusers library. Once this issue gets resolved, we can remove the default value from our codebase altogether.
2e89e06 to 0e97245
added the generated runner code
final changes for num_inference_steps
removed debug logging
fixed spacing
fixed spacing
Ensure the pipeline does not crash when the `num_inference_steps` argument is set to a value lower than 1. Update the default `num_inference_steps` to align with diffusion model defaults.
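The commit message does not show how the edge case is handled, so the following is only a minimal sketch of one possible guard. The helper name `drop_invalid_steps` and the plain-dictionary shape of the arguments are assumptions made for illustration, not the runner's actual code. The idea is to discard an unusable `num_inference_steps` so the downstream pipeline can fall back to its own default.

```python
from typing import Any, Dict


def drop_invalid_steps(kwargs: Dict[str, Any]) -> Dict[str, Any]:
    """Return a copy of kwargs without an unusable num_inference_steps.

    Hypothetical helper: the name and the dict-based interface are
    assumptions for illustration only.
    """
    cleaned = dict(kwargs)  # copy so the caller's dict is left untouched
    steps = cleaned.get("num_inference_steps")
    if "num_inference_steps" in cleaned and (steps is None or steps < 1):
        # Omit the key so the downstream pipeline applies its own default.
        del cleaned["num_inference_steps"]
    return cleaned
```

Removing the key, rather than substituting a hard-coded number, keeps the default in a single place, which fits the stated goal of eventually dropping the default from the codebase.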
0e97245 to 8b38b4c
This commit updates the AI subnet OpenApi spec so that the reference documentation correctly represents the new `num_inference_steps` parameter introduced in livepeer/ai-worker#80.
* docs(ai): add on-chain discovery documentation
  This commit adds documentation on how Orchestrators can broadcast their AI subnet service URIs to AI subnet Gateways, enabling their discovery on the subnet for job execution.
* docs(ai): improve subsection titles and discovery text
  This commit improves the subsection titles and Discovery text.
* docs(ai): improve introduction text
* ci(ai): add AI pipelines document structure
* docs(ai): add artibtrum explorer discovery steps
  This commit adds a guide on how AI nodes can advertise their AI URI using the Arbitrum explorer.
* docs(ai): add AI API reference
  This commit adds an initial draft for the AI Subnet API reference. It is not yet linked to an upstream API endpoint so the playground is not yet working.
* chore(ai): cleanup redundant OpenAPI entries
* ci(ai): add AI docs codeowners
* docs(ai): add pipelines documentation
  This commit adds the initial draft for the documentation that explains the pipelines available on the network.
* docs(ai): fix broken images
  This commit ensures that images are working when deployed on the remote.
* feat(ai): temporarily use external paths for pipeline images
  This commit replaces the relative image paths used in the pipelines documentation with external paths. We can change this back if we found a way to use relative paths inside mermaid graphs.
* feat(ai): add contribution pages scaffold
  This commit adds a general scaffold for the contribution pages. It is a quick draft and should not yet be published to the upstream docs.
* feat(ai): add contribution pages scaffold (#553)
  This commit adds a general scaffold for the contribution pages. It is a quick draft and should not yet be published to the upstream docs.
* Livepeer.Cloud SPE Milestone #1 Deliverables: Add Gateway Docs
* docs(ai): improve AI subnet introduction
  This commit improves the AI subnet introduction page.
* fix(ai): fix kickstart collab card link
  This commit ensures that the collaboration card is linked correctly.
* refactor(ai): improve pipeline mermaid diagrams
  This commit improves the styling used in the pipeline mermaid diagrams.
* refactor(ai): improve pipeline documentation
  This commit updates the pipelines documentation to increase readability.
* fix(ai): implement temporary solution for mermaid image issue
  This commit implements a temporary solution to address the issue of images not displaying correctly when embedded in a mermaid graph on the deployed site. Can be replaced with relative path if mintlify fixes the issue (see https://mintlify-users.slack.com/archives/C05D00NS55F/p1714485030542069).
* feat(ai): add builders page
  This commit adds the builders page and a showcase to show dApps already build on the AI subnet.
* refactor(ai): add cloud API disclaimer
  This commit adds a small disclaimer to the API documentation that states that the livepeer.cloud API is not production ready.
* feat(ai): add orchestrator setup guide
  This commit adds the orchestrator setup guide.
* refactor(ai): improve showcase descriptions
  This commit improves the descriptions of the projects in the showcase.
* refactor(ai): highlight showcase keywords
  This commit highlights some keywords in the showcase descriptions.
* feat(ai): add AI gateway docs
  This commit adds the documentation for setting up a AI gateway on the AI subnet. It also improves the project structure.
* refactor(ai): adopt impersonal writing style (#556)
  This commit updates the text to reflect an impersonal writing style, consistent with the ecosystem's voice. It eliminates personal pronouns such as "our" and "we".
* fix(ai): fix broken AI gateway links
  This commit fixes several broken internal links in the AI gateway docs.
* refactor(ai): improve AI subnet introduction section
  This commit improves the readability of the AI subnet introduction section.
* refactor(ai): hide contribution/SDK pages and cleanup pipelines
  This commit hides the contribution and SDK pages and cleansup the pipeline pages.
* refactor(ai): improve text formatting
  This commit applies some last text formatting improvements.
* addressed review comments from Rick
* refactor(ai): add model parameter callout to API docs
  This commit adds a model parameter callout to the API docs to explain that not all parameters are used by all models.
* chore(ai): fix small typo
  This commit fixes some small typos.
* refactor(ai): fix spelling
  This commit fixes some small spelling errors.
* made Prerequisites - more concise
* chore(ai): add Analytics token
  This commit adds the Google Analytics GA4 token to the documentation config.
* docs(gateway): apply prettier mdx formatting
  This commit applies the prettier formatting suggestions on the Gateway docs (see https://prettier.io/blog/2021/11/25/2.5.0.html).
* refactor(ai): gateway former naming comment
  This commit makes it more clear that Gateways were formaly named Broadcasters.
* refactor(ai): simplify binary install
  This commit simplifies the binary install of the AI subnet by giving users one simple download url they can use to downloade the latest `ai-video` branch binary. This was made possible in livepeer/go-livepeer@fb9764b.
* docs(ai): apply community feedback
  This commit applies the feedback I received from the community about things that could be improved.
  Co-authored-by: Authority_Null <[email protected]>
  Co-authored-by: Chuck Chain <[email protected]>
* docs(ai): add RealViz warm comment
  This commit adds a comment letting users know that several orchestrators have the https://huggingface.co/SG161222/RealVisXL_V4.0 model warm on their GPU.
* docs(ai): fix incorrect RealViLightning model link
  This commit ensures that the RealVizLightning model link is pointing to the right url.
* chore(ai): update AI subnet OpenAPI spec
  This commit updates the AI subnet OpenApi spec so that the reference documentation correctly represents the new `num_inference_steps` parameter introduced in livepeer/ai-worker#80.
* docs: replace -broadcaster flag with -gateway in AI subnet documentation (#564)
  This commit replaces the `broadcaster` flag with the new `gateway` flag. See livepeer/go-livepeer#3048 for more information.
* docs(ai): remove pricePerUnit requirement for AI docs (#565)
  This commit removes the dependency on the `pricePerUnit` flag in the AI orchestrator documentation now that livepeer/go-livepeer#3047 has been merged.
* docs(ai): add RealVisXL to I2I pipeline
  This commit adds the [SG161222/RealVisXL_V4.0_Lightning](https://huggingface.co/SG161222/RealVisXL_V4.0_Lightning) model to the supported models on the I2I pipeline documentation page.
* docs: add Gateway network page
  This commit adds a Gateway page under the core-concepts category, explaining the role and functions of a Gateway.
* docs: improve Gateway configuration wording
* chore: apply prettier to Gateway node page
* Update gateways.mdx
  Clarified gateways as the type of node, and talked about hosted gateways as something that individual entities may run.
* docs: fix Gateway node page links
  This commit fixes some broken links in the Gateway node page.
* docs(ai): remove double subnet requirements text
  This commit removes duplicate text in the AI subnet requirements section.
* docs(ai): address community feedback
  This commit clarifies points that the community identified as unclear.
* docs(ai): enhance introduction
  This commit strengthens the AI Subnet introduction, providing clearer and more detailed information about its capabilities and benefits.
* docs(ai): add DeepCache optimization method
  This commit adds the DeepCache optimization method to the AI models configuration page. This method can be used to speedup the pipelines to up to 50%.
* docs(ai): replace dApp with application
  This commit replaces the parts that mention dApps with applications since the AI subnet can serve both web2 and web3 applications.
* docs(ai): ensure GPU binaries are listed
  This commit ensures the GPU binaries are listed in the Orchestrator binary setup guide.
* docs(ai): fix incorrect binary format
  This commit fixes a small syntax error in the AI subnet GPU binary download url.
* docs(ai): add HuggingFace token read permission comment
  This commit ensures that people are aware that they only need to create a HuggingFace token with read permissions.
* docs(ai): improve introduction
  This commit improves the introduction to make clear that orchestrators can have multiple workers. It also improves the diagram.
* docs(ai): change dApp to app
  This commit ensures that all references to dApp are replaced by app or application.
* docs(ai): apply some small text improvements
  This commit applies some small text improvements to the AI documentation.
* docs(ai): update AI API reference
  This commit updates the AI API reference to include the new `nsfw` feature (see livepeer/ai-worker@272ac74).
* refactor(ai) improve introduction wording (#570)
  * Update verbiage in introduction
  * docs(ai): mention AI Video subnet
  ---------
  Co-authored-by: Elite Encoder <[email protected]>
---------
Co-authored-by: Mike Zupper <[email protected]>
Co-authored-by: Authority_Null <[email protected]>
Co-authored-by: Chuck Chain <[email protected]>
Co-authored-by: Elite Encoder <[email protected]>
Co-authored-by: Mike Zupper <[email protected]>
Co-authored-by: Doug Petkanics <[email protected]>
* add num_inference_steps to text-to-image endpoints
  added the generated runner code
  final changes for num_inference_steps
  removed debug logging
  fixed spacing
  fixed spacing
* refactor(runner): handle 'num_inference_steps' edge cases
  Ensure the pipeline does not crash when the `num_inference_steps` argument is set to a value lower than 1. Update the default `num_inference_steps` to align with diffusion model defaults.
---------
Co-authored-by: Rick Staa <[email protected]>
This change enables higher-quality image generation on the AI text-to-image endpoint by exposing the `num_inference_steps` parameter. If no value is provided, the parameter is set to a sensible default of 25.
This change will require the Gateway and Orchestrator nodes to be updated.
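As a rough illustration of how a client might use the new parameter, the sketch below builds a JSON request body for the text-to-image endpoint. Only `num_inference_steps` comes from this pull request; the `prompt` field name and the helper `build_text_to_image_payload` are assumptions for illustration, and no request is actually sent.

```python
import json
from typing import Any, Dict, Optional


def build_text_to_image_payload(
    prompt: str, num_inference_steps: Optional[int] = None
) -> str:
    """Build a JSON body for a text-to-image request.

    Hypothetical helper: the `prompt` field name is an assumption;
    only `num_inference_steps` is taken from this pull request.
    """
    body: Dict[str, Any] = {"prompt": prompt}
    if num_inference_steps is not None:
        # Send the field only when the caller chose a value, so the
        # server-side default applies otherwise.
        body["num_inference_steps"] = num_inference_steps
    return json.dumps(body)


if __name__ == "__main__":
    print(build_text_to_image_payload("a lighthouse at dusk", 40))
```

Leaving the field out when no value is chosen means the client does not need to track whichever default the server currently uses.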