Ags web surfer #4340

Merged
merged 8 commits into from
Nov 27, 2024

Conversation

husseinmozannar
Contributor

@husseinmozannar husseinmozannar commented Nov 25, 2024

Why are these changes needed?

This PR adds WebSurfer to AutoGen Studio (AGS).

  • enable adding a WebSurfer agent to any team
  • enable viewing WebSurfer text messages in the AGS UI
  • enable viewing WebSurfer multimodal messages in the AGS UI

A WebSurfer agent can be added using the AGS team spec as follows:

{
  "version": "1.0.0",
  "component_type": "team",
  "name": "websurfer_team",
  "participants": [
    {
      "component_type": "agent",
      "name": "websurfer_agent",
      "agent_type": "MultimodalWebSurfer",
      "model_client": {
        "component_type": "model",
        "model": "gpt-4o-2024-08-06",
        "model_type": "OpenAIChatCompletionClient"
      },
      "headless": true,
      "logs_dir": "logs",
      "downloads_folder": "logs",
      "to_save_screenshots": false,
      "use_ocr": false,
      "animate_actions": false
    }
  ],
  "team_type": "RoundRobinGroupChat",
  "termination_condition": {
    "component_type": "termination",
    "termination_type": "MaxMessageTermination",
    "max_messages": 3
  }
}
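
As a quick sanity check before loading a spec like the one above into AGS, it can be parsed and inspected with plain Python. This is only a sketch: `summarize_team_spec` is a hypothetical helper, not part of AGS, and it assumes nothing beyond the field names shown in the JSON above.

```python
import json

# Abbreviated copy of the team spec shown above (WebSurfer options trimmed).
SPEC_JSON = """
{
  "version": "1.0.0",
  "component_type": "team",
  "name": "websurfer_team",
  "participants": [
    {
      "component_type": "agent",
      "name": "websurfer_agent",
      "agent_type": "MultimodalWebSurfer",
      "model_client": {
        "component_type": "model",
        "model": "gpt-4o-2024-08-06",
        "model_type": "OpenAIChatCompletionClient"
      },
      "headless": true
    }
  ],
  "team_type": "RoundRobinGroupChat",
  "termination_condition": {
    "component_type": "termination",
    "termination_type": "MaxMessageTermination",
    "max_messages": 3
  }
}
"""


def summarize_team_spec(spec: dict) -> dict:
    """Hypothetical helper: summarize team type, agents, and termination."""
    if spec.get("component_type") != "team":
        raise ValueError("expected a spec with component_type == 'team'")
    # Collect (name, agent_type) for every participant declared as an agent.
    agents = [
        (p["name"], p["agent_type"])
        for p in spec.get("participants", [])
        if p.get("component_type") == "agent"
    ]
    return {
        "team_type": spec["team_type"],
        "agents": agents,
        "termination": spec["termination_condition"]["termination_type"],
    }


summary = summarize_team_spec(json.loads(SPEC_JSON))
print(summary)
```

A check like this catches a misspelled key or a missing `termination_condition` before the spec reaches the UI.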

Related issue number

Closes #4290

@husseinmozannar
Contributor Author

Some current issues:

  • WebSurfer messages that include page metadata are too wide and break the view: the message pane becomes full width and hides the agent graph visualization

Not related to this, but things to fix later:

  • multi-line input
  • sessions should be in the left-hand sidebar, and there should be a top toggle for the conditions, to bring it more in line with current chat UIs out there.
  • the definition of the model client is not great: you want to define it separately somewhere, or have a default, so that you can easily add it to the agent.
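
One possible shape for the model client point, sketched under assumptions: keep model client definitions in a separate mapping and fill in a default when an agent spec omits `model_client` or refers to one by name. `MODEL_CLIENTS`, `DEFAULT_MODEL_CLIENT`, and `with_default_model_client` are hypothetical names, and referring to a client by string name is an assumption for illustration; none of this is existing AGS behavior.

```python
import copy

# Hypothetical registry of reusable model client definitions.
MODEL_CLIENTS = {
    "gpt-4o": {
        "component_type": "model",
        "model": "gpt-4o-2024-08-06",
        "model_type": "OpenAIChatCompletionClient",
    },
}
DEFAULT_MODEL_CLIENT = "gpt-4o"  # assumed default, used when none is given


def with_default_model_client(agent_spec: dict) -> dict:
    """Return a copy of agent_spec with model_client resolved to a full dict."""
    resolved = copy.deepcopy(agent_spec)
    client = resolved.get("model_client", DEFAULT_MODEL_CLIENT)
    if isinstance(client, str):
        # A string is treated as a name to look up in the registry.
        if client not in MODEL_CLIENTS:
            raise KeyError(f"unknown model client: {client!r}")
        resolved["model_client"] = copy.deepcopy(MODEL_CLIENTS[client])
    return resolved


agent = with_default_model_client(
    {
        "component_type": "agent",
        "name": "websurfer_agent",
        "agent_type": "MultimodalWebSurfer",
    }
)
print(agent["model_client"]["model"])
```

An inline `model_client` dict, as in the spec from this PR, passes through unchanged, so existing specs would keep working under this scheme.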

@victordibia
Collaborator

victordibia commented Nov 27, 2024

Looking good.

  • Fixed the width issue; also enabled text truncation and the ability to view images fullscreen.

Other issues

Overall, we should define a protocol for showing and tracking files generated by agents. Currently we stuff the b64 string of an image into the message itself. It would be useful to have a protocol, e.g., for storing files at some location and passing the URL in the message, so that the UI can view those files (ideally in a user-owned directory mounted by the web API) (cc @ekzhu).
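
A minimal sketch of what such a protocol could look like, assuming the agent's image arrives as a base64 string: decode it, write it to a directory, and put a small reference in the message instead of the payload. `store_agent_file`, the `/files` URL prefix, and the shape of the returned reference are all hypothetical, not an existing AGS API.

```python
import base64
import hashlib
import tempfile
from pathlib import Path


def store_agent_file(
    b64_data: str,
    files_dir: Path,
    suffix: str = ".png",
    base_url: str = "/files",  # assumed route under which the web API serves files_dir
) -> dict:
    """Write a base64 payload to files_dir and return a reference for the message."""
    raw = base64.b64decode(b64_data)
    # Content-addressed name: identical payloads map to the same file.
    digest = hashlib.sha256(raw).hexdigest()[:16]
    files_dir.mkdir(parents=True, exist_ok=True)
    path = files_dir / f"{digest}{suffix}"
    path.write_bytes(raw)
    return {
        "type": "file_ref",
        "url": f"{base_url}/{path.name}",
        "size_bytes": len(raw),
    }


# Usage: store a stand-in payload in a temporary directory.
with tempfile.TemporaryDirectory() as tmp:
    encoded = base64.b64encode(b"fake image bytes").decode("ascii")
    ref = store_agent_file(encoded, Path(tmp) / "agent_files")
    print(ref["url"])
```

With a reference like this, the message stays small and the UI fetches the image only when it is actually displayed.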

@victordibia victordibia marked this pull request as ready for review November 27, 2024 05:22
@husseinmozannar husseinmozannar merged commit 531ec38 into main Nov 27, 2024
43 checks passed
@husseinmozannar husseinmozannar deleted the ags_web_surfer branch November 27, 2024 05:24
Development

Successfully merging this pull request may close these issues.

Enable v1 support for WebSurfer Agent in AGS
2 participants