TrackGPT: Track What You Need in Videos via Text Prompts

Inspired by Track-Anything, TrackGPT lets users detect and track objects in videos using text prompts. It builds on GroundingDINO, DetGPT, Segment Anything, and XMem. Using DetGPT, TrackGPT interprets natural-language instructions and segments the objects of interest in video frames. Given a text instruction, it finds the specified object and tracks it throughout the video.
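
To make the data flow concrete, here is a minimal sketch of how the four components could be chained. It is an illustration under assumptions, not TrackGPT's actual API: the class `TextPromptTracker`, its field names, the box format, and the choice to detect on the first frame and then propagate the mask are all hypothetical. Each stage is passed in as a plain callable so the sketch runs without any of the real models.

```python
from dataclasses import dataclass
from typing import Any, Callable, List, Sequence, Tuple

Box = Tuple[float, float, float, float]  # assumed (x0, y0, x1, y1) box format
Frame = Any  # placeholder for an image array
Mask = Any   # placeholder for a segmentation mask


@dataclass
class TextPromptTracker:
    """Hypothetical wiring of the four stages; not the repository's real interface."""

    # DetGPT-like stage: (instruction, frame) -> short phrase naming the target object
    interpret: Callable[[str, Frame], str]
    # GroundingDINO-like stage: (frame, phrase) -> candidate boxes
    detect: Callable[[Frame, str], List[Box]]
    # Segment-Anything-like stage: (frame, boxes) -> mask for the target
    segment: Callable[[Frame, List[Box]], Mask]
    # XMem-like stage: (all frames, first-frame mask) -> one mask per frame
    propagate: Callable[[Sequence[Frame], Mask], List[Mask]]

    def track(self, frames: Sequence[Frame], instruction: str) -> List[Mask]:
        if not frames:
            return []
        first = frames[0]
        phrase = self.interpret(instruction, first)
        boxes = self.detect(first, phrase)
        if not boxes:
            return []  # nothing in the first frame matched the instruction
        mask = self.segment(first, boxes)
        return self.propagate(frames, mask)
```

Passing the stages in as callables keeps the sketch independent of how each model is actually loaded or invoked in the repository; real wrappers around the models would replace the placeholders.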

News

[2023-05-15] We made TrackGPT public!

Example:

Text prompt: What did Biden do to protect his health?
Text prompt: I want to track elon.
Text prompt: Help me focus on the man playing basketball.

Setup

Running the code requires at least one 40GB GPU or two 32GB GPUs.
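
The hardware requirement can be checked before launching anything heavy. The following is a small sketch, not part of the repository: the function names are hypothetical, and it assumes PyTorch is the framework in use. Memory is converted with decimal gigabytes because the memory a device reports can differ slightly from its nominal size; adjust the thresholds if your devices report less than their marketed capacity.

```python
from typing import List


def meets_gpu_requirement(mem_gb: List[float]) -> bool:
    """True if at least one GPU has >= 40 GB, or at least two GPUs have >= 32 GB."""
    big = sum(1 for m in mem_gb if m >= 40)
    medium = sum(1 for m in mem_gb if m >= 32)
    return big >= 1 or medium >= 2


def detect_gpu_memory_gb() -> List[float]:
    """Total memory of each visible CUDA device in decimal GB (empty if none)."""
    try:
        import torch  # assumption: PyTorch is installed in the environment
    except ImportError:
        return []
    if not torch.cuda.is_available():
        return []
    return [
        torch.cuda.get_device_properties(i).total_memory / 1e9
        for i in range(torch.cuda.device_count())
    ]


if __name__ == "__main__":
    gpus = detect_gpu_memory_gb()
    print("GPUs (GB):", [round(g, 1) for g in gpus])
    print("Requirement met:", meets_gpu_requirement(gpus))
```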

Detailed setup instructions have not been written yet.

License

This repository is released under the BSD 3-Clause License.
