HumanBodyPose

Examples

Download the demo resource

wget http://download.openmmlab.com/agentlego/human.jpg
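
If `wget` is not available, the same file can be fetched from Python. The following is a minimal sketch using only the standard library; the `download` helper is a hypothetical name for illustration and is not part of AgentLego.

```python
import urllib.request
from pathlib import Path


def download(url: str, dest: str) -> Path:
    """Fetch `url` into `dest` unless the file already exists; return the path."""
    path = Path(dest)
    if not path.exists():
        # urlretrieve streams the response body straight to the target file.
        urllib.request.urlretrieve(url, str(path))
    return path


# Example call, using the URL from the wget command above:
# download('http://download.openmmlab.com/agentlego/human.jpg', 'human.jpg')
```

Skipping the download when the file already exists keeps repeated runs of the examples from re-fetching the image.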

Use the tool directly (without agent)

from agentlego.apis import load_tool

# load tool
tool = load_tool('HumanBodyPose', device='cuda')

# apply tool
image = tool('human.jpg')
print(image)
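
The example above hardcodes `device='cuda'`. On a machine without a GPU, one option is to pick the device at runtime. This is a sketch only: `pick_device` is a hypothetical helper, and it assumes that `load_tool` also accepts `device='cpu'`, which this README does not confirm.

```python
def pick_device() -> str:
    """Return 'cuda' when PyTorch reports a usable GPU, otherwise 'cpu'."""
    try:
        import torch
    except ImportError:
        # Without PyTorch there is no way to query CUDA, so fall back to CPU.
        return 'cpu'
    return 'cuda' if torch.cuda.is_available() else 'cpu'


print(pick_device())

# Assumed usage (device='cpu' support is not confirmed by this README):
# tool = load_tool('HumanBodyPose', device=pick_device())
```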

With Lagent

from lagent import ReAct, GPTAPI, ActionExecutor
from agentlego.apis import load_tool

# load tools and build agent
# Set the `OPENAI_API_KEY` environment variable before running.
tool = load_tool('HumanBodyPose', device='cuda').to_lagent()
agent = ReAct(GPTAPI(temperature=0.), action_executor=ActionExecutor([tool]))

# Run the agent with the tool.
img_path = 'human.jpg'
ret = agent.chat(f'Extract pose of the human in the image {img_path}')
for step in ret.inner_steps[1:]:
    print('------')
    print(step['content'])

Set up

Before using the tool, install the required dependencies with the commands below.

pip install -U openmim
pip install git+https://github.com/jin-s13/xtcocoapi
mim install -U mmpose
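
To confirm that the installation succeeded before loading the tool, the packages can be looked up without importing them. This is a minimal sketch; `missing_modules` is a hypothetical helper, and the import names `mim`, `xtcocotools`, and `mmpose` are assumptions about what the three commands above install.

```python
from importlib.util import find_spec


def missing_modules(names):
    """Return the top-level module names that cannot be found on this system."""
    return [name for name in names if find_spec(name) is None]


# Assumed import names for the packages installed above.
required = ['mim', 'xtcocotools', 'mmpose']
print('missing:', missing_modules(required))
```

An empty list means every module was found. The same check can be reused for other tools by changing the `required` list.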

Reference

By default, this tool uses an RTMPose model. See the following paper for details.

@misc{jiang2023rtmpose,
      title={RTMPose: Real-Time Multi-Person Pose Estimation based on MMPose},
      author={Tao Jiang and Peng Lu and Li Zhang and Ningsheng Ma and Rui Han and Chengqi Lyu and Yining Li and Kai Chen},
      year={2023},
      eprint={2303.07399},
      archivePrefix={arXiv},
      primaryClass={cs.CV}
}

PoseToImage

Examples

Download the demo resource

wget http://download.openmmlab.com/agentlego/pose_demo.jpg

Use the tool directly (without agent)

from agentlego.apis import load_tool

# load tool
tool = load_tool('PoseToImage', device='cuda')

# apply tool
image = tool('pose_demo.jpg', 'A pretty dancing girl.')
print(image)

With Lagent

from lagent import ReAct, GPTAPI, ActionExecutor
from agentlego.apis import load_tool

# load tools and build agent
# Set the `OPENAI_API_KEY` environment variable before running.
tool = load_tool('PoseToImage', device='cuda').to_lagent()
agent = ReAct(GPTAPI(temperature=0.), action_executor=ActionExecutor([tool]))

# Run the agent with the tool.
img_path = 'pose_demo.jpg'
ret = agent.chat(f'According to the pose image `{img_path}`, draw a pretty dancing girl.')
for step in ret.inner_steps[1:]:
    print('------')
    print(step['content'])

Set up

Before using the tool, install the required dependency with the command below.

pip install -U diffusers

Reference

By default, this tool uses a ControlNet model. See the following paper for details.

@misc{zhang2023adding,
      title={Adding Conditional Control to Text-to-Image Diffusion Models},
      author={Lvmin Zhang and Maneesh Agrawala},
      year={2023},
      eprint={2302.05543},
      archivePrefix={arXiv},
      primaryClass={cs.CV}
}

HumanFaceLandmark

Examples

Download the demo resource

wget http://download.openmmlab.com/agentlego/face.png

Use the tool directly (without agent)

from agentlego.apis import load_tool

# load tool
tool = load_tool('HumanFaceLandmark', device='cuda')

# apply tool
face_landmark = tool('face.png')
print(face_landmark)

With Lagent

from lagent import ReAct, GPTAPI, ActionExecutor
from agentlego.apis import load_tool

# load tools and build agent
# Set the `OPENAI_API_KEY` environment variable before running.
tool = load_tool('HumanFaceLandmark', device='cuda').to_lagent()
agent = ReAct(GPTAPI(temperature=0.), action_executor=ActionExecutor([tool]))

# Run the agent with the tool.
img_path = 'face.png'
ret = agent.chat(f'Draw the face landmark of the human in the image `{img_path}`')
for step in ret.inner_steps[1:]:
    print('------')
    print(step['content'])

Set up

Before using the tool, install the required dependencies with the commands below.

pip install -U openmim
pip install git+https://github.com/jin-s13/xtcocoapi
mim install -U mmpose

Reference

By default, this tool uses an RTMPose model. See the following paper for details.

@misc{jiang2023rtmpose,
      title={RTMPose: Real-Time Multi-Person Pose Estimation based on MMPose},
      author={Tao Jiang and Peng Lu and Li Zhang and Ningsheng Ma and Rui Han and Chengqi Lyu and Yining Li and Kai Chen},
      year={2023},
      eprint={2303.07399},
      archivePrefix={arXiv},
      primaryClass={cs.CV}
}
