Autogen for image analysis with Claude3.5 API and MultimodalConversableAgent #4606

yananliusdu · 2024-12-08T10:08:25Z

yananliusdu
Dec 8, 2024

Is that possible in autogen to use MultimodalConversableAgent and Claude3.5 API for image analysis? I tried following two methods that all failed:
1: use such prompts message= f"""What's on the image? <img 1.PNG>. """ same with using the openai api. this did not work
2: according to the examples in https://docs.anthropic.com/en/docs/build-with-claude/vision#example-multiple-images, encode image into base64, which means convert image into str as an input for the api. However, it reported error 'TypeError: can only concatenate str (not "list") to str'

rysweet · 2024-12-09T18:37:54Z

rysweet
Dec 9, 2024
Collaborator

Hi @yananliusdu - unfortunately without a more complete stack trace we can't really see which call is throwing the TypeError. Please post the whole stack trace and we might be able to help.

0 replies

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Autogen for image analysis with Claude3.5 API and MultimodalConversableAgent #4606

Uh oh!

{{title}}

Uh oh!

Replies: 1 comment

Uh oh!

{{title}}

Uh oh!

Select a reply

Uh oh!

Autogen for image analysis with Claude3.5 API and MultimodalConversableAgent #4606

Uh oh!

yananliusdu Dec 8, 2024

Replies: 1 comment

Uh oh!

rysweet Dec 9, 2024 Collaborator

yananliusdu
Dec 8, 2024

rysweet
Dec 9, 2024
Collaborator