乐呵呵、欢的博客

lehhair's Blog

Pot-App 文字识别插件 GPT-vison

2024-05-27

Pot-App 文字识别插件 GPT-vison

水个博客,插件在这里下载: 点击下载
通过调用GPT-vision模型对图片进行识别

使用说明,从release下载解压后需要重命名成[plugin].com.pot-app.ocrspace.potext.potext才能安装

你可以这样填写参数

API KEY: sk-xxxxx

API Endpoint: https://api.openai.com/v1/chat/completions

Model: gpt-4o

自用的Prompt: Analyze the image. OCR any text; skip if none. Use "---" to separate OCR text from analysis. Provide analysis in Chinese.

Prompt: Please analyze the provided image, recognize and extract all the text content, and describe the main content of the image. Your response will be used as input for the next AI assistant or translation assistant. Please return the result in Chinese (cn).

or

Prompt: Please analyze the provided image, perform OCR to extract all text content, and describe the main content of the image. Return the result in Chinese. The OCR content should be at the beginning, in its original form, followed by the explanation in Chinese.

or

Prompt: Analyze the image, perform OCR to extract all text in [brackets], translate the text in [brackets], and describe the main content in Chinese, separated by ---.(推荐)

这里输出语言可以自行选择,也可以保持原文,看你喜好

Stream: False (首先这里true没有应用场景,其次会报错,仅作预留)