azure-ai-vision-imageanalysis-py
Azure AI Vision Image Analysis SDK for Python wraps the Azure AI Vision 4.
Setup & Installation
What This Skill Does
Azure AI Vision Image Analysis SDK for Python wraps the Azure AI Vision 4.0 API. It extracts captions, tags, detected objects with bounding boxes, OCR text, people locations, and suggested crop regions from images submitted by URL or file upload.
Running multiple vision tasks (caption, OCR, object detection) in one API call avoids stitching together separate models or services.
When to use it
- Generating alt text for uploaded product images automatically
- Extracting printed text from scanned receipts or forms
- Detecting people in security camera frames for downstream processing
- Getting suggested crop coordinates for profile photos at specific aspect ratios
- Tagging wildlife photographs by detected objects and scene content
Similar Skills
mcp-builder
A development guide for building MCP (Model Context Protocol) servers that connect LLMs to external APIs and services.
skill-creator
A skill for building, testing, and refining other skills.
template
A starter scaffold for building new agent skills.
answers
Provides AI-generated answers grounded in live web search results through Brave's OpenAI-compatible chat completions endpoint.
