Ui.Vision

Task and UI test automation with Computer Vision/OCR. Ui.Vision combines browser automation and desktop automation.

Ui.Vision: AI-Enhanced Automation for Browsers & Desktop

Ui.Vision is an open-source automation tool that combines browser and desktop automation with computer vision and OCR. With the new Anthropic Claude integration, complex tasks can be reduced to single-line commands. It offers visual UI testing, desktop automation, and a command-line API for integration, and it runs fully locally, with no subscriptions or cloud dependencies. Happy automating!

Add-on stats

Rating: 4.20 (13)
Version: 9.3.8 (Last updated: 2024-12-04)
Creation date: 2022-01-17
Risk impact: Very high risk impact
Risk likelihood:
Manifest version: 3
Permissions:
  • bookmarks
  • clipboardRead
  • clipboardWrite
  • cookies
  • debugger
  • downloads
  • downloads.ui
  • notifications
  • storage
  • tabs
Host permissions:
  • <all_urls>
Size: 5.59M

Other platforms

Ui.Vision (v9.3.8)
3.92 (217) 100,000
Not available on Android
Ui.Vision (v9.3.8)
4.30 (46) 4,773

Add-on summary

New Dec 5, 2024 Update: Anthropic Claude Computer Use Integration

Open-Source Ui.Vision has consistently been at the forefront of visual web automation. With Claude’s integration, we’re taking the next step forward. The aiComputerUse command allows you to automate complex tasks with a single line of code that would traditionally require hundreds of lines of classic Ui.Vision commands (such as XClick, OCRExtractScreenshot, If/then statements, and more). For example, you can teach Ui.Vision to play TicTacToe with just one short "Play this game..." prompt.
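
As a rough illustration of the single-line idea, the sketch below builds a one-command macro in Python and serializes it to JSON. The macro layout (a "Name" plus a "Commands" list of Command/Target/Value entries) and the choice to put the prompt in Target and a result variable name in Value are assumptions made for this example, not details confirmed by this listing; build_macro and result_var are hypothetical names.

```python
import json


def build_macro(name, prompt, result_var="result"):
    """Build a one-command macro as a plain dict.

    Assumed layout (not confirmed by the listing): a macro has a "Name"
    and a "Commands" list whose entries carry Command/Target/Value.
    """
    return {
        "Name": name,
        "Commands": [
            {
                "Command": "aiComputerUse",  # command name taken from the listing
                "Target": prompt,            # assumption: the prompt goes in Target
                "Value": result_var,         # assumption: variable that receives the result
            }
        ],
    }


# The prompt is quoted from the listing, elision included.
macro = build_macro("TicTacToe", "Play this game...")
macro_json = json.dumps(macro, indent=2)
print(macro_json)
```

The point of the sketch is only that the whole task is carried by a single command entry whose prompt replaces a long sequence of classic commands.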

Ui.Vision is open-source RPA (robotic process automation) software that combines classic browser automation with modern computer vision and OCR:

(1) Visual Browser Automation

Ui.Vision's visual UI testing commands assist web designers and developers in checking and ensuring the accuracy of website layouts and canvas elements. It can identify and read images and text within canvas elements, images, and videos.

(2) Visual Desktop Automation for Windows, Mac, and Linux

Beyond web browser automation, Ui.Vision uses image and text recognition (OCR) to automate browser extensions and desktop environments as well. It can interpret images and text on the desktop and perform actions such as clicking, moving the mouse, dragging and dropping, and simulating keyboard input.

User reviews

These summaries are automatically generated weekly using AI based on recent user reviews. The Edge Add-on Store does not verify user reviews, so some reviews may be inaccurate, spammy, or outdated.
Pros
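  • Can fully automate all kinds of operations; extremely versatile
  • Works well for simple web pages
  • Feature-rich, with support for storing and recalling variables
  • Very helpful for batch operations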
Cons
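  • Limited automation capability for complex websites and scenarios
  • The recorder does not support complex elements such as pop-up buttons
  • No example is provided for running scripts on a schedule
  • Free-version limits, such as the cap of 100 online OCR uses per day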
Most mentioned
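  • Automating operations
  • Rich feature set, but with a learning curve
  • Weaknesses include poor support for complex scenarios and restrictive free features
  • Requests for switching the interface language (Chinese support)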
User reviews
It is incredibly versatile; it can fully automate things like ミライシード.

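Works well for simple web pages. For desktop clients or complex scenarios, you could try 小瓶RPA, which supports unattended scheduled tasks.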
by github, 2023-04-21

In short, I really love this add-on, but right now it's not opening when I click its icon in the Edge browser. I disabled and re-enabled the add-on, but it still won't open.
by Jayaprakash, 2022-07-12

Add-on safety

Risk impact

Ui.Vision requires a lot of sensitive permissions. Exercise caution before installing.

Risk likelihood

Ui.Vision is probably trustworthy. Prefer other publishers if available. Exercise caution when installing this add-on.

Similar add-ons

Here are some Edge add-ons that are similar to Ui.Vision: