VScan
A visual perception layer for the blind
New in version 0.2.3
- There is now a standalone editor for entering the system prompt and user prompt. This editor has a large text field, which should make it easy to work with long and complex prompts.
- Various UI improvements and bug fixes.
This is a small project of mine that explores how vision LLMs could assist blind people while traveling and in their everyday life by substituting for eyesight in various visual tasks. VScan turns your smartphone's camera into a device for visual perception. You can define various optical cognitive functions, such as looking for objects or signs, evaluating a scene, or simply relaying visual impressions. You can then use these functions on the camera view, just as a sighted person would use their eyes to achieve a specific goal in the physical world.
Each cognitive tool consists of two major parts:
- The camera to be used (front or back), as well as camera parameters such as resolution and flashlight.
- The prompts used for LLM processing. The LLM is the bridge between raw pixel data and your interpretation of it; in the user and system prompts, you can specify what exactly you are interested in for the particular function and how it should be communicated, as well as which LLM model should be used.
Camera input combined with an LLM processing prompt forms a cognitive function that can serve a variety of visual tasks.
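The pairing described above (camera settings plus LLM prompts) can be sketched as a small data model. This is a purely illustrative sketch in Kotlin; all names and fields here are assumptions for clarity, not VScan's actual API.

```kotlin
// Hypothetical model of a VScan "cognitive function": camera configuration
// plus the prompts that tell the vision LLM what to look for and how to answer.
// All identifiers are illustrative assumptions, not VScan's real code.

enum class Lens { FRONT, BACK }

data class CameraConfig(
    val lens: Lens = Lens.BACK,
    val resolution: Pair<Int, Int> = 1280 to 720,  // width x height
    val flashlight: Boolean = false,
)

data class PromptConfig(
    val model: String,         // which vision LLM to query
    val systemPrompt: String,  // the function's role and focus
    val userPrompt: String,    // what to report and how to phrase it
)

data class CognitiveFunction(
    val name: String,
    val camera: CameraConfig,
    val prompts: PromptConfig,
)

fun main() {
    // Example: a function for locating exit signs while traveling.
    val findExit = CognitiveFunction(
        name = "Find the exit sign",
        camera = CameraConfig(lens = Lens.BACK, flashlight = true),
        prompts = PromptConfig(
            model = "example-vision-model",
            systemPrompt = "You describe images for a blind traveler.",
            userPrompt = "Is there an exit sign in view? If so, where is it relative to me?",
        ),
    )
    println(findExit.name)
}
```

Modeling a function as plain data like this would let the app store, edit, and reuse any number of user-defined functions independently of the camera and LLM backends.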
VScan is open-source software. Visit the project's official repository to learn more about its background, motivation, specific usage examples and setup instructions.
Versions
Supported ABIs: arm64-v8a, armeabi-v7a, x86, x86_64. This version requires Android 7.0 or newer.
This package is built and signed by the original developer and is guaranteed to correspond to this source tarball.