安装包图标

VScan

A visual perception layer for the blind
新版本 0.2.3
- There is now a standalone editor for entering the system prompt and user prompt. This editor has a large text field, which should make it easy to work with long and complex prompts.
- Various UI improvements and bug fixes.

This is a little project of mine aiming to research how vision LLMs could help out blind people on travel and in their every-day life by substituting eyesight for various visual tasks. VScan turns your smartphone's camera into a device for visual perception. You can define various optical cognitive functions, like looking for objects, signs, evaluating a scene or simply mediating visual impressions. You can afterwards use these functions on the camera view, just like a sighted person would use their eyes to achieve a specific goal in the physical world.


Each cognitive tool consists of two major parts:

  • The camera to be used - front / back, as well as camera parameters - resolution, flashlight etc.

  • The prompts used for LLM processing. LLM is the bridge between raw pixel data and your interpretation of it, and in the user/system prompt, you can specify what exactly are you interested in for the particular function and how should it be communicated, as well as the LLM model that should be used.


Camera input in combination with an LLM processing prompt forms a cognitive function, which can be used to serve various visual tasks.


VScan is open-source software. Visit the project's official repository to learn more about its background, motivation, specific usage examples and setup instructions.

版本

尽管下面提供了 APK 安装包的下载选项,但你应该注意,以这种方式安装将不会收到更新通知,这是一种不太安全的下载方式。 我们建议你安装使用 F-Droid 客户端。

下载 F-Droid
  • 版本 0.2.3 (23) 推荐 更新于 2025-10-18

    arm64-v8a armeabi-v7a x86 x86_64

    该版本需要 Android 7.0 及以上版本。

    此包由原始开发者构建并签名,并保证对应于此源代码 tarball

    权限
    • 拍摄照片和视频
      当你使用此应用时,它可以使用相机拍摄照片和录制视频。
    • 以高采样率访问传感器数据
      允许应用以高于 200 Hz 的频率对传感器数据进行采样
    • 拥有完全的网络访问权限
      允许此应用创建网络套接字和使用自定义网络协议。浏览器和其他应用提供了将数据发送到互联网的方法,因此不需要此权限将数据发送到互联网。
    • android.permission.READ_EXTERNAlSTORAGE
    • 录音
      当你使用此应用时,它可以使用麦克风录音。
    • android.permission.WRITE_EXTERNAlSTORAGE
    • com.rastislavkish.vscan.DYNAMIC_RECEIVER_NOT_EXPORTED_PERMISSION

    下载 APK 6.6 MiB PGP 签名 | 构建日志