当前位置：首页 > backend >正文

ASR（语音识别）语音/字幕标注通过via（via_subtitle_annotator）

backend 2025/6/23 18:52:46

文章目录

1 VIA 官网资料
2 语音/字幕标注
3 键盘快捷键
- 常规
- 当一个时间片段被选中时
- 图像或视频帧中的空间区域

1 VIA 官网资料

VIA官网：https://www.robots.ox.ac.uk/~vgg/software/via/
VIA官网标注示例：https://www.robots.ox.ac.uk/~vgg/software/via/demo/via_subtitle_annotator.html

在这里插入图片描述

2 语音/字幕标注

在线标注网站：https://whiffe.github.io/VIA/via_subtitle_annotator.html

下面的操作，参考操作视频：https://www.bilibili.com/video/BV1CrTazFETv/

首先选择视频
在这里插入图片描述

然后按 A ，添加时间片段

空格键控制是否播放，通过听音频，来调整时间片段长度。

再通过点击下图的显示的：>，来填入字幕内容

多标注几个，如下：
在这里插入图片描述
然后点击保存，保存的json名字要和视频名一直（视频叫0001，那么json文件也要叫0001）

刷新网页，重新加载保存了的json文件

3 键盘快捷键

常规

在这里插入图片描述

键盘快捷键（Keyboard Shortcuts）
常规（General）

命令（Command）	快捷键（Shortcut）
播放/暂停媒体（Play/Pause Media）	空格（Space）
切换媒体静音（Toggle Media Mute）	m
提高/降低媒体播放速度（Increase / Decrease Media Playback Speed）	+ / -
将媒体时间向后移动1-9秒（按住Ctrl向前移动）[Move Media Time Backward by 1,…,9 sec. (Ctrl to move forward)]	Ctrl + 1、2……9
将媒体时间向前或向后移动1帧（按住Shift移动5帧）[Move Media Time Forward or Backward by 1 frame (Shift to move by 5 frames)]	Shift + n / p
在当前时间添加时间片段（Add Temporal Segment at Current Time）	a
将最后添加片段的边缘（左或右）更新到当前时间[Update the edge (left or right) of last added segment to current time]	Shift + a
选择上一个/下一个时间片段时间轴（Select Previous / Next Temporal Segment Timeline）	↑ / ↓
选择[上一个]下一个时间片段（如3秒到5秒）[Select [Previous] Next Temporal Segment (e.g. 3sec to 5sec)]	Shift + Tab
选择当前时间的时间片段（如有）[Select Temporal Segment at Current Time (if any)]	Enter
移动到上一个/下一个视频帧（Move to Previous / Next Video Frame）	l / r
跳转到视频开头/结尾（Jump to Start/End of Video）	Shift + s / e
将可见时间轴移动1秒（Shift Visible Timeline by 1 sec.）	← / →
将可见时间轴移动60秒（Shift Visible Timeline by 60 sec.）	Shift + ← / →
放大/缩小时间片段时间轴（Zoom In/Out the Temporal Segment Timeline）	鼠标滚轮（Mouse Wheel）
水平平移时间片段时间轴（Pan the Temporal Segment Timeline Horizontally）	Shift + 鼠标滚轮（Mouse Wheel）

当一个时间片段被选中时

在这里插入图片描述
当一个时间片段被选中时

命令（Command）	快捷键（Shortcut）
播放/暂停锁定到片段边界的视频（Play/Pause Video Locked to Segment Boundary）	Spc
删除选中的时间片段（Delete Selected Temporal Segment）	Backspace
选择[上一个]下一个时间片段（Select [Previous] Next Temporal Segment）	[Shift] + Tab
取消选择时间片段（Unselect Temporal Segment）	Esc
增加/减少左边缘范围（按住Ctrl以1秒为单位更新）[Increase/Decrease the Extent of Left Edge (Ctrl updates by 1 sec.)]	[Ctrl] + ↑ / ↓
增加/减少右边缘范围（按住Ctrl以1秒为单位更新）[Increase/Decrease the Extent of Right edge (Ctrl updates by 1 sec.)]	[Ctrl] + → / ←
跳转到时间片段的开头/结尾（Jump to Start/End of Temporal Segment）	s / e
移动选中的时间片段（按住Ctrl以1秒为单位更新）[Move Selected Temporal Segment (Ctrl updates by 1 sec.)]	[Ctrl] + ← / →
将选中的时间片段与左侧/右侧的片段合并（Merge Selected Temporal Segment with the Segment on Left/Right）	Shift + ← / →
在当前播放位置拆分时间片段（Split temporal segment at current playback position）	x