Changelog
📣 Subsequent update logs will be moved to Release, and will no longer be updated here
🚩2023-10-08 v2.2.8 update
- Adapt the relevant parameters of
rapidocr_onnxruntime
, which can be passed in through the RapidVideOCR class, so as to more flexibly specify models of different languages.
♦ 2023-08-05 v2.2.4 update
-
Fix the index error in batch recognition mode.
-
Add a logging module to facilitate the use of the desktop version, quickly record problems, and facilitate feedback.
🛶2023-07-19 v2.2.3 update
- Added adaptation to VSF parameters. When in command line mode and during class initialization, you can specify the same-name parameters of the VSF command. For detailed usage, please refer to link
🤓2023-07-08 v2.2.2 update
- Fixed the problem that Chinese paths cannot be read during batch recognition
- Fixed the problem of skipping in SRT when missing axes. Currently, when an axis fails to be recognized, a position will be vacated for easy proofreading.
- Keep the intermediate results of VSF recognition
🐲2023-06-22 v2.2.0 update
- This version is compatible with
v2.1.x
, which means that the previous usage is still possible. - Integrate VSF’s CLI into the library, just specify the full path of
VideoSubFinderWXW.exe
. - Added batch recognition function, specify the video directory, and automatically extract all video subtitles in the directory
- For usage examples, see: demo.py
😀2023-05-12 v2.1.7 update
- Optimized code
- Added
save_name
parameter, which can flexibly specify the name of the savedsrt | txt
file, the default isresult
🐱2023-03-27 v2.1.6 update
- Fixed the timeline misalignment problem, see issue 23 for details
👽2023-03-23 v2.1.5 update
- Added control parameter
is_print_console
for printing to the screen - Adjust the
out_format
parameter position to the initialization class
😀2023-03-14 v2.1.3 update
- Fix the error when passing in the
TXTImages
directory and identifying duplicate characters
😜2023-03-12 v2.1.2 update
- Fix index error, #22
🎢2023-03-11 v2.1.1 update
-
Fix the difference between single image recognition and previous versions
-
The default recognition mode is changed to single image recognition. Whether to use duplicate image recognition is up to you
🥇2023-03-10 v2.1.0 update
- Added duplicate character recognition function, faster, and the default is duplicate character recognition function
🎈2023-03-02 v2.0.5~7 update
- Fix format errors in generated srt files, #19
🎫2023-02-17 v2.0.4 update
- Optimized passing in the
TXTImages
directory. Compared with passing inRGBImages
, it will be faster and more accurate. It is recommended to pass in theTXTImages
directory
💎2023-02-17 v2.0.2 update
- Fix the problem of missing spaces in peer subtitle recognition
🎈2023-01-29 v1.1.10 update
- Fix the error of empty index when converting frame index to timestamp
🧨2023-01-28 v1.1.9 update
- Fix the problem of timeline misalignment
👊 2023-01-15 v1.1.4 update
-
Add the option of outputting txt format. Currently, the v1.1.4 version outputs srt and txt formats by default
-
Add the option to adjust the box size of subtitle selection according to the screen size of the running program
🌈2023-01-10 v1.0.3 update
- Replace decord with OpenCV because decord has a memory leak when processing MP4. For details, see: #208
🎄2022-12-04 update
- Added interactive subtitle positioning function, which is enabled by default and is more user-friendly. For details, please refer to the GIF below. Thanks to @Johndirr for the suggestion.
- Optimize the code structure, put RapidOCR related models and configuration files in the
rapidocr
directory - The configuration files of
rapidvideocr
are also placed in the corresponding directory
🌼2022-05-08 update
- Add interactive determination of the threshold of the binary subtitle image, which only supports Windows and can be used by
is_select_threshold = True
- Optimize the code
🎉2022-05-03 update
- Add GPU support, for specific configuration tutorials, see: onnxruntime-gpu version inference configuration
- Added support for Japanese, which can support more languages. For details, see: Supported Language List
💡2022-05-01 update
-
Added speech module part is located in the branch
asr_module
-
Added speech recognition module. Since the decoding part of this module can only run on Linux and Mac, if you want to use this module, please use Linux and Mac.
-
Currently, the speech recognition code comes from the RapidASR/python part. The model comes from PaddleSpeech
-
After a simple test, the speech recognition module is not very accurate. -_-!
2022-03-09 update
- Added FAQ module to help everyone skip common small problems
2021-12-14 update
-
Background removal is not effective, resulting in the loss of some frames
-
Tried to use the image segmentation method. After testing, the inference speed under CPU is too slow, so it is temporarily abandoned
-
Currently using a fixed binary threshold
-
(2021-12-14) Improve the corresponding English document
-
(2021-12-14) Add running time benchmark
-
Add specific parameter description
-
Make a project logo
-
More tests
Last updated 21 May 2025, 19:10 -0600 .