XiangBai 发表于 2017-2-11 14:15:24

[开源代码与数据集]场景文字检测与识别(from McLab)

本帖最后由 XiangBai 于 2018-11-10 23:32 编辑



端到端场景文本识别
[*]M. Liao, B. Shi, X. Bai, X. Wang, W. Liu. TextBoxes: A fast text detector with a single deep neural network. In: Proceedings of the 31st AAAI Conference on Artificial Intelligence (AAAI'17), San Francisco, California, 2017. (Oral Presentation) [code]
[*]M. Liao, B. Shi, X. Bai. TextBoxes++: A single-shot oriented scene text detector. IEEE Transactions on Image Processing (TIP), 27(8): 3676-3690, 2018 . [code]
[*]P. Lyu, M. Liao, C. Yao, W. Wu, X. Bai. Mask textspotter: An end-to-end trainable neural network for spotting text with arbitrary shapes. In: Proceedings of European Conference on Computer Vision (ECCV'18), Munich, Germany, 2018.


场景文本检测

[*]M. Liao, Z. Zhu, B. Shi, G. Xia, X. Bai. Rotation-sensitive regression for oriented scene text detection. In: Proceedings of the 31th IEEE Conference on Computer Vision and Pattern Recognition (CVPR'18), Salt Lake, Utah, 2018. [code]
[*]B. Shi, X. Bai, S. Belongie. Detecting oriented text in natural images by linking segments. In: Proceedings of the 30th IEEE Conference on Computer Vision and Pattern Recognition (CVPR'17), Honolulu, Hawaii, 2017. (Spotlight) [ppt] [code]
[*]Z. Zhang, C. Zhang, W. Shen, C. Yao, W. Liu, X. Bai. Multi-oriented text detection with fully convolutional networks. In: Proceedings of the 29th IEEE Conference on Computer Vision and Pattern Recognition (CVPR'16), Las Vegas, 2016. [code]
[*]Z. Zhang, W. Shen, C. Yao, X. Bai. Symmetry-based text line detection in natural scenes. In: Proceedings of IEEE Conference on Computer Vision and Pattern Recognition (CVPR'15), Boston, MA, June 2015.[code]


场景文本识别

[*]B. Shi, M. Yang, X. Wang, P. Lyu, C. Yao, X. Bai. ASTER: An attentional scene text recognizer with flexible rectification. IEEE Transactions on Pattern Analysis and Machine Intelligence (PAMI), to appear. [code]
[*]B. Shi, X. Bai, C. Yao. An end-to-end trainable neural network for image-based sequence recognition and its application to scene text recognition. IEEE Transactions on Pattern Analysis and Machine Intelligence (PAMI), accepted. [Music Score Recognition Datasets] [code]
[*]B. Shi, X. Wang, P. Lv, C. Yao, X. Bai. Robust scene text recognition with automatic rectification. In: Proceedings of the 29th IEEE Conference on Computer Vision and Pattern Recognition (CVPR'16), Las Vegas, 2016. (代码整理中)
[*]X. Bai, C. Yao, W. Liu. Strokelets: A learned multi-scale mid-level representation for scene text recognition. IEEE Transactions on Image Prococessing (TIP), 25(6): 2789-2802, 2016. [code][readme]


中文场景文本检测与识别数据集Dataset: (http://mclab.eic.hust.edu.cn/icdar2017chinese/)
Competition Report: ICDAR2017 Competition on Reading Chinese Text in the Wild (RCTW-17). B Shi, C Yao, M Liao, M Yang, P Xu, L Cui, S Belongie, S Lu, X Bai (arXiv preprint arXiv:1708.09585)

场景语种识别数据集

[*]B. Shi, X. Bai, C. Yao. Script identification in the wild via discriminative convolutional neural network. Pattern Recognition (PR), 52: 448-458, 2016. [project page & datasets]


多方向文本检测数据集(MSRA-TD 500)

[*]C. Yao, X. Bai, W. Liu, Y. Ma, Z. Tu. Detecting texts of arbitrary orientations in natural images. In: Proceedings of IEEE Conference on Computer Vision and Pattern Recognition (CVPR'12), Providence, Rhode Island, 2012. (MSRA-TD 500 Dataset)



多方向文本识别数据集(HUST-TR 400)

[*]C. Yao, X. Bai, W. Liu. A unified framework for multi-oriented text detection and recognition. IEEE Transactions on Image Processing (TIP), 23(11): 4737-4749, 2014.




后续会继续更新








页: [1]
查看完整版本: [开源代码与数据集]场景文字检测与识别(from McLab)