Authors
Adam Brzeski, Kamil Grinholc, Kamil Nowodworski, Adam Przybyłek
Publication date
2019
Conference
Computer Information Systems and Industrial Management: 18th International Conference, CISIM 2019, Belgrade, Serbia, September 19–21, 2019, Proceedings 18
Pages
3-11
Publisher
Springer International Publishing
Description
In this paper we evaluated a set of potential improvements to the successful Attention-OCR architecture, designed to predict multiline text from unconstrained scenes in real-world images. We investigated the impact of several optimizations on model’s accuracy, including employing dynamic RNNs (Recurrent Neural Networks), scheduled sampling, BiLSTM (Bidirectional Long Short-Term Memory) and a modified attention model. BiLSTM was found to slightly increase the accuracy, while dynamic RNNs and a simpler attention model provided a significant training time reduction with only a slight decline in accuracy.
Total citations
201920202021325
Scholar articles
A Brzeski, K Grinholc, K Nowodworski, A Przybyłek - … Information Systems and Industrial Management: 18th …, 2019