Using BLSTM for interpretation of 2-D languages
Case of handwritten mathematical expressions
Pages 135 à 157
Citer cet article
- ZHANG, Ting,
- MOUCHÈRE, Harold
- et VIARD-GAUDIN, Christian,
- Zhang, Ting.,
- et al.
- Zhang, T.,
- Mouchère, H.
- et Viard-Gaudin, C.
(a) ’2 + 2’ written with four strokes; (b) the symbol relation tree of ’2 + 2’; (c) the SLG of ’2 + 2’. The four strokes are indicated as s1, s2, s3, s4 in writing order. (ver.) and (hor.) are added to differentiate the vertical and the horizontal strokes for ’+’. ’R’ is for left-right relationship
(a) The time path (red) in SLG; (b) the time path; (c) the built SLG of ’2 + 2’, added edges are depicted as bold
(a) Peo written with four strokes; (b) the SRT of Peo; (c) r2h written with three strokes; (d) the SRT of r2h, the red edge cannot be generated by the time sequence of strokes
The illustration of on-paper points (blue) and in-air points (red) in time path, a1 + a2 written with 6 strokes
Local CTC forward-backward algorithm. Black circles represent labels and white circles represent blanks. Arrows signify allowed transitions. Forward variables are updated in the direction of the arrows, and backward variables are updated in the reverse direction
Illustration for the decision of the label of stroke. For being more readable, all the strokes are given the correct label except stroke 6
The illustration of (a) θi, φi and (b) ψi used in feature description. The points related to feature computation at pi are depicted in red
The symbol level evaluation results on CROHME 2014 test set, including the experiment results in this work and CROHME 2014 participant results (Top 4 by recall of Segments)
The expression level evaluation results on CROHME 2014 test set, including the experiment results in this work and CROHME 2014 participant results (Top 4)
(a) a ≥ b written with four strokes; (b) the built SLG of a ≥ b according to the recognition result, all labels are correct
(a) written with six strokes; (b) the ground-truth SLG; (c) the 44 rebuilt SLG according to the recognition result. Three edge errors occurred: the Right relation between stroke 2 and 4 was missed because there is no edge from stroke 2 to 4 in the time path; the edge from stroke 4 to 3 was missed for the same reason; the edge from stroke 2 to 3 was wrongly recognized and it should be labeled as NoRelation