related documents Unsupervised Clustering with Smoothing for Detecting Paratext Boundaries in Scanned Documents Conference Proceeding