SCUT-SCC

SCUT Similar Chinese Character (SCUT-SCC) Dataset

Introduction of SCUT-SCC

SCUT-SCC is an open similar Chinese character dataset, which contains 2-similar-character set, 5-similar-character set, 10-similar-character set and complete-set. The 2/5/10-similar-character set each has 3755 similar-character subsets, while the complete set consists of 3083 similar-character subsets. It is worth noting that SCUT-SCC dataset is built on SCUT-COUCH GB1 dataset (http://www.hcii-lab.net/data/scutcouch/EN/introduction.html).

The SCUT-SCC dataset is available for similar Chinese character research. You can download the dataset by clicking the links below. Additionally, please contact us (Email: lianwen.jin@gmail.com, shuye.cheung@gmail.com) to ask passwords for uncompressing the files since the files have been encrypted.

※ 2-similar-character-set（194M）

※ 5-similar-character-set（486M）

※ 10-similar-character-set（973M）

※ complete-set (98M)

Contact:

Email: lianwen.jin@gmail.com, shuye.cheung@gmail.com

HCII> SCUT-SCC

SCUT Similar Chinese Character (SCUT-SCC) Dataset

Introduction of SCUT-SCC

Contact: