Chinese character description languages

For a component to fit into its allotted portion of the whole character's square, it generally has to be scaled or reshaped. A set of fewer than 50 strokes allows one to construct approximately 1,000 components, which may in turn describe tens of thousands of characters.

[1] Chapter 18 of The Unicode Standard (version 15.0) defines the "Ideographic Description Sequences" (IDS) syntax, which describes a character in featural terms as an arrangement of components that have code points of their own.

For example, the sawndip character encoded in CJK Unified Ideographs Extension F as U+2DA21 𭨡 can be described as ⿰書史.
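The prefix structure of such a sequence can be illustrated with a short Python sketch. It assumes the twelve Ideographic Description Characters of Unicode 15.0 (U+2FF0 to U+2FFB), of which U+2FF2 and U+2FF3 take three operands and the others take two. The names `parse_ids` and `components` are hypothetical helpers chosen for this illustration, and the sketch does not attempt to enforce every rule of the standard.

```python
# Ideographic Description Characters in Unicode 15.0: U+2FF0..U+2FFB.
# Assumption of this sketch: U+2FF2 and U+2FF3 take three operands,
# every other operator in the range takes two.
OPERATORS = {chr(cp) for cp in range(0x2FF0, 0x2FFC)}
TRINARY = {"\u2ff2", "\u2ff3"}


def _parse(text, pos):
    """Parse one operand starting at text[pos]; return (node, next_pos)."""
    if pos >= len(text):
        raise ValueError("sequence ended while an operand was expected")
    ch = text[pos]
    pos += 1
    if ch not in OPERATORS:
        return ch, pos  # a plain component character
    arity = 3 if ch in TRINARY else 2
    operands = []
    for _ in range(arity):
        operand, pos = _parse(text, pos)
        operands.append(operand)
    return (ch, *operands), pos


def parse_ids(text):
    """Parse a prefix-notation IDS into nested tuples (operator, operand, ...)."""
    node, end = _parse(text, 0)
    if end != len(text):
        raise ValueError("trailing characters after a complete sequence")
    return node


def components(node):
    """Return the component characters of a parsed sequence, left to right."""
    if isinstance(node, str):
        return [node]
    result = []
    for child in node[1:]:
        result.extend(components(child))
    return result


tree = parse_ids("⿰書史")
print(tree)              # the operator followed by its two operands
print(components(tree))  # the components in reading order
```

Because operators may themselves appear as operands, the same function handles nested descriptions such as ⿱艹⿰日月, returning one tuple inside another.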

Unicode's specification for these sequences is based on the characters and syntax of the earlier GBK encoding.

The IDSgrep free software package by Matthew Skala[2][3] extends Unicode's IDS syntax with additional features for dictionary lookup. It can convert KanjiVG's database to its own extended IDS (EIDS) format, or search EIDS files generated by the related Tsukurimashou font family.