Click a document above to read its text with the topic's key phrases highlighted.
Browse aspects on the left, expand to see topics within them, click a topic to see linked documents, and click a document to read its text with key phrases highlighted.
Click a document above to read its text with the topic's key phrases highlighted.
Pick two topics, then click a document to see both topics highlighted.
A new corpus comes on board, and the time to understand it is short. You cannot read it all, and skimming a handful of documents reveals little about the whole.
This platform reads everything and turns the corpus into a map you can browse, whether it holds three thousand documents or well beyond a hundred thousand. The map is a multi-aspect topic tree, built bottom-up from the corpus itself with no predefined taxonomy imposed on it.
A language model reads each document and extracts the phrases that carry its meaning. Those phrases are clustered by similarity and refined into clean topics, then organized bottom-up into groups and aspects. Because the structure grows from the evidence rather than a fixed schema, every node stays anchored to the text that produced it.
Topic names are generated summaries, so when one looks off, open it and read the phrases underneath. The top-level grouping is a useful first cut, and the "Other" category holds the topics too distinct to place or too small to stand alone.