Suffix Array Substring deduplication

#research #deduplication

Suffix Array deduplication

Progress

TABLE
	Identifier AS "Identifier",
	Language AS "Language",
	row["Physical Size"] AS "Physical Size",
	row["Total Text Size"] AS "Total Text Size (bytes)",
	row["Substring Length Threshold"] AS "Substring Length Threshold",
	row["Substring Duplicate Size"] AS "Substring Duplicate Size (bytes)"
FROM #deduplication AND #projectnotes 
SORT Identifier

^447e08

Goals

Goals
Links to this page