🗜️ Echo-DSRN-114M Semantic Compressor Demo (CPU)

This demo highlights the O(1) memory footprint and CPU execution of the Echo-DSRN architecture.

⚠️ Note for Public Space: To prevent OOM crashes on the shared CPU tier, text inputs and document uploads are strictly limited to 50,000 characters per request.

What can you do with this .npy file?

  • Semantic Clustering: Load multiple state vectors in Python (np.load()) and cluster them (e.g., K-Means) to group similar documents.
  • RAG Pre-filtering: Use the vectors to perform Cosine Similarity searches across multiple documents.
  • Cross-Attention Memory: Treat this vector as a compressed "Gist" and feed it into a larger model (e.g., 7B) via cross-attention.
  • Style Mimicry: Train a small linear classifier on top of these vectors to detect the author's writing style or sentiment.