Open Source

TimeCapsuleLLM

A LLM trained only on data from certain time periods to reduce modern bias.

Source: GitHub Pricing: Open Source
💻 View Code

About This Project

A language model trained from scratch exclusively on data from certain places and time periods to reduce modern bias and emulate the voice, vocabulary, and worldview of the era. Focuses on curating historical data and building a custom tokenizer.

Tags

history LLM Machine Learning

Installation & Setup

1. Gather and Prepare Historical Texts (.txt files). 2. Build a Custom Tokenizer (run train_tokenizer.py). 3. Train Your Model (refer to nanoGPT).

Reviews & Ratings

Share your experience

User Reviews (0)

No reviews yet. Be the first to share your experience!