Open Source

TimeCapsuleLLM

A LLM trained only on data from certain time periods to reduce modern bias.

Source: GitHub Pricing: Open Source

💻 View Code

Project snapshot

TimeCapsuleLLM

Tech stack

Python

About This Project

A language model trained from scratch exclusively on data from certain places and time periods to reduce modern bias and emulate the voice, vocabulary, and worldview of the era. Focuses on curating historical data and building a custom tokenizer.

Installation & Setup

1. Gather and Prepare Historical Texts (.txt files). 2. Build a Custom Tokenizer (run train_tokenizer.py). 3. Train Your Model (refer to nanoGPT).

Reviews & Ratings

Share your experience

User Reviews (0)

No reviews yet. Be the first to share your experience!

TimeCapsuleLLM

About This Project

Tags

Installation & Setup

Reviews & Ratings

Share your experience

User Reviews (0)

Share TimeCapsuleLLM