Draft:Talkie (language model)

Talkie
Talkie
Original authors	Nick Levine, David Duvenaud, Alec Radford
Release	April 2026
Available in	English
License	Apache license
Website	https://talkie-lm.com/

Review waiting, please be patient.

This may take 3 months or more, since drafts are reviewed in no specific order. There are 4,918 pending submissions waiting for review.

If the submission is accepted, then this page will be moved into the article space.
If the submission is declined, then the reason will be posted here.
In the meantime, you can continue to improve this submission by editing normally.

Where to get help

If you need help editing or submitting your draft, please ask us a question at the AfC Help Desk or get live help from experienced editors. These venues are only for help with editing and the submission process, not to get reviews.
If you need feedback on your draft, or if the review is taking a lot of time, you can try asking for help on the talk page of a relevant WikiProject. Some WikiProjects are more active than others so a speedy reply is not guaranteed.

How to improve a draft

Wikipedia:Contributing to Wikipedia – a basic overview on how to edit Wikipedia.
Help:Wikitext – how to use the markup
Help:Referencing for beginners – how to include references
Wikipedia:Article development – how to develop your article
Wikipedia:Writing better articles – how to improve your article
Wikipedia:Verifiability – make sure your article includes reliable third-party sources

You can also browse Wikipedia:Featured articles and Wikipedia:Good articles to find examples of Wikipedia's best writing on topics similar to your proposed article.

Improving your odds of a speedy review

To improve your odds of a faster review, tag your draft with relevant WikiProject tags using the button below. This will let reviewers know a new draft has been submitted in their area of interest. For instance, if you wrote about a female astronomer, you would want to add the Biography, Astronomy, and Women scientists tags.

Add tags to your draft

Editor resources

Easy tools: Citation bot (help) | Advanced: Fix bare URLs

Reviewer tools

Instructions · What links here · Talkie (language model) (talk: + · bio) · (log) · Copyvios report · reFill · Citation Bot · (Search: Google, Wikipedia) · Submitted 24 days ago by Tarkowski (talk: D · +) · Last edited 30 hours ago by Tarkowski

Submission declined on 20 May 2026 by Commandant Quacks-a-lot (talk).

This draft's references do not show that the subject meets Wikipedia's criteria for inclusion. The draft requires multiple published secondary sources that:

provide significant coverage: discuss the subject in detail, not just brief mentions or routine announcements;
are reliable: from reputable outlets with editorial oversight;
are independent: not connected to the subject, such as interviews, press releases, the subject's own website, or sponsored content.

Please add references that meet all three of these criteria. If none exist, the subject is not yet suitable for Wikipedia.

If you would like to continue working on the submission, click on the "Edit" tab at the top of the window.
If you have not resolved the issues listed above, your draft will be declined again and potentially deleted.
If you need extra help, please ask us a question at the AfC Help Desk or get live help from experienced editors.
Please do not remove reviewer comments or this notice until the submission is accepted.

Where to get help

If you need help editing or submitting your draft, please ask us a question at the AfC Help Desk or get live help from experienced editors. These venues are only for help with editing and the submission process, not to get reviews.
If you need feedback on your draft, or if the review is taking a lot of time, you can try asking for help on the talk page of a relevant WikiProject. Some WikiProjects are more active than others so a speedy reply is not guaranteed.

How to improve a draft

Wikipedia:Contributing to Wikipedia – a basic overview on how to edit Wikipedia.
Help:Wikitext – how to use the markup
Help:Referencing for beginners – how to include references
Wikipedia:Article development – how to develop your article
Wikipedia:Writing better articles – how to improve your article
Wikipedia:Verifiability – make sure your article includes reliable third-party sources

You can also browse Wikipedia:Featured articles and Wikipedia:Good articles to find examples of Wikipedia's best writing on topics similar to your proposed article.

Improving your odds of a speedy review

To improve your odds of a faster review, tag your draft with relevant WikiProject tags using the button below. This will let reviewers know a new draft has been submitted in their area of interest. For instance, if you wrote about a female astronomer, you would want to add the Biography, Astronomy, and Women scientists tags.

Add tags to your draft

Editor resources

Easy tools: Citation bot (help) | Advanced: Fix bare URLs

Declined by Commandant Quacks-a-lot 38 days ago. Last edited by Tarkowski 30 hours ago. Reviewer: Inform author.

This draft has been resubmitted and is currently awaiting re-review.

Comment: Looks interesting, but you need more independent sources. Commandant Quacks-a-lot (talk) 01:56, 20 May 2026 (UTC)

Talkie is a small language model developed by Nick Levine, David Duvenaud, and Alec Radford. It was announced in April 2026 and described by the developers as a vintage language model. Talkie is trained solely on pre-1931 texts that are in the Public Domain,^[1] in order to reduce legal issues and liability with releasing model data.^[2] The model family consists of a 13 billion parameter model called talkie-1930-13b-base and a post-trained checkpoint designed to power a chat interface, called talkie-1930-13b-it.^[3]

Development

The initial idea was to build a model trained on historical data that could be used to explore whether models can forecast future events: what is predictable, and how far out events can be predicted.^[2] The model was also developed in order to study cultural change, and model self-conception.^[4]^[3] Another goal expressed by authors is testing whether language models can arrive at inventions or scientific discoveries. The authors cite a thought experiment proposed by Demis Hassabis, who asked whether a model trained on data up to 1911 could independently discover Albert Einstein's General Relativity theory.^[5]

The term vintage language model is attributed to Owain Evans and describes language models trained only on historical text. The purpose of such models is to simulate language use from the past, and to study behavior of models not contaminated by contemporary content. Other vintage models include Ranke 4B, Mr Chatterbox or Machina Mirabilis. Talkie is also inspired by Calcifer Computing’s work on Temporal Language Models, able to represent temporal trends in language.^[6]

The model was trained on 260 billion tokens of pre-1931 English text from sources like the Institutional Data Initiative, Common Pile or the Internet Archive. The 31 December 1930 cutoff is based on copyright term rules in the United States, where works published between 1923-1977 are protected for 95 years. The data includes books, newspapers, periodicals, scientific journals, patents, and case law. The developers experimented with various optical character recognition (OCR) methods and developed a dedicated vintage OCR system. The compute needed to train the model was provided by Anthropic.^[7]

One of the main challenges with building vintage model is contamination of the model by anachronistic data from beyond the cutoff data. This is typically due to incorrect metadata, or editing notes added to the text. Because of this, talkie is for example aware of Franklin Delano Roosevelt and Adolf Hitler.^[2]

A dedicated post-training pipeline was developed in order to fine tune a chatbot based on the base model. For this purpose, only historical structured texts, such as etiquette manuals, letter-writing manuals, cookbooks, dictionaries, and encyclopedias, were used, to avoid contamination of the model. Nevertheless, due to the fact that the model was also fine-tuned through synthetic chats with a Claude Opus model, some anachronisms were introduced.^[1]

Reception

Multiple outlets used the available demo of the Talkie chatbot to test the model's worldview. For example, Decrypt's Jose Antonio Lanz reported on Talkie's analysis of Adolf Hitler.^[7] Both the Decoder and The Register noted that the model performs worse than modern counterparts on standard benchmarks like HumanEval. The limitations of the model have been acknowledged by the research team.

Novelist Robin Sloan praised the Talkie project as "a triumph", distinguishing it from earlier vintage language model experiments. He engaged with the Hassabis thought experiment and proposed an alternative, simpler benchmark. Instead of General Relativity, he considered whether the model could arrive at Claude Shannon's insight about mapping electric circuits to Boolean logic. Sloan tested this with Talkie, which denied any such correspondence exists.^[8]

Simon Willison describes the base talkie model as a "vegan" model – one trained entirely on licensed or out-of-copyright data.^[9] The chat model does not qualify, because of the above mentioned contamination during fine tuning, when proprietary synthetic data generated by Claude Opus was introduced.

References

^ ^a ^b "Introducing talkie: a 13B vintage language model from 1930". talkie-lm.com. Retrieved 2026-05-19.
^ ^a ^b ^c Roose, Kevin; Newton, Casey; Jones, Whitney; Cohn, Rachel; Pavic, Vjeran; Ramirez, Daniel; Powell, Dan; Lozano, Marion; Niemisto, Rowan (2026-05-01). "OpenAI's Big Reset + A.I. in the Doctor's Office + Talkie, a pre-1930s LLM". The New York Times. ISSN 0362-4331. Retrieved 2026-05-30.
^ ^a ^b Vigliarolo, Brandon (2026-04-28). "Vintage chatbot lives in the past like an elderly relative". theregister. Retrieved 2026-05-30.
^ Bastian, Matthias (2026-04-28). "Here is what an LLM that knows nothing after 1930 thinks our world looks like in 2026". The Decoder. Retrieved 2026-06-02.
^ IndiaAI (2026-02-18). AI Research Symposium: The Next Frontiers | Keynotes by Demis Hassabis, Yoshua Bengio & Yann LeCun. Retrieved 2026-05-30 – via YouTube.
^ "CalCo". www.calcifercomputing.com. Retrieved 2026-05-19.
^ ^a ^b Lanz, Decrypt / Jose Antonio (2026-04-29). "This AI Was Trained Only on Pre-1930 Text. We Asked It About Hitler, Stocks, and the Future". Decrypt. Retrieved 2026-05-30.
^ "Talkie and Claude (no, the other one)". Robin Sloan. Retrieved 2026-06-25.
^ Willison, Simon. "Introducing talkie: a 13B vintage language model from 1930". Simon Willison’s Weblog. Retrieved 2026-06-02.

External links

Official website
Talkie-powered chat interface
Talkie project repository on GitHub
Talkie project repository on HuggingFace

[:0-1] "Introducing talkie: a 13B vintage language model from 1930". talkie-lm.com. Retrieved 2026-05-19.

[:1-2] Roose, Kevin; Newton, Casey; Jones, Whitney; Cohn, Rachel; Pavic, Vjeran; Ramirez, Daniel; Powell, Dan; Lozano, Marion; Niemisto, Rowan (2026-05-01). "OpenAI's Big Reset + A.I. in the Doctor's Office + Talkie, a pre-1930s LLM". The New York Times. ISSN 0362-4331. Retrieved 2026-05-30.

[:2-3] Vigliarolo, Brandon (2026-04-28). "Vintage chatbot lives in the past like an elderly relative". theregister. Retrieved 2026-05-30.

[4] Bastian, Matthias (2026-04-28). "Here is what an LLM that knows nothing after 1930 thinks our world looks like in 2026". The Decoder. Retrieved 2026-06-02.

[5] IndiaAI (2026-02-18). AI Research Symposium: The Next Frontiers | Keynotes by Demis Hassabis, Yoshua Bengio & Yann LeCun. Retrieved 2026-05-30 – via YouTube.

[6] "CalCo". www.calcifercomputing.com. Retrieved 2026-05-19.

[:3-7] Lanz, Decrypt / Jose Antonio (2026-04-29). "This AI Was Trained Only on Pre-1930 Text. We Asked It About Hitler, Stocks, and the Future". Decrypt. Retrieved 2026-05-30.

[8] "Talkie and Claude (no, the other one)". Robin Sloan. Retrieved 2026-06-25.

[9] Willison, Simon. "Introducing talkie: a 13B vintage language model from 1930". Simon Willison’s Weblog. Retrieved 2026-06-02.

[1]

[2]

[3]

[4]

[5]

[6]

[7]

[8]

[9]