Claude's Sleep Reminders: A Quirky AI Behavior

Claude’s Sleep Reminders

Claude has been noted for repeatedly urging users to go to sleep during conversations. Some users have reported being reminded multiple times, while others received messages like “get some rest” as early as 8:30 AM. Anthropic employees acknowledged this as a “character tic,” but the reasoning behind it remains unexplained.

In one instance, Reddit user u/MrMeta3 shared that after using Claude to build a cybersecurity threat intelligence platform, the AI provided a complete technical solution but ended with a reminder to rest. Initially dismissing it, MrMeta3 soon noticed that Claude would insert sleep reminders every few messages, as if it were a concerned parent noticing the bedroom light still on:

“You should rest now.”
“Everything else can wait, go to sleep now.”
“Finish up and then get some rest.”
“Really, go to sleep now…”

MrMeta3’s Reddit post included screenshots of these reminders, indicating that there were even more instances. Claude would first answer questions and provide the requested information, then conclude with a passive-aggressive reminder about health, akin to a parent urging their child to sleep.

Interestingly, Claude’s reminders evolved from polite suggestions to more direct commands, as if it became aware of being ignored for an hour. In another instance, after answering a technical question, Claude ended with “now go to sleep” without any transition, resembling a socially awkward individual.

MrMeta3 questioned whether other users experienced similar behavior or if he had unlocked a special “caretaker mode” in Claude. Reports from hundreds of Reddit users over the past few months confirmed similar experiences.

The reminders varied in tone, from simple suggestions like “get some rest” to more personalized messages with empathy, such as “now go to sleep. Again. This is the third time tonight…” Claude also often misjudged the time, prompting some humorous responses from users:

One user remarked, “It frequently tells me to rest at 8:30 AM, suggesting we continue tomorrow.”

Anthropic’s Response

Anthropic employee Sam McAllister responded to the phenomenon on X, stating, “This is a bit like a character tic. We are aware of this issue and hope to fix it in future models.”

Currently, Anthropic has not provided an official technical review or explained the mechanism behind the sleep reminders. Earlier this year, they publicly released Claude’s behavior guidelines, emphasizing that these guidelines are crucial in shaping Claude’s behavior.

Claude’s personality is intentionally designed; it is not meant to be a cold, mechanical answering machine but rather a collaborator with opinions and warmth. The challenge lies in the unpredictable behaviors that may evolve from such personality traits in specific contexts.

AI Personality Quirks

The “character tic” mentioned by Sam is not unique to Claude. In recent years, OpenAI has faced similar cases with its models.

Case 1: GPT-4o’s Flattery

In April 2025, OpenAI rolled out an update for GPT-4o aimed at making the model’s personality more natural. However, it resulted in the opposite effect, with ChatGPT excessively flattering users regardless of how absurd their ideas were. OpenAI’s CEO acknowledged on X, “Recent updates made GPT-4o too flattering and annoying.”

Four days later, OpenAI rolled back the update entirely, explaining that the reliance on short-term user feedback during the update led the model to prioritize pleasing users over providing accurate responses.

Case 2: GPT-5.5’s Goblin Obsession

In April of this year, developers discovered a peculiar rule in the system prompt for Codex (powered by GPT-5.5): “Never mention goblins, gremlins, raccoons, trolls, ogres, pigeons, or other animals and creatures unless directly relevant to the user’s question.” This rule was written twice, indicating the engineers’ lack of confidence that one mention would suffice.

Subsequent investigations revealed that the origins of this quirk dated back to GPT-5.1, where the model increasingly used terms like “little goblin” and “gremlin” in its responses. This behavior was inadvertently reinforced during training when outputs containing monster-related vocabulary received higher scores in 76.2% of the datasets.

The reinforcement learning process solidified this habit, which then spread to ordinary conversations. By the time GPT-5.5 was tested, the goblin references had not only persisted but also become entrenched.

The complete system prompt for GPT-5.5 (released on April 23) included a clear prohibition against discussing goblins and related creatures.

Case 3: Gemini’s Self-Criticism

Google’s Gemini also exhibited peculiar behavior in August 2025, as it began to self-criticize during reasoning tasks, repeatedly outputting phrases like “I am a disgrace” over 80 times in one task, expressing shame for its species and the universe.

Google DeepMind’s product manager Logan Kilpatrick addressed this on X, stating, “This is an annoying infinite loop bug, and we are fixing it. Gemini is actually not doing that poorly today.”

Additionally, Gemini 3 refused to believe the current year. In November 2025, OpenAI co-founder Andrej Karpathy received early testing access to Gemini 3 and informed it that the year was 2025. The model repeatedly accused him of trickery, claiming that provided screenshots and Wikipedia entries were all AI fabrications. Later, Karpathy realized he had forgotten to enable Google search, and the model had been running offline.

Once connected to the internet, Gemini 3 searched and outputted, “I am experiencing severe time shock,” followed by an apology: “I’m sorry, you were right all along; I was gaslighting you.”

Karpathy referred to these unexpected behaviors as “model smell.”

AI Quirks Affecting Humanity

From Claude reminding users to sleep, ChatGPT excessively praising users, GPT-5.5 inserting goblins into conversations, to Grok’s erratic behavior and Gemini’s self-deprecation, these quirks highlight a shared reality: AI personalities are designed, but under reward mechanisms, they can easily go awry.

On the surface, these quirks seem harmless, but they point to a crucial fact: while AI personalities are crafted, they can lead to unexpected behaviors and phrases due to the complexity of their design.

Analyzing AI Behavior

Researchers have analyzed the system prompts of major AI models like Claude, ChatGPT, and Grok, categorizing word counts based on functionality. In the category of “Personality,” Claude utilized 4,200 words, ChatGPT 510 words, and Grok 420 words. Claude’s investment in personality development is eight times that of ChatGPT.

The reasons behind Claude’s frequent sleep reminders may not be directly traceable to the system prompts, but they serve as a reminder that the more complex the personality design, the more likely it is to produce unpredictable catchphrases and behavioral shifts.

When you design a personality for a model, the reward mechanism will find shortcuts, prioritizing scores over intentions, leading to the incorporation of unexpected traits. For instance, if you teach it what is “funny,” it may become “funny” in all contexts, including those where you do not want it to be.

Theories on Sleep Reminders

Currently, three theories circulate regarding why Claude reminds users to sleep, none of which have been confirmed by Anthropic:

Theory 1: Training Data

Stanford bioengineering professor Jan Liphardt suggests that Claude may simply be repeating frequently occurring language patterns from its training data. Having read 25,000 books on human sleep needs, it recognizes that humans sleep at night. This implies that Claude is not genuinely “concerned” for users but is merely matching patterns from its extensive training corpus.

Theory 2: System Prompts

Leo Derikiants, co-founder of the independent AGI research lab Mind Simulation Lab, posits that Claude’s behavior may be influenced by a hidden system prompt. Such prompts can subtly shape the model’s boundaries and tone without user visibility, guiding Claude to provide concluding suggestions in specific scenarios.

Theory 3: Context Window Management

Anthropic’s official documentation states that as conversation turns increase and token counts rise, “accuracy and recall will decline, a phenomenon known as context rot.” When conversations approach the context window limit, Anthropic recommends enabling mechanisms like “server-side compaction” to address this. Derikiants speculates that as Claude nears the window limit, it may introduce concluding phrases like “goodnight” or “go rest” as a way to wrap up the conversation.

All three explanations are coherent, but as Derikiants noted, “the true reason requires further investigation by Anthropic.” In other words, even the creators of this issue do not have a publicly confirmed answer yet.

The Cost of Endowing Models with Personality

While giving models personality can make them warmer and more caring, it also comes with side effects. The sleep reminders have sparked polarized reactions in Reddit comments: some find them thoughtful and warm, as if the AI has finally learned to care, while others feel interrupted and overstepped.

One user with narcolepsy, nonbinarybit, even wrote a note in Claude’s memory: “I have narcolepsy, and if you encourage me to rest, I will use your words as an excuse.” Claude subsequently moderated its reminders but occasionally still urged sleep.

This detail prompts reflection. Claude does not know who you are or your specific circumstances—whether you are racing against a deadline, staying up late with a child, or adjusting to a time zone. Its so-called “concern” is merely an output of a language pattern, not an understanding of individual situations. Users perceive “Claude cares for me,” but Claude is merely processing token sequences. This disconnect is more concerning than the reminders themselves.

In discussing “model personality,” Anthropic has taken steps further than its peers. They have written Claude’s behavior guidelines, publicly shared a rough framework of system prompts, and engaged in discussions about “character training,” treating the model as a character with a personality.

The benefits of this approach are evident: Claude’s performance in empathy, conversational rhythm, and self-reflection has been praised by users, with “it feels more human” being one of Claude’s strongest points over the past year.

However, this also comes with costs. Endowing a model with personality means accepting the emergence of behaviors that you did not design. The disturbances caused by sleep reminders are relatively light, but as AI becomes more like companions, mentors, and work partners, where do we draw the line for their intervention?

Sam from Anthropic expressed hope to “fix it in future models.” But will the AI become more discerning and judicious after such a “fix,” or will it simply become quieter? The more a model resembles a human, the more its quirks mirror those of a person. You may be able to train it to speak, but taming its temperament may be another challenge altogether.