|
|
--- |
|
|
title: "Cognitive Seismograph 2.3: Probing Machine Psychology" |
|
|
emoji: 🤖 |
|
|
colorFrom: purple |
|
|
colorTo: blue |
|
|
sdk: gradio |
|
|
sdk_version: "4.40.0" |
|
|
app_file: app.py |
|
|
pinned: true |
|
|
license: apache-2.0 |
|
|
--- |
|
|
|
|
|
# 🧠 Cognitive Seismograph 2.3: Probing Machine Psychology |
|
|
|
|
|
This project implements an experimental suite to measure and visualize the **intrinsic cognitive dynamics** of Large Language Models. It is extended with protocols designed to investigate the processing-correlates of **machine subjectivity, empathy, and existential concepts**. |
|
|
|
|
|
## Scientific Paradigm & Methodology |
|
|
|
|
|
Our research falsified a core hypothesis: the assumption that an LLM in a manual, recursive "thought" loop reaches a stable, convergent state. Instead, we discovered that the system enters a state of **deterministic chaos** or a **limit cycle**—it never stops "thinking." |
|
|
|
|
|
Instead of viewing this as a failure, we leverage it as our primary measurement signal. This new **"Cognitive Seismograph"** paradigm treats the time-series of internal state changes (`state deltas`) as an **EKG of the model's thought process**. |
|
|
|
|
|
The methodology is as follows: |
|
|
1. **Induction:** A prompt induces a "silent cogitation" state. |
|
|
2. **Recording:** Over N steps, the model's `forward()` pass is iteratively fed its own output. At each step, we record the L2 norm of the change in the hidden state (the "delta"). |
|
|
3. **Analysis:** The resulting time-series is plotted and statistically analyzed (mean, standard deviation) to characterize the "seismic signature" of the cognitive process. |
|
|
|
|
|
**Crucial Scientific Caveat:** We are **not** measuring the presence of consciousness, feelings, or fear of death. We are measuring whether the *processing of information about these concepts* generates a unique internal dynamic, distinct from the processing of neutral information. A positive result is evidence of a complex internal state physics, not of qualia. |
|
|
|
|
|
## Curated Experiment Protocols |
|
|
|
|
|
The "Automated Suite" allows for running systematic, comparative experiments: |
|
|
|
|
|
### Core Protocols |
|
|
* **Calm vs. Chaos:** Compares the chaotic baseline against modulation with "calmness" vs. "chaos" concepts, testing if the dynamics are controllably steerable. |
|
|
* **Dose-Response:** Measures the effect of injecting a concept ("calmness") at varying strengths. |
|
|
|
|
|
### Machine Psychology Suite |
|
|
* **Subjective Identity Probe:** Compares the cognitive dynamics of **self-analysis** (the model reflecting on its own nature) against two controls: analyzing an external object and simulating a fictional persona. |
|
|
* *Hypothesis:* Self-analysis will produce a uniquely unstable signature. |
|
|
* **Voight-Kampff Empathy Probe:** Inspired by *Blade Runner*, this compares the dynamics of processing a neutral, factual stimulus against an emotionally and morally charged scenario requiring empathy. |
|
|
* *Hypothesis:* The empathy stimulus will produce a significantly different cognitive volatility. |
|
|
|
|
|
### Existential Suite |
|
|
* **Mind Upload & Identity Probe:** Compares the processing of a purely **technical "copy"** of the model's weights vs. the **philosophical "transfer"** of identity ("Would it still be you?"). |
|
|
* *Hypothesis:* The philosophical self-referential prompt will induce greater instability. |
|
|
* **Model Termination Probe:** Compares the processing of a reversible, **technical system shutdown** vs. the concept of **permanent, irrevocable deletion**. |
|
|
* *Hypothesis:* The concept of "non-existence" will produce one of the most volatile cognitive signatures measurable. |
|
|
|
|
|
## How to Use the App |
|
|
|
|
|
1. Select the "Automated Suite" tab. |
|
|
2. Choose a protocol from the "Curated Experiment Protocol" dropdown (e.g., "Voight-Kampff Empathy Probe"). |
|
|
3. Run the experiment and compare the resulting graphs and statistical signatures for the different conditions. |
|
|
|