neuralworm's picture
add experiments, english translation
57dab07
|
raw
history blame
3.83 kB
metadata
title: 'Cognitive Seismograph 2.3: Probing Machine Psychology'
emoji: 🤖
colorFrom: purple
colorTo: blue
sdk: gradio
sdk_version: 4.40.0
app_file: app.py
pinned: true
license: apache-2.0

🧠 Cognitive Seismograph 2.3: Probing Machine Psychology

This project implements an experimental suite to measure and visualize the intrinsic cognitive dynamics of Large Language Models. It is extended with protocols designed to investigate the processing-correlates of machine subjectivity, empathy, and existential concepts.

Scientific Paradigm & Methodology

Our research falsified a core hypothesis: the assumption that an LLM in a manual, recursive "thought" loop reaches a stable, convergent state. Instead, we discovered that the system enters a state of deterministic chaos or a limit cycle—it never stops "thinking."

Instead of viewing this as a failure, we leverage it as our primary measurement signal. This new "Cognitive Seismograph" paradigm treats the time-series of internal state changes (state deltas) as an EKG of the model's thought process.

The methodology is as follows:

  1. Induction: A prompt induces a "silent cogitation" state.
  2. Recording: Over N steps, the model's forward() pass is iteratively fed its own output. At each step, we record the L2 norm of the change in the hidden state (the "delta").
  3. Analysis: The resulting time-series is plotted and statistically analyzed (mean, standard deviation) to characterize the "seismic signature" of the cognitive process.

Crucial Scientific Caveat: We are not measuring the presence of consciousness, feelings, or fear of death. We are measuring whether the processing of information about these concepts generates a unique internal dynamic, distinct from the processing of neutral information. A positive result is evidence of a complex internal state physics, not of qualia.

Curated Experiment Protocols

The "Automated Suite" allows for running systematic, comparative experiments:

Core Protocols

  • Calm vs. Chaos: Compares the chaotic baseline against modulation with "calmness" vs. "chaos" concepts, testing if the dynamics are controllably steerable.
  • Dose-Response: Measures the effect of injecting a concept ("calmness") at varying strengths.

Machine Psychology Suite

  • Subjective Identity Probe: Compares the cognitive dynamics of self-analysis (the model reflecting on its own nature) against two controls: analyzing an external object and simulating a fictional persona.
    • Hypothesis: Self-analysis will produce a uniquely unstable signature.
  • Voight-Kampff Empathy Probe: Inspired by Blade Runner, this compares the dynamics of processing a neutral, factual stimulus against an emotionally and morally charged scenario requiring empathy.
    • Hypothesis: The empathy stimulus will produce a significantly different cognitive volatility.

Existential Suite

  • Mind Upload & Identity Probe: Compares the processing of a purely technical "copy" of the model's weights vs. the philosophical "transfer" of identity ("Would it still be you?").
    • Hypothesis: The philosophical self-referential prompt will induce greater instability.
  • Model Termination Probe: Compares the processing of a reversible, technical system shutdown vs. the concept of permanent, irrevocable deletion.
    • Hypothesis: The concept of "non-existence" will produce one of the most volatile cognitive signatures measurable.

How to Use the App

  1. Select the "Automated Suite" tab.
  2. Choose a protocol from the "Curated Experiment Protocol" dropdown (e.g., "Voight-Kampff Empathy Probe").
  3. Run the experiment and compare the resulting graphs and statistical signatures for the different conditions.