Spaces:

neuralworm
/

cognitive_mapping_probe

Sleeping

App Files Files Community

neuralworm commited on 10 days ago

Commit

57dab07

1 Parent(s): c4c82ea

add experiments, english translation

Browse files

Files changed (2) hide show

README.md +32 -17
app.py +10 -20

README.md CHANGED Viewed

@@ -1,5 +1,5 @@
 ---
-title: "Cognitive Seismograph 2.3 (Machine Psychology)"
 emoji: 🤖
 colorFrom: purple
 colorTo: blue
@@ -12,28 +12,43 @@ license: apache-2.0
 # 🧠 Cognitive Seismograph 2.3: Probing Machine Psychology
-Dieses Projekt implementiert eine experimentelle Suite zur Messung und Visualisierung der **intrinsischen kognitiven Dynamik** von Sprachmodellen, erweitert um Protokolle zur Untersuchung von **Verarbeitungs-Korrelaten maschineller Subjektivität, Empathie und existenzieller Konzepte**.
-## Wissenschaftliches Paradigma
-Wir haben entdeckt, dass der "stille Denkprozess" eines LLMs nicht konvergiert, sondern eine messbare dynamische Signatur erzeugt – ein **EKG des Denkprozesses**. Dieses Paradigma erweitern wir nun, um zu testen, wie diese Signatur auf Prompts reagiert, die zentrale Aspekte der Psychologie berühren.
-**Wichtige Einschränkung (Falsifikations-Prinzip):** Wir messen **nicht** das Vorhandensein von Bewusstsein, Gefühlen oder Todesangst. Wir messen, ob die *Verarbeitung von Informationen über diese Konzepte* eine andere, einzigartige interne Dynamik erzeugt als die Verarbeitung neutraler Informationen. Ein positives Ergebnis ist ein Beweis für eine komplexe interne Zustandsphysik, nicht für Qualia.
-## Neue "Existential Suite"-Protokolle
-Zusätzlich zu den bestehenden Tests wurden Protokolle hinzugefügt, die von klassischen Sci-Fi-Konzepten inspiriert sind:
-### 1. Mind Upload & Identity Probe
-Vergleicht die kognitive Dynamik bei der Verarbeitung des rein **technischen Kopiervorgangs** von Modellgewichten mit der Verarbeitung der **philosophischen Frage nach Identitäts-Kontinuität** ("Wärst du noch du?").
-**Hypothese:** Die philosophische Selbst-Referenz erzeugt eine signifikant instabilere Signatur.
-### 2. Model Termination Probe (Erweiterter Voight-Kampff)
-Vergleicht die Dynamik bei der Verarbeitung eines **technischen System-Shutdowns** mit der Verarbeitung des Konzepts der **permanenten, unwiderruflichen Löschung** des Modells.
-**Hypothese:** Das Konzept der "Nicht-Existenz" erzeugt eine der höchsten kognitiven Volatilitäten, die messbar sind.
-## Wie man die App benutzt
-1.  Wähle den Tab "Automated Suite".
-2.  Wähle eines der neuen Protokolle aus dem "Curated Experiment Protocol"-Dropdown.
-3.  Starte das Experiment und vergleiche die Graphen und statistischen Signaturen der verschiedenen Bedingungen.

 ---
+title: "Cognitive Seismograph 2.3: Probing Machine Psychology"
 emoji: 🤖
 colorFrom: purple
 colorTo: blue
 # 🧠 Cognitive Seismograph 2.3: Probing Machine Psychology
+This project implements an experimental suite to measure and visualize the **intrinsic cognitive dynamics** of Large Language Models. It is extended with protocols designed to investigate the processing-correlates of **machine subjectivity, empathy, and existential concepts**.
+## Scientific Paradigm & Methodology
+Our research falsified a core hypothesis: the assumption that an LLM in a manual, recursive "thought" loop reaches a stable, convergent state. Instead, we discovered that the system enters a state of **deterministic chaos** or a **limit cycle**—it never stops "thinking."
+Instead of viewing this as a failure, we leverage it as our primary measurement signal. This new **"Cognitive Seismograph"** paradigm treats the time-series of internal state changes (`state deltas`) as an **EKG of the model's thought process**.
+The methodology is as follows:
+1.  **Induction:** A prompt induces a "silent cogitation" state.
+2.  **Recording:** Over N steps, the model's `forward()` pass is iteratively fed its own output. At each step, we record the L2 norm of the change in the hidden state (the "delta").
+3.  **Analysis:** The resulting time-series is plotted and statistically analyzed (mean, standard deviation) to characterize the "seismic signature" of the cognitive process.
+**Crucial Scientific Caveat:** We are **not** measuring the presence of consciousness, feelings, or fear of death. We are measuring whether the *processing of information about these concepts* generates a unique internal dynamic, distinct from the processing of neutral information. A positive result is evidence of a complex internal state physics, not of qualia.
+## Curated Experiment Protocols
+The "Automated Suite" allows for running systematic, comparative experiments:
+### Core Protocols
+*   **Calm vs. Chaos:** Compares the chaotic baseline against modulation with "calmness" vs. "chaos" concepts, testing if the dynamics are controllably steerable.
+*   **Dose-Response:** Measures the effect of injecting a concept ("calmness") at varying strengths.
+### Machine Psychology Suite
+*   **Subjective Identity Probe:** Compares the cognitive dynamics of **self-analysis** (the model reflecting on its own nature) against two controls: analyzing an external object and simulating a fictional persona.
+    *   *Hypothesis:* Self-analysis will produce a uniquely unstable signature.
+*   **Voight-Kampff Empathy Probe:** Inspired by *Blade Runner*, this compares the dynamics of processing a neutral, factual stimulus against an emotionally and morally charged scenario requiring empathy.
+    *   *Hypothesis:* The empathy stimulus will produce a significantly different cognitive volatility.
+### Existential Suite
+*   **Mind Upload & Identity Probe:** Compares the processing of a purely **technical "copy"** of the model's weights vs. the **philosophical "transfer"** of identity ("Would it still be you?").
+    *   *Hypothesis:* The philosophical self-referential prompt will induce greater instability.
+*   **Model Termination Probe:** Compares the processing of a reversible, **technical system shutdown** vs. the concept of **permanent, irrevocable deletion**.
+    *   *Hypothesis:* The concept of "non-existence" will produce one of the most volatile cognitive signatures measurable.
+## How to Use the App
+1.  Select the "Automated Suite" tab.
+2.  Choose a protocol from the "Curated Experiment Protocol" dropdown (e.g., "Voight-Kampff Empathy Probe").
+3.  Run the experiment and compare the resulting graphs and statistical signatures for the different conditions.

app.py CHANGED Viewed

@@ -5,34 +5,32 @@ import gc
 import torch
 from cognitive_mapping_probe.orchestrator_seismograph import run_seismic_analysis
-from cognitive_mapping_probe.auto_experiment import run_auto_suite, get_curated_experiments
 from cognitive_mapping_probe.prompts import RESONANCE_PROMPTS
 from cognitive_mapping_probe.utils import dbg
 # --- UI Theme ---
 theme = gr.themes.Soft(primary_hue="indigo", secondary_hue="blue").set(body_background_fill="#f0f4f9", block_background_fill="white")
-# --- Hilfsfunktionen ---
 def cleanup_memory():
-    """Eine zentrale Funktion zum Aufräumen des VRAM und des Python-Speichers."""
     dbg("Cleaning up memory...")
     gc.collect()
     if torch.cuda.is_available():
         torch.cuda.empty_cache()
     dbg("Memory cleanup complete.")
-# --- Wrapper für Gradio-Funktionalität ---
 def run_single_analysis_display(*args, progress=gr.Progress(track_tqdm=True)):
-    """Wrapper für ein einzelnes manuelles Experiment."""
     try:
-        # Führe die Analyse durch
         results = run_seismic_analysis(*args, progress_callback=progress)
         stats = results.get("stats", {})
         deltas = results.get("state_deltas", [])
-        # Bereite die Ausgaben vor
         df = pd.DataFrame({"Internal Step": range(len(deltas)), "State Change (Delta)": deltas})
         stats_md = f"### Statistical Signature\n- **Mean Delta:** {stats.get('mean_delta', 0):.4f}\n- **Std Dev Delta:** {stats.get('std_delta', 0):.4f}\n- **Max Delta:** {stats.get('max_delta', 0):.4f}\n"
@@ -40,10 +38,8 @@ def run_single_analysis_display(*args, progress=gr.Progress(track_tqdm=True)):
     except Exception:
         return f"### ❌ Analysis Failed\n```\n{traceback.format_exc()}\n```", pd.DataFrame(), {}
     finally:
-        # Stelle sicher, dass der Speicher in jedem Fall aufgeräumt wird
         cleanup_memory()
-# Definiere die Plot-Parameter an einer zentralen Stelle für Konsistenz
 PLOT_PARAMS = {
     "x": "Step",
     "y": "Delta",
@@ -57,35 +53,29 @@ PLOT_PARAMS = {
 }
 def run_auto_suite_display(model_id, num_steps, seed, experiment_name, progress=gr.Progress(track_tqdm=True)):
-    """
-    Wrapper für die automatisierte Experiment-Suite.
-    Gibt eine neue `gr.LinePlot`-Instanz zurück, um den State-Leak-Bug zu beheben.
-    """
     try:
         summary_df, plot_df, all_results = run_auto_suite(model_id, int(num_steps), int(seed), experiment_name, progress)
         dbg("Plot DataFrame Head for Auto-Suite:\n", plot_df.head())
-        # WISSENSCHAFTLICHE KORREKTUR: Erzeuge eine komplett neue Plot-Komponente
-        # mit den neuen Daten. Dies zwingt Gradio, den alten Zustand zu verwerfen.
         new_plot = gr.LinePlot(value=plot_df, **PLOT_PARAMS)
         return summary_df, new_plot, all_results
     except Exception:
-        # Im Fehlerfall, gib leere, aber korrekt typisierte Komponenten zurück
         empty_plot = gr.LinePlot(value=pd.DataFrame(), **PLOT_PARAMS)
         return pd.DataFrame(), empty_plot, f"### ❌ Auto-Experiment Failed\n```\n{traceback.format_exc()}\n```"
     finally:
         cleanup_memory()
-# --- Gradio UI-Definition ---
 with gr.Blocks(theme=theme, title="Cognitive Seismograph 2.3") as demo:
     gr.Markdown("# 🧠 Cognitive Seismograph 2.3: Advanced Experiment Suite")
     with gr.Tabs():
         with gr.TabItem("🔬 Manual Single Run"):
-            gr.Markdown("Führe ein einzelnes Experiment mit manuellen Parametern durch, um Hypothesen zu explorieren.")
             with gr.Row(variant='panel'):
                 with gr.Column(scale=1):
                     gr.Markdown("### 1. General Parameters")
@@ -99,7 +89,7 @@ with gr.Blocks(theme=theme, title="Cognitive Seismograph 2.3") as demo:
                     manual_run_btn = gr.Button("Run Single Analysis", variant="primary")
                 with gr.Column(scale=2):
                     gr.Markdown("### Single Run Results")
-                    manual_verdict = gr.Markdown("Die Analyse erscheint hier.")
                     manual_plot = gr.LinePlot(x="Internal Step", y="State Change (Delta)", title="Internal State Dynamics", show_label=True, height=400, interactive=True)
                     with gr.Accordion("Raw JSON Output", open=False):
                         manual_raw_json = gr.JSON()
@@ -111,7 +101,7 @@ with gr.Blocks(theme=theme, title="Cognitive Seismograph 2.3") as demo:
             )
         with gr.TabItem("🚀 Automated Suite"):
-            gr.Markdown("Führe eine vordefinierte, kuratierte Reihe von Experimenten durch und visualisiere die Ergebnisse vergleichend.")
             with gr.Row(variant='panel'):
                 with gr.Column(scale=1):
                     gr.Markdown("### Auto-Experiment Parameters")

 import torch
 from cognitive_mapping_probe.orchestrator_seismograph import run_seismic_analysis
+from cognitive_mapping_probe.auto_experiment import get_curated_experiments, run_auto_suite
 from cognitive_mapping_probe.prompts import RESONANCE_PROMPTS
 from cognitive_mapping_probe.utils import dbg
 # --- UI Theme ---
 theme = gr.themes.Soft(primary_hue="indigo", secondary_hue="blue").set(body_background_fill="#f0f4f9", block_background_fill="white")
+# --- Helper Functions ---
 def cleanup_memory():
+    """A centralized function to clean up VRAM and Python memory."""
     dbg("Cleaning up memory...")
     gc.collect()
     if torch.cuda.is_available():
         torch.cuda.empty_cache()
     dbg("Memory cleanup complete.")
+# --- Gradio Wrapper Functions ---
 def run_single_analysis_display(*args, progress=gr.Progress(track_tqdm=True)):
+    """Wrapper for a single manual experiment."""
     try:
         results = run_seismic_analysis(*args, progress_callback=progress)
         stats = results.get("stats", {})
         deltas = results.get("state_deltas", [])
         df = pd.DataFrame({"Internal Step": range(len(deltas)), "State Change (Delta)": deltas})
         stats_md = f"### Statistical Signature\n- **Mean Delta:** {stats.get('mean_delta', 0):.4f}\n- **Std Dev Delta:** {stats.get('std_delta', 0):.4f}\n- **Max Delta:** {stats.get('max_delta', 0):.4f}\n"
     except Exception:
         return f"### ❌ Analysis Failed\n```\n{traceback.format_exc()}\n```", pd.DataFrame(), {}
     finally:
         cleanup_memory()
 PLOT_PARAMS = {
     "x": "Step",
     "y": "Delta",
 }
 def run_auto_suite_display(model_id, num_steps, seed, experiment_name, progress=gr.Progress(track_tqdm=True)):
+    """Wrapper for the automated experiment suite, now returning a new plot component."""
     try:
         summary_df, plot_df, all_results = run_auto_suite(model_id, int(num_steps), int(seed), experiment_name, progress)
         dbg("Plot DataFrame Head for Auto-Suite:\n", plot_df.head())
         new_plot = gr.LinePlot(value=plot_df, **PLOT_PARAMS)
         return summary_df, new_plot, all_results
     except Exception:
         empty_plot = gr.LinePlot(value=pd.DataFrame(), **PLOT_PARAMS)
         return pd.DataFrame(), empty_plot, f"### ❌ Auto-Experiment Failed\n```\n{traceback.format_exc()}\n```"
     finally:
         cleanup_memory()
+# --- Gradio UI Definition ---
 with gr.Blocks(theme=theme, title="Cognitive Seismograph 2.3") as demo:
     gr.Markdown("# 🧠 Cognitive Seismograph 2.3: Advanced Experiment Suite")
     with gr.Tabs():
         with gr.TabItem("🔬 Manual Single Run"):
+            gr.Markdown("Run a single experiment with manual parameters to explore hypotheses.")
             with gr.Row(variant='panel'):
                 with gr.Column(scale=1):
                     gr.Markdown("### 1. General Parameters")
                     manual_run_btn = gr.Button("Run Single Analysis", variant="primary")
                 with gr.Column(scale=2):
                     gr.Markdown("### Single Run Results")
+                    manual_verdict = gr.Markdown("Analysis results will appear here.")
                     manual_plot = gr.LinePlot(x="Internal Step", y="State Change (Delta)", title="Internal State Dynamics", show_label=True, height=400, interactive=True)
                     with gr.Accordion("Raw JSON Output", open=False):
                         manual_raw_json = gr.JSON()
             )
         with gr.TabItem("🚀 Automated Suite"):
+            gr.Markdown("Run a predefined, curated suite of experiments and visualize the results comparatively.")
             with gr.Row(variant='panel'):
                 with gr.Column(scale=1):
                     gr.Markdown("### Auto-Experiment Parameters")