Home/ Materials/ Session 1 Facilitation

Pilot script

Session 1: Text

60–90 min Facilitation guide Pilot ready

Use this script for a 60–90 minute pilot focused on text generation, tokenization, probability, and the difference between fluent output and understanding.

Core question. When a language model writes, what is it actually doing?

Materials

Run of show

TimeSegmentFacilitator moveParticipant action
0–5Welcome & normsName the question. Remind everyone direct AI use is optional.Choose a participation pathway.
5–15Unplugged predictionShow a sentence stem; collect guesses in Zoom chat, then paste the chat block into the Next-Token Prediction Game to tally the room beside the model's top-k.Predict next words; compare the room's distribution with the model's and notice how context shapes guesses.
15–35Tokenizer + TemperatureShow token chunks, then run one prompt at low and high temperature.Notice token boundaries, probability bars, greedy vs. sampled.
35–55ELIZA vs. LLMInspect matched rule, pattern, and response template.Compare visible rule-matching with pre-generated LLM examples.
55–75InvestigationAssign one Text Experiment Board section.Run or analyse a default test, prompt variation, or comparison.
75–90DebriefUse the frame: human, machine, system, ethics, pedagogy.Share one evidence-based claim and one question.

Optional pilot-evidence move: after the investigation, ask one group to enter its baseline, temperature change, and greedy/sample comparison in the A/B/C Comparison Board, then export Markdown for facilitator notes.

Facilitator prompts

  • "What made your next-word guess feel obvious?"
  • "What does the tokenizer split that you would not have split as a human reader?"
  • "At low temperature, what becomes more predictable?"
  • "At high temperature, what becomes more surprising or unstable?"
  • "What does ELIZA make visible that a modern LLM hides?"
  • "Where could fluency be mistaken for understanding?"

Investigation prompt

Run the same starting phrase at low and high temperature. Then compare greedy decoding with sampling. What changed in the output, and what did not change about the mechanism?

Low-AI / No-AI pathway

Participants can complete the session using only the visualizer, ELIZA, and pre-generated comparison examples. They do not need to log into or prompt a live LLM.

Fallback plan

  • If the Tokenizer tool feels too dense, use only the example buttons and temperature slider.
  • If projection space is limited, focus on the probability chart and generated text stream.
  • If ELIZA examples feel sensitive, use low-stakes prompts about school, projects, or planning rather than emotional disclosures.
  • If a participant does not want to use AI, ask them to design a classroom version of the prediction game.

Pilot QA notes

During the pilot, note:

  • Which controls participants found without explanation.
  • Whether "temperature" became understandable through the tool.
  • Whether ELIZA's rule inspector was legible on screen.
  • Any moment where participants confused probability with truth.
  • Any ethical discomfort, especially around chatbot intimacy or trust.