Zarf Updates: The Visible Zorker

The Visible Zorker

Tuesday, January 14, 2025 (updated 1 day later)

Comments: 33 (latest 2 days later)

Tagged: if, interactive fiction, zork, infocom, zil, zarf

Here's a little something I've been working on: The Visible Zorker!

This screenshot has spoilers for Zork 1. This whole project is spoilers for Zork 1. That's the point.

Really, go give it a shot. It's a toy. You can read the rest of this post later.

...Okay, a quick introduction. The left pane is regular old Parchment, the Z-code interpreter, playing Zork 1. You type commands; the game responds.

Just regular old Parchment? Not quite! This is Parchment exposed. The upper right pane shows the stack trace for the current turn. That's all the ZIL functions called, and all the text printed, when executing the most recent command.

And the bottom right pane shows the ZIL source code -- the original text, written by Infocom folks in the 1980s. Click on any function or printed string; it'll show you that code in context.

Now check out the other tabs!

A list of rooms and objects from Zork. A list of variables, starting with "HERE: EAST-OF-HOUSE", "SCORE: 0", "MOVES: 3". A list of timer functions: "I-LANTERN count 200", "I-CANDLES count 40", "I-THIEF count -4", "I-SWORD count 0-1". Only I-THIEF is marked as active. A list of filenames: "zork1.zil", "1actions.zil", "1dungeon.zil", etc.

The "World" tab shows the game world as nested objects. The "State" tab shows ZIL global variables. "Timers" is the table of timers and daemons -- functions called every turn or counting down to a future call.

All of these displays update live, every turn, as you play the game. You can click on any line to see the ZIL source that implements it.

And those green buttons? Those display my comments on the source. ZIL isn't the easiest language to read (it's a Lisp derivative), so I wrote up some helpful footnotes.

Really, go play with it. Run around. See how Zork works. Haven't you always wondered?

(I mean it about the spoilers, though.)

Seriously, you did what?

Infocom's games are among the best-researched works in videogame history. The Z-machine format has long since been documented. The games have been disassembled and analyzed. And then, in 2019, we got their original ZIL source code.

But most players have never read this stuff. What if I built a way to visualize the Z-machine as it executed? Like the Visible Woman at the science museum. Internals illuminated; cheerfully explaining itself; transgressively fascinating. (Especially if you're a twelve-year-old science nerd... boy.)

I think of it as a kind of exploratory programming. It's on the code-reading side rather than code-writing -- but reading code is so much of software development!

Or you can think of it as the Penn-and-Teller approach to the magic of game design. Zork is a great trick, and knowing how it works makes it greater.

And wow, this was a fun project to work on. A challenge, on several levels.

What was hard about this?

The first problem was extracting the data that the Visible Zorker needs.

I said that Zork (and the Z-machine) had been analyzed to the bones right? Yes, but not in the way I needed. Remember, ZIL is a compiled language. All the functions in the source code have been converted to numeric opcodes, operating on numbers.

Here's a bunch of opcodes extracted from the compiled game file. This is the function at memory address $100D8. We've had this listing since the 1990s:

Routine 100d8, 2 locals (0000, 0000)

100dd:  GET_PROP        L00,#07 -> L01
100e1:  JL              L01,#00 [FALSE] RTRUE
100e5:  SUB             #00,L01 -> -(SP)
100e9:  PUT_PROP        L00,#07,(SP)+
100ee:  GET_PROP        L00,#11 -> -(SP)
100f2:  CALL            (SP)+ (#04) -> -(SP)
100f7:  RTRUE

And here's the corresponding ZIL source, which we got in 2017:

<ROUTINE AWAKEN (O "AUX" (S <GETP .O ,P?STRENGTH>))
     <COND (<L? .S 0>
        <PUTP .O ,P?STRENGTH <- 0 .S>>
        <APPLY <GETP .O ,P?ACTION> ,F-CONSCIOUS>)>
     T>

If you have a reference, you can see how these match up. The first line gets property 07 from the object in local variable 00 -- that must be the STRENGTH property. It stores that value in local variable 01. Then it checks whether that's less than zero. (JL is "jump if less than...") And so on.

But -- here's the trick -- how did I know that these definitions went together? How did I know that function $100D8 corresponded to the AWAKEN routine rather than, say, I-FIGHT or INFESTED?

In some cases it's easy. Here's another disassembled routine:

Routine 10a3e, 0 locals ()

10a3f:  JE              G78,#39,#23,#2b [FALSE] 10a4f
10a46:  PRINT_RET       "You can't do that."
10a4f:  JE              G78,#38 [FALSE] RFALSE
10a53:  PRINT           "It looks pretty much like a "
10a66:  PRINT_OBJ       G76
10a68:  PRINT_RET       "."

The PRINT and PRINT_RET opcodes contain embedded string data -- the disassembler knows how to decode this. It's easy to find the ZIL code that corresponds to that. It must be this function:

<ROUTINE DUMB-CONTAINER ()
     <COND (<VERB? OPEN CLOSE LOOK-INSIDE>
        <TELL "You can't do that." CR>)
           (<VERB? EXAMINE>
        <TELL "It looks pretty much like a " D ,PRSO "." CR>)>>

So the first thing I did was write a ZIL parser. It runs through the source files and parses all the functions. For each function, it records (a) the function name; (b) the location in the ZIL source; (c) all the strings used in TELL statements.

And then I wrote a parser for the disassembly dump, which runs through and extracts (a) the function address and (b) all the embedded strings in PRINT opcodes.

I figured I'd have to write a fussy search algorithm to match up functions in the first list with functions in the second list. And for function with no embedded text, like AWAKEN? I'd have to match them up by hand!

...Then it turned out that the ZIL compiler generated functions in strict source code order. I didn't have to do any searching; the two lists were already in the same order. Exploratory programming, right?

(It wasn't quite that easy. ZIL supports conditional compilation -- like #ifdef in C -- and my parser had to account for that. Just a bit more work. On the up side, I needed those source code locations for the app anyhow.)

Well, that takes care of the functions. What about the objects? Here's a ZIL object definition:

<OBJECT LAMP
    (IN LIVING-ROOM)
    (SYNONYM LAMP LANTERN LIGHT)
    (ADJECTIVE BRASS)
    (DESC "brass lantern")
    (FLAGS TAKEBIT LIGHTBIT)
    (ACTION LANTERN)
    (FDESC "A battery-powered brass lantern is on the trophy case.")
    (LDESC "There is a brass lantern (battery-powered) here.")
    (SIZE 15)>

The same disassembler can generate a list of the object data:

164. Attributes: 17, 31
     Parent object: 193  Sibling object: 183  Child object:   0
     Property address: 1a97
         Description: "brass lantern"
          Properties:
              [18] 44 51 44 5f 44 c8 
              [17] 6e 32 
              [16] e9 
              [15] 00 0f 
              [14] 87 4d 
              [11] 87 5f

Happily, the object description ("brass lantern") is embedded in the object data, so that's easy to match up.

...or is it? What about this object dump?

 59. Attributes: 5, 6
     Parent object:  82  Sibling object:  60  Child object:   0
     Property address: 1091
         Description: "Maze"
          Properties:
              [30] 3c 
              [29] 36 
              [23] 3a 
              [11] 90 cd

The description is "Maze"... just like the other fourteen "Maze" rooms. How do I tell those apart?

Turns out the property data describes the exits. Property 23 is UP, 29 is WEST, 30 is EAST, so can we find a maze room definition with pattern? We can. And hey, that tells us what rooms $36, $3A, and $3C are too...

Mind you, at first I didn't know what property matched with which direction! Extra puzzle fun. But it was solvable, working backwards from the dead ends and the Troll Room.

Working through this mapping was a real deja vu moment. I was mapping the Zork maze! One room at a time, checking the exits... It felt like 1980 all over again.

Then I did it all again for the global variables list, the properties, the attributes...

After all that, I remembered that Allen Garvin, Ben Rudiak-Gould, and Ethan Dicks did lot of this analysis work back in 2007. That didn't solve all my problems -- they didn't have the ZIL source, so they made up their own function names and so on. ($100D8 is CheckStrength in that file.) But it confirmed the property, attribute, and global variable numbers pretty well.

So after that it was easy, right?

Hooking up Parchment to a display UI was pretty easy. That was a question of collecting internal Z-machine info into a JS object and exporting it. (A list of global variable values, a list of object locations, a list of function addresses called this turn... Just numbers.) Then I had to convert all the address mappings I'd worked out (and objects, globals, etc) into JSON data. The UI loads all that JSON, and then it can display $100D8 as AWAKEN.

Designing that UI was a journey. Again, exploratory: a very iterative process.

I started out with the basic ideas of a call tree, a list of printed strings, a table of objects. But how is that presented? Does the call tree include printing strings, or are those separate tabs? What does the source-code pane display at any given time?

I built a display pane, tried it out, and asked "What can't I see?" Then I did it again. And again. "What button am I reaching for that doesn't exist?" (I didn't know that the source pane needed forwards/backwards buttons until I reached for them.)

The Timers tab wasn't even an idea until I asked "Where is the lamp's battery counter stored, anyway?" I had unconsciously assumed it would be a property of the lamp object, because that's how Inform works. But it's not. It's not a global variable either. Where the heck is it?

Turns out it's a timer function which counts down from 200. When that runs out, it displays a message from LAMP-TABLE and resets to 100. Then 70, then 15, then it's dead. So the total lamp life is 385, but you have to dig quite a bit to understand why.

But you can't illuminate the workings of Zork without showing the lamp counter! So I added the Timers tab. Once I looked at it, I realized it was indispensable.

Updates

Jan 15th: Fixed some syntax-coloring bugs. Local variables are tinted brown now. Added some commentary.

Comments from Mastodon

josh g. (January 14, 2025 at 11:42 AM):

@zarfeblong This is fantastic, thanks!

Andrew Plotkin (January 14, 2025 at 11:54 AM):

@joshg You’re welcome!

Brian Kerr (January 14, 2025 at 11:59 AM):

@zarfeblong

This is really, really nicely done. Looking forward to messing around with it some more later. Thank you.

sb :q! (January 14, 2025 at 12:02 PM):

@zarfeblong
This is amazing! Thank you!

Jason Heiser (January 14, 2025 at 12:37 PM):

@zarfeblong Wonderful! cc: @andybaio

Alexander Shendi (January 14, 2025 at 3:17 PM):

@zarfeblong

Does it show the MDL or ZIL code?
Anyway **HUGE** thanks!

Andrew Plotkin (January 14, 2025 at 3:19 PM):

@alexshendi ZIL. This is release 88, dated 1984, from the Masterpieces CD etc. (But not the Invisiclues version.)

arcanetrivia (January 14, 2025 at 5:09 PM):

@zarfeblong Man, this kicks butt. I do wish it were globally usable for any game, but I can see why you wouldn't put the work in to doing all of that.

Andrew Plotkin (January 14, 2025 at 10:04 PM):

@arcanetrivia Truly general isn't possible, sadly. Even supporting all Infocom games wouldn't have a lot of shortcuts.

gjm (January 14, 2025 at 6:58 PM):

@zarfeblong I concur with everyone else: this is _extremely_ neat. Thank you, and well done!

Jeff Palmer (January 14, 2025 at 7:07 PM):

@zarfeblong This is amazing! Very, very cool. Thank you! 🙏

Aram Sinnreich (January 14, 2025 at 9:34 PM):

@zarfeblong my inner 12-year-old thanks you profoundly.

Torbjörn Andersson (January 15, 2025 at 2:08 AM):

@zarfeblong Reminds me of the "Wilderland" and "Foggy London" ZX Spectrum emulators, though I've mostly just read about them. I don't know how well the work.

http://veronikamegler.com/WL/wl.htm

Torbjörn Andersson (January 15, 2025 at 6:38 AM):

@zarfeblong One thing I could see this sort of thing being used for would be a version of Suspended with a built-in map showing where all the robots are. Perhaps even where they are currently going. Someone at Infocom must have had similar ideas, because the last archived source code had a custom status line to show the robot locations. (It doesn't work.)

Not making any requests, of course. This is just a piece of trivia that came to mind.

It also makes minor text changes. (Not shown here.)

Screenshot from Suspended, built (hastily) from the last archived version of the source code, showing the opening text. Unlike the release version, this has a three-line status line showing the headers "Cryolink to:", "Casualties this Cycle:", "Cycle:", "Auda:", "Iris:", "Poet:", "Sensa:", "Waldo:", and "Whiz:". No values have been printed for them yet as the opening screen ends with a "[MORE]" prompt.

Screenshot from the same version of Suspended. In addition to the status line, the prompt shows "(FC linked to Iris)". Unfortunately, the statusline shows mostly garbage strings.

Andrew Plotkin (January 15, 2025 at 10:18 AM):

@et_andersson Yeah, Suspended is the clear argument for a custom interpreter that would purely improve the game, no spoiler worries.

Andrew Plotkin (January 15, 2025 at 4:17 PM):

Ooh, blogged by Adafruit!

https://blog.adafruit.com/2025/01/15/the-visible-zorker-visualizes-zork-data-files-vintagecomputing-gaming-reverseengineering/

JP Sugarbroad (January 15, 2025 at 5:30 PM):

@zarfeblong Holy shit. Very very nice work. Now I need to go learn how the Z-machine worked back then.

Zorkmid (January 15, 2025 at 10:29 PM):

@zarfeblong

Very impressive work!

Marc

Andrew Plotkin (January 15, 2025 at 10:56 PM):

@Zorkmid Thank you!

Zorkmid (January 15, 2025 at 11:18 PM):

@zarfeblong

Why didn’t we think of this 40 years ago? Would have made finding bugs a lot easier...

dsyzling (January 16, 2025 at 10:22 AM):

@zarfeblong wow that's interesting !

Comments from Bluesky

Brian T Seagull (January 14, 2025 at 1:05 PM):

Very cool.

Full Metal (January 14, 2025 at 1:50 PM):

Zork. The choose your own adventure

Vivienne Dunstan (January 14, 2025 at 12:27 PM):

That is brilliant :) Thanks!

Cydniey Buffers (January 14, 2025 at 3:31 PM):

Ooh goody goody! 🤪 *jumps up and down on bed*

Brian Uri! (January 14, 2025 at 12:16 PM):

Thanks for the behind-the scenes write-up. Mapping the Zork maze by unmaking it into source code seems like a very zarfian conceit for an IF comp game!

David Finster (January 14, 2025 at 11:56 AM):

Just finished reading the blog post and had to come pay my respects. Awesome work! I look forward to digging in deeper. I played all these Infocom games when they were new. Good times. It’s how I learned to type.

BirdDroneOne (January 14, 2025 at 4:09 PM):

Not just cool. XYZZY cool.

Ben (January 14, 2025 at 4:01 PM):

thats a fun project! I built a python game inspired by this, and the thing that continues to amaze me about Zork is the language / interaction parser like how did Zork developer figure out how to contextually understand the meaning of 'stab theif with dagger'? Definitely not NLP, so how?

Andrew Plotkin (January 14, 2025 at 4:11 PM):

The underlying idea is pretty simple, and is still used in modern IF systems like Inform. You just have a set of patterns like "STAB obj WITH obj", and you run through them matching words against either fixed terms ("STAB") or object synonym lists ("NASTY/KNIFE/BLADE").

Andrew Plotkin (January 14, 2025 at 4:13 PM):

There's a *lot* of detail to get right about disambiguation, inferring missing nouns, conditional word matches, etc. And you have to put lots of synonyms in the lists. But it's not very magical. This is "gsyntax.zil" in the app.

Ben (January 14, 2025 at 4:16 PM):

thats approx what I built as well, but the # of interactions to support (or not) can get quite huge <obj class / subclass> was slightly better but still gets unwieldy depending on that first action <get>, <put>, <attack> it was just a marvel how complicated that got and how well zork did at it

Andrew Plotkin (January 14, 2025 at 10:23 PM):

Action dispatching is a completely separate step! This is one of the key insights.

Zarf Updates

Interactive fiction, narrative in games, and so on

Posts by

My links

Search (via DDG)

Blog archive

Tag archive

Previous post

Next post

Feeds

Games are for everyone

The Visible Zorker

Tuesday, January 14, 2025 (updated 1 day later)

Comments: 33 (latest 2 days later)

Tagged: if, interactive fiction, zork, infocom, zil, zarf

Seriously, you did what?

What was hard about this?

So after that it was easy, right?

More questions...

Updates

Comments from Mastodon

Comments from Bluesky