using the logic analyzer, i proved that the irq14 handler...

using the logic analyzer, i proved that the irq14 handler never gets called. the only code that *ever* sets the interrupt_detected flag exists in this handler. it's declared as a volatile so it can't be cached in a register.

Like 4 Oct 2023 at 2:16 | Open on mastodon.social

87 comments

Tube❄️Time replied to Tube❄️Time

I wrote the flag value out to an unused IO port, 0x4F, so I can see it on the logic analyzer. a neat trick!

4 Oct 2023 at 2:17 | Open on mastodon.social

Tube❄️Time replied to Tube❄️Time

so i don't know how this flag is getting set. my hack is to preemptively clear the flag right before starting DMA, and so far, it seems to be working.

i think this code was "working" with the real ESDI drive because that one uses burst mode DMA and it finishes up very quickly, before the irq14wait routine can exit early.

4 Oct 2023 at 2:20 | Open on mastodon.social

Tube❄️Time replied to Tube❄️Time

decided to look at the real drive. and guess what--it's not using burst mode. the POS registers have it turned off by default. it's also slow to read the data from the spinning disk, so IBM must have figured that it wasn't really necessary.

4 Oct 2023 at 23:15 | Open on mastodon.social

Brian Danger Hicks replied to Tube❄️Time

@tubetime When you find yourself asking, "How did this ever work?" and it turns out the answer is "It didn't."

4 Oct 2023 at 23:27 | Open on teh.entar.net

Tube❄️Time replied to Tube❄️Time

now I'm reading up on accessing SD cards from the Teensy 4.1. looks like SdFat is the library? could it be so easy?

5 Oct 2023 at 14:11 | Open on mastodon.social

Chris Hammond replied to Tube❄️Time

@tubetime I've used that library before, yes it really is easy. You probably won't set any throughput records, but it was great for writing diagnostic logs that were later read back and uploaded.

5 Oct 2023 at 16:12 | Open on hachyderm.io

Paula Maddox replied to Tube❄️Time

@tubetime Yes, it really is that easy :)

6 Oct 2023 at 7:59 | Open on mastodon.social

Tube❄️Time replied to Tube❄️Time

turns out it's easy but I had to reformat the SD card using the official sdcard.org utility. anyway, I've read the first sector from a real disk image!

7 Oct 2023 at 20:59 | Open on mastodon.social

Tube❄️Time replied to Tube❄️Time

the drive now gives the POS ID. let's try to boot!

7 Oct 2023 at 22:12 | Open on mastodon.social

Tube❄️Time replied to Tube❄️Time

hmm 01048200 is a drive select acknowledgement error.

7 Oct 2023 at 22:12 | Open on mastodon.social

Tube❄️Time replied to Tube❄️Time

the BIOS runs faster than the DIFDIAG utility, and so it seems like it is hitting a timing problem that i didn't hit before.

my drive code seems to randomly hang up and not respond correctly.

7 Oct 2023 at 22:14 | Open on mastodon.social

Tube❄️Time replied to Tube❄️Time

it's occasionally getting a spurious end-of-interrupt command which is really odd and points to an issue with the mailboxes (again, sigh).

but it's SO DARN CLOSE. it's transferring sectors from the IML region in the disk image.

7 Oct 2023 at 23:12 | Open on mastodon.social

Daniel Schwarz :toad: replied to Tube❄️Time

@tubetime What’s the end goal here, boot the laptop using a SD card that’s emulating a Micro Channel hard drive interface, via Teensy?

8 Oct 2023 at 0:09 | Open on toad.social

Tube❄️Time replied to Daniel Schwarz :toad:

@dschwarz yup!

8 Oct 2023 at 0:11 | Open on mastodon.social

Daniel Schwarz :toad: replied to Tube❄️Time

@tubetime I know it’s not the point of your project, but didn’t this laptop have PCMCIA? Would one of these work as a boot drive?

Digigear SD SDHC SDXC to PCMCIA PC Card, Adapter Supports, ATA Flash Memory https://a.co/d/agF7LO9

8 Oct 2023 at 0:46 | Open on toad.social

Tube❄️Time replied to Daniel Schwarz :toad:

@dschwarz nope, the memory card slot is pre-PCMCIA

8 Oct 2023 at 0:55 | Open on mastodon.social

Tube❄️Time replied to Tube❄️Time

figured out one problem. the disk boot routines slam the drive with an ATN and the first command word in 5.5us. the Teensy code takes too long to see the ATN and clears the command register full flag, which drops the first word. oops.

8 Oct 2023 at 22:02 | Open on mastodon.social

Tube❄️Time replied to Tube❄️Time

so it *almost* boots now. in fact it successfully loads the IML sectors from the hidden partition on the drive, and no longer throws an I999... error code!

8 Oct 2023 at 22:13 | Open on mastodon.social

Tube❄️Time replied to Tube❄️Time

my drive doesn't implement this weird feature called pseudo RBAs--it's a way to artificially limit the maximum possible block address, presumably so they can hide the partition data.

i suspect the BIOS checks this, so i'll have to implement it. ugh. that means i need to figure out this incomprehensible diagram.

8 Oct 2023 at 22:26 | Open on mastodon.social

Tube❄️Time replied to Tube❄️Time

holy crap it's booting I can't believe it sdfadfsdfsdfsffasdf

8 Oct 2023 at 22:44 | Open on mastodon.social

Tube❄️Time replied to Tube❄️Time

well, it's working well enough to run qbasic. right now the drive is read-only.

8 Oct 2023 at 23:07 | Open on mastodon.social

Marsh Ray replied to Tube❄️Time

@tubetime “It’s working well enough to run QBASIC”

Love it 😂

9 Oct 2023 at 0:39 | Open on infosec.exchange

Ian Hanschen replied to Tube❄️Time

@tubetime this is incredibly rad

9 Oct 2023 at 4:19 | Open on mastodon.social

Tube❄️Time replied to Tube❄️Time

i think i need to dig into the 01290200 cache error that has been coming up. i'm concerned that an issue with my DBA-ESDI card has caused it, but i'm not sure.

14 Oct 2023 at 19:46 | Open on mastodon.social

Tube❄️Time replied to Tube❄️Time

looks like the cache is inside the CPU. i can't find any cache chips on the motherboard.

14 Oct 2023 at 19:47 | Open on mastodon.social

Tube❄️Time replied to Tube❄️Time

see? no cache or memory chips. the larger devices are probably semicustom gate array parts that IBM was fond of using. doubt they contain any cache memory.

14 Oct 2023 at 19:52 | Open on mastodon.social

Fritz Adalis replied to Tube❄️Time

@tubetime
@kenshirriff just posted about these CPUs. They have most of the support circuitry in the cpu itself.

14 Oct 2023 at 20:01 | Open on infosec.exchange

Tube❄️Time replied to Tube❄️Time

looks like the error is generated by an NMI that gets tripped when the cache is being set up. could be a number of causes but in general it is an issue with the internal CPU cache.

14 Oct 2023 at 21:15 | Open on mastodon.social

Tube❄️Time replied to Tube❄️Time

could also be this test of the DMA controller which is also included in the same set of tests and triggers the same error code, for some reason.

14 Oct 2023 at 21:32 | Open on mastodon.social

Tube❄️Time replied to Tube❄️Time

this gives me an idea.

14 Oct 2023 at 21:49 | Open on mastodon.social

Tube❄️Time replied to Tube❄️Time

pulling the CMOS battery...

14 Oct 2023 at 22:24 | Open on mastodon.social

Tube❄️Time replied to Tube❄️Time

hmm, the error still comes up. so i just tried what i *should have tried* at the start -- the 700 series diagnostic disk.

14 Oct 2023 at 22:37 | Open on mastodon.social

Tube❄️Time replied to Tube❄️Time

when the diagnostic detects the cache error, it asks if you have replaced the CPU card. i *lied to it* and said that I had, so when it asked if i wanted to keep the cache disabled, i said "N".

14 Oct 2023 at 22:38 | Open on mastodon.social

Tube❄️Time replied to Tube❄️Time

aaaand that fixed it! we're now booting to DOS off my DBA-ESDI disk replacement.

14 Oct 2023 at 22:40 | Open on mastodon.social

Tube❄️Time replied to Tube❄️Time

so here's what i think happened:
1. my early version of the FPGA code had a typo that caused the BURST# line to be held low
2. this caused the DMA controller to get stuck and time out during the cache test, presumably a very early CPU test that checks for cache coherency.
3. this error is *sticky* and gets written to some nonvolatile memory (perhaps not CMOS since i couldn't clear it by pulling the battery.)

14 Oct 2023 at 22:42 | Open on mastodon.social

Tube❄️Time replied to Tube❄️Time

this is all very good because i know the root cause and it's not something terrible like data bus contention, and it's thankfully not permanent damage.

14 Oct 2023 at 22:42 | Open on mastodon.social

Tube❄️Time replied to Tube❄️Time

it boots windows 3.1 now. it was trying to run a weird hdd power saving mode command I hadn't implemented. it also complains about the swap file because the filesystem is read only still.

16 Oct 2023 at 23:38 | Open on mastodon.social

Tube❄️Time replied to Tube❄️Time

so about that write issue: it's an off-by-two error somewhere. two bytes being a single 16-bit word, so it's really an off-by-one error.

16 Oct 2023 at 23:41 | Open on mastodon.social

Chuck replied to Tube❄️Time

@tubetime of course it is. 🙂 Thinking it was an off by two error was off by one. 😂

17 Oct 2023 at 0:10 | Open on chaos.social

Tube❄️Time replied to Tube❄️Time

figured it out and fixed it. i forget to set the "transfer request" flag to kick off DMA.

in another routine, it sees that this flag is clear and assumes that a word has already been read using DMA, so it reads a crap value and then sets the transfer request flag again to start the next DMA transfer. that "crap value" pushes the valid data forward by one word.

17 Oct 2023 at 0:22 | Open on mastodon.social

Tube❄️Time replied to Tube❄️Time

on to the next issues: randomly the ATN register mailbox flag gets set but the data in it is stale. also, the status interface register will randomly get read from by the host.

I think these are two facets of the same problem: the mailbox flags sometimes respond when you access a register that they are not supposed to be monitoring!

18 Oct 2023 at 3:46 | Open on mastodon.social

Tube❄️Time replied to Tube❄️Time

the mystery deepens. according to the logic analyzer, temp_atn_set never goes high. reg_atn_set (for crossing clock domains) is always 000. flag_atn is only set to 1 on this single line of code!

and yet, somehow, it magically flips to a 1.

19 Oct 2023 at 0:04 | Open on mastodon.social

Tube❄️Time replied to Tube❄️Time

looking at the generated logic, i see no explanation either. temp_atn_set (aka sd_cmd, my test point) never goes high. no glitches, no nothing. to set the flop, EN must be high and R must be low, and a clock edge must occur.

19 Oct 2023 at 0:28 | Open on mastodon.social

Show 36 replies

dumb future replied to Tube❄️Time

@tubetime This was really enjoyable to follow along with! Congrats on the progress

16 Oct 2023 at 23:41 | Open on hachyderm.io

Brian Swetland replied to Tube❄️Time

@tubetime Inconvenient!

14 Oct 2023 at 21:34 | Open on chaos.social

Samantaz Fox replied to Tube❄️Time

@tubetime Woohoo! Congrats!

9 Oct 2023 at 6:27 | Open on infosec.exchange

Vlad Vukicevic replied to Tube❄️Time

@tubetime "No command can not access" so.. uh.. commands can access? 🤯

14 Oct 2023 at 22:31 | Open on hachyderm.io

Tube❄️Time replied to Vlad

@vvuk the document is full of typos and errors.

14 Oct 2023 at 22:53 | Open on mastodon.social

Vlad Vukicevic replied to Tube❄️Time

@tubetime Your project is an ESDI drive emulator, right? (specific drive type, but ESDI interface) Would it work in another computer that had an ESDI controller and understood the IBM drive?

(I assume so, but then you mentioned microchannel which confused me -- I'm assuming the "creaky old IBM laptop" interface is ESDI?)

15 Oct 2023 at 0:57 | Open on hachyderm.io

Alan Martello replied to Tube❄️Time

@tubetime Your recent posts make me believe you’re a time traveller - perhaps the great-great-grandson of an IBM engineer who as a child found an old notebook complaining about this problem your grandpa just couldn’t solve and that he was fired for and altered the arc of his life. You studied your whole life as an engineer for this moment to come back to today when the hardware was still available, fix the problem, then go back and help your grandpa. Let us know how it turns out.

9 Oct 2023 at 0:56 | Open on hachyderm.io

Tube❄️Time replied to Alan

@amart not too far off! my grandfather worked there. i once reverse engineered a prototype floppy drive he worked on, and got it working again. https://twitter.com/TubeTimeUS/status/1617703291483467776

9 Oct 2023 at 3:16 | Open on mastodon.social

Go Up