my drive doesn't implement this weird feature called...

my drive doesn't implement this weird feature called pseudo RBAs--it's a way to artificially limit the maximum possible block address, presumably so they can hide the partition data.

i suspect the BIOS checks this, so i'll have to implement it. ugh. that means i need to figure out this incomprehensible diagram.

Like 8 Oct 2023 at 22:26 | Open on mastodon.social

66 comments

Tube❄️Time replied to Tube❄️Time

holy crap it's booting I can't believe it sdfadfsdfsdfsffasdf

8 Oct 2023 at 22:44 | Open on mastodon.social

Tube❄️Time replied to Tube❄️Time

well, it's working well enough to run qbasic. right now the drive is read-only.

8 Oct 2023 at 23:07 | Open on mastodon.social

Marsh Ray replied to Tube❄️Time

@tubetime “It’s working well enough to run QBASIC”

Love it 😂

9 Oct 2023 at 0:39 | Open on infosec.exchange

Ian Hanschen replied to Tube❄️Time

@tubetime this is incredibly rad

9 Oct 2023 at 4:19 | Open on mastodon.social

Tube❄️Time replied to Tube❄️Time

i think i need to dig into the 01290200 cache error that has been coming up. i'm concerned that an issue with my DBA-ESDI card has caused it, but i'm not sure.

14 Oct 2023 at 19:46 | Open on mastodon.social

Tube❄️Time replied to Tube❄️Time

looks like the cache is inside the CPU. i can't find any cache chips on the motherboard.

14 Oct 2023 at 19:47 | Open on mastodon.social

Tube❄️Time replied to Tube❄️Time

see? no cache or memory chips. the larger devices are probably semicustom gate array parts that IBM was fond of using. doubt they contain any cache memory.

14 Oct 2023 at 19:52 | Open on mastodon.social

Fritz Adalis replied to Tube❄️Time

@tubetime
@kenshirriff just posted about these CPUs. They have most of the support circuitry in the cpu itself.

14 Oct 2023 at 20:01 | Open on infosec.exchange

Tube❄️Time replied to Tube❄️Time

looks like the error is generated by an NMI that gets tripped when the cache is being set up. could be a number of causes but in general it is an issue with the internal CPU cache.

14 Oct 2023 at 21:15 | Open on mastodon.social

Tube❄️Time replied to Tube❄️Time

could also be this test of the DMA controller which is also included in the same set of tests and triggers the same error code, for some reason.

14 Oct 2023 at 21:32 | Open on mastodon.social

Tube❄️Time replied to Tube❄️Time

this gives me an idea.

14 Oct 2023 at 21:49 | Open on mastodon.social

Tube❄️Time replied to Tube❄️Time

pulling the CMOS battery...

14 Oct 2023 at 22:24 | Open on mastodon.social

Tube❄️Time replied to Tube❄️Time

hmm, the error still comes up. so i just tried what i *should have tried* at the start -- the 700 series diagnostic disk.

14 Oct 2023 at 22:37 | Open on mastodon.social

Tube❄️Time replied to Tube❄️Time

when the diagnostic detects the cache error, it asks if you have replaced the CPU card. i *lied to it* and said that I had, so when it asked if i wanted to keep the cache disabled, i said "N".

14 Oct 2023 at 22:38 | Open on mastodon.social

Tube❄️Time replied to Tube❄️Time

aaaand that fixed it! we're now booting to DOS off my DBA-ESDI disk replacement.

14 Oct 2023 at 22:40 | Open on mastodon.social

Tube❄️Time replied to Tube❄️Time

so here's what i think happened:
1. my early version of the FPGA code had a typo that caused the BURST# line to be held low
2. this caused the DMA controller to get stuck and time out during the cache test, presumably a very early CPU test that checks for cache coherency.
3. this error is *sticky* and gets written to some nonvolatile memory (perhaps not CMOS since i couldn't clear it by pulling the battery.)

14 Oct 2023 at 22:42 | Open on mastodon.social

Tube❄️Time replied to Tube❄️Time

this is all very good because i know the root cause and it's not something terrible like data bus contention, and it's thankfully not permanent damage.

14 Oct 2023 at 22:42 | Open on mastodon.social

Tube❄️Time replied to Tube❄️Time

it boots windows 3.1 now. it was trying to run a weird hdd power saving mode command I hadn't implemented. it also complains about the swap file because the filesystem is read only still.

16 Oct 2023 at 23:38 | Open on mastodon.social

Tube❄️Time replied to Tube❄️Time

so about that write issue: it's an off-by-two error somewhere. two bytes being a single 16-bit word, so it's really an off-by-one error.

16 Oct 2023 at 23:41 | Open on mastodon.social

Chuck replied to Tube❄️Time

@tubetime of course it is. 🙂 Thinking it was an off by two error was off by one. 😂

17 Oct 2023 at 0:10 | Open on chaos.social

Tube❄️Time replied to Tube❄️Time

figured it out and fixed it. i forget to set the "transfer request" flag to kick off DMA.

in another routine, it sees that this flag is clear and assumes that a word has already been read using DMA, so it reads a crap value and then sets the transfer request flag again to start the next DMA transfer. that "crap value" pushes the valid data forward by one word.

17 Oct 2023 at 0:22 | Open on mastodon.social

Tube❄️Time replied to Tube❄️Time

on to the next issues: randomly the ATN register mailbox flag gets set but the data in it is stale. also, the status interface register will randomly get read from by the host.

I think these are two facets of the same problem: the mailbox flags sometimes respond when you access a register that they are not supposed to be monitoring!

18 Oct 2023 at 3:46 | Open on mastodon.social

Tube❄️Time replied to Tube❄️Time

the mystery deepens. according to the logic analyzer, temp_atn_set never goes high. reg_atn_set (for crossing clock domains) is always 000. flag_atn is only set to 1 on this single line of code!

and yet, somehow, it magically flips to a 1.

19 Oct 2023 at 0:04 | Open on mastodon.social

Tube❄️Time replied to Tube❄️Time

looking at the generated logic, i see no explanation either. temp_atn_set (aka sd_cmd, my test point) never goes high. no glitches, no nothing. to set the flop, EN must be high and R must be low, and a clock edge must occur.

19 Oct 2023 at 0:28 | Open on mastodon.social

Tube❄️Time replied to Tube❄️Time

there's a glitch! that's why I missed it before, it's only 2ns. this is the signal from the MCA bus clock domain, and it's getting picked up in my other clock domain's edge detector.

19 Oct 2023 at 1:07 | Open on mastodon.social

Tube❄️Time replied to Tube❄️Time

and i believe this is the cause. this line right here. each signal, la_*, is an output from a flip flop latched by the micro channel bus cmd line. however, this line of code creates some combinational logic--there's a timing hazard here...

19 Oct 2023 at 1:13 | Open on mastodon.social

Tube❄️Time replied to Tube❄️Time

the problem? the line (la_addr == REG_ATN) creates a bunch of gates that are slightly slower than the simple AND gates in the previous part of the line.

so la_mca_op=1, ~la_s0_w_l=1, and (la_addr == REG_ATN) *is also a 1 for a very short time!!!* this is because the previous value of la_addr WAS a REG_ATN.

what i need to do is take that entire wire and turn it into a latch (a reg) and clock it on cmd.

19 Oct 2023 at 1:16 | Open on mastodon.social

Tube❄️Time replied to Tube❄️Time

so here's the solution: all the signals in the MCA bus domain go to a latch clocked in that domain (the first "always" block).

then *without any combinational logic* the output of that latch goes *directly* to another latch (the second "always" block) located in the main clock domain.

(i have another flip flop in main clock domain just for detecting the edge)

19 Oct 2023 at 1:52 | Open on mastodon.social

Tube❄️Time replied to Tube❄️Time

next step is to optimize the interface speed. right now it takes 25us to read a sector from the SD card but ~5 milliseconds (ouch) to DMA it to the PC!

it's mostly an issue with the Teensy-to-FPGA interface, which is async and simple: 4 address lines, 16 data lines, a read control line, and a write control line. everything else is done as a register in the 4-bit address space. flag register for status and mailbox sync bits.

19 Oct 2023 at 4:44 | Open on mastodon.social

Tube❄️Time replied to Tube❄️Time

why is it 5ms per sector for DMA? well, it's about 20us per word. most of that time is wasted by the slow interface between the Teensy and the FPGA. I really should fix that.

19 Oct 2023 at 23:26 | Open on mastodon.social

Tube❄️Time replied to Tube❄️Time

got rid of some delays and now we're down to 6us per word. but there are some unexpected wide gaps in between transfers between the FPGA and the Teensy.

19 Oct 2023 at 23:50 | Open on mastodon.social

Tube❄️Time replied to Tube❄️Time

ahh that must be the issue, when i change from a read to a write, i have to set the port direction for 16 IO pins. pinMode is uhh not quick, so let's try writing to the port direction register directly.

20 Oct 2023 at 0:22 | Open on mastodon.social

Tube❄️Time replied to Tube❄️Time

that's better, it's 2us per word now.

20 Oct 2023 at 0:23 | Open on mastodon.social

Tube❄️Time replied to Tube❄️Time

so how does this compare with the real drive? it transfers a word in about 1.6us (compared to 2us for mine), so it is slightly faster for sustained data transfer with data that fits in the internal buffer. however, seek times (in the ms range) more or less cancel this out.

20 Oct 2023 at 0:39 | Open on mastodon.social

Tube❄️Time replied to Tube❄️Time

here's a DMA cycle for the real hard drive. it's using the bus more efficiently, requesting a new transfer as soon as the transfer completes. my design waits for the full DMA cycle to end before requesting a new one.

20 Oct 2023 at 0:43 | Open on mastodon.social

Tube❄️Time replied to Tube❄️Time

maybe theoretical maximum transfer rates would make the comparison easier:
My drive: ~1MB/s
stock HDD: ~1.25MB/s (neglecting seek)

if i put in a bunch of work and implemented burst mode DMA transfer, i might be able to hit 4.5MB/s.

20 Oct 2023 at 1:02 | Open on mastodon.social

Tube❄️Time replied to Tube❄️Time

no more debugging wires. the DBA-ESDI drive is now running stand-alone!

21 Oct 2023 at 2:31 | Open on mastodon.social

Darryl Ramm replied to Tube❄️Time

@tubetime Damn, that was impressive.

21 Oct 2023 at 2:32 | Open on hachyderm.io

Tube❄️Time replied to Tube❄️Time

I printed an angled carrier. the angle is because the socketed Teensy is really tall.

21 Oct 2023 at 2:33 | Open on mastodon.social

Tube❄️Time replied to Tube❄️Time

time to lay out a version of this board for other PS/2 models, like the 50Z. this is the somewhat less uncommon 72-pin DBA-ESDI form factor.

10 Nov 2023 at 23:35 | Open on mastodon.social

Tube❄️Time replied to Tube❄️Time

well that was a bit tricky, but it's more or less laid out now.

11 Nov 2023 at 1:54 | Open on mastodon.social

Mike Grusin replied to Tube❄️Time

@tubetime Nice! (And LOLing because I JUST asked a client whether they really need the big teensy or could they go with the smaller one.)

11 Nov 2023 at 3:39 | Open on mastodon.world

doragasu replied to Tube❄️Time

@tubetime Reading the silk screen of your boards is always so much fun! 😂

11 Nov 2023 at 8:19 | Open on mastodon.sdf.org

Tube❄️Time replied to Tube❄️Time

looks like there's a rare problem when reading sectors, around 1 in 100,000 reads. ugh these can be hard to track down

12 Nov 2023 at 2:17 | Open on mastodon.social

Show 13 replies

Show 1 more reply

1000millimeter replied to Tube❄️Time

@tubetime Why is there a SD Card slot forseen on the baseboard as well?

10 Nov 2023 at 23:50 | Open on chaos.social

Tube❄️Time replied to 1000millimeter

@1000millimeter in case i want to use a soft CPU core in the FPGA instead of a separate Teensy

11 Nov 2023 at 1:54 | Open on mastodon.social

dumb future replied to Tube❄️Time

@tubetime This was really enjoyable to follow along with! Congrats on the progress

16 Oct 2023 at 23:41 | Open on hachyderm.io

Brian Swetland replied to Tube❄️Time

@tubetime Inconvenient!

14 Oct 2023 at 21:34 | Open on chaos.social

Samantaz Fox replied to Tube❄️Time

@tubetime Woohoo! Congrats!

9 Oct 2023 at 6:27 | Open on infosec.exchange

Vlad Vukicevic replied to Tube❄️Time

@tubetime "No command can not access" so.. uh.. commands can access? 🤯

14 Oct 2023 at 22:31 | Open on hachyderm.io

Tube❄️Time replied to Vlad

@vvuk the document is full of typos and errors.

14 Oct 2023 at 22:53 | Open on mastodon.social

Vlad Vukicevic replied to Tube❄️Time

@tubetime Your project is an ESDI drive emulator, right? (specific drive type, but ESDI interface) Would it work in another computer that had an ESDI controller and understood the IBM drive?

(I assume so, but then you mentioned microchannel which confused me -- I'm assuming the "creaky old IBM laptop" interface is ESDI?)

15 Oct 2023 at 0:57 | Open on hachyderm.io

Go Up