Vista de Lectura

Hay nuevos artículos disponibles. Pincha para refrescar la página.

Convert Any Book to a DIY Audiobook?

6 Julio 2025 at 08:00

If the idea of reading a physical book sounds like hard work, [Nick Bild’s] latest project, the PageParrot, might be for you. While AI gets a lot of flak these days, one thing modern multimodal models do exceptionally well is image interpretation, and PageParrot demonstrates just how accessible that’s become.

[Nick] demonstrates quite clearly how little code is needed to get from those cryptic black and white glyphs to sounds the average human can understand, specifically a paltry 80 lines of Python. Admittedly, many of those lines are pulling in libraries, and some are just blank, so functionally speaking, it’s even shorter than that. Of course, the whole application is mostly glue code, stitching together other people’s hard work, but it’s still instructive and fun to play with.

The hardware required is a Raspberry Pi Zero 2 W, a camera (in this case, a USB webcam), and something to hold it above the book. Any Pi with the ability to connect to a camera should also work, however, with just a little configuration.

On the software side, [Nick] pulls in the CV2 library (which is the interface to OpenCV) to handle the camera interfacing, programming it to full HD resolution. Google’s GenAI is used to interface the Gemini 2.5 Flash LLM via an API endpoint. This takes a captured image and a trivial prompt, and returns the whole page of text, quick as a flash.

Finally, the script hands that text over to Piper, which turns that into a speech file in WAV format. This can then be played to an audio device with a call out to the console aplay tool. It’s all very simple at this level of abstraction.

Yes, we know it’s essentially just doing the same thing OCR software has been doing for decades. Still, the AI version is remarkably low-effort and surprisingly accurate, especially when handling unusual layouts that confound traditional OCR algorithms. Extensions to this tool would be trivial; for example, adjusting the prompt to ask it to translate the text to a different language could open up a whole new world to some people.

If you want to play along at home, then head on over to the PageParrot GitHub page and download the script.

If this setup feels familiar, you’d be quite correct. We covered something similar a couple of years back, which used Tesseract OCR, feeding text to Festvox’s CMU Flite tool. Whilst we’re talking about text-to-speech, here’s a fun ESP32-based software phoneme synthesiser to recreate that distinctive 1980s Speak & Spell voice.

Fancy Adding a Transputer Or Two To Your Atari ST?

Hackaday

Dave Rowntree

15 Mayo 2025 at 05:00

Has anybody heard of the ATW800 transputer workstation? The one that used a modified Atari ST motherboard as a glorified I/O controller for a T-series transputer? No, we hadn’t either, but transputer superfan [Axel Muhr] has created the ATW800/2, an Atari Transputer card, the way it was meant to be.

The transputer was a neat idea when it was conceived in the 1980s. It was designed specifically for parallel and scientific computing and featured an innovative architecture and dedicated high-speed serial chip-to-chip networking. However, the development of more modern buses and general-purpose CPUs quickly made it a footnote in history. During the same period, a neat transputer-based parallel processing computer was created, which leveraged the Atari ST purely for its I/O. This was the curious ATW800 transputer workstation. That flopped as well, but [Axel] was enough of a fan to take that concept and run with it. This time, rather than using the Atari as a dumb I/O controller, the card is explicitly designed for the Mega-ST expansion bus. A second variant of the ATW800/2 is designed for the Atari VME bus used by the STe and TT models—yes, VME on an Atari—it was a thing.

The card hosts an FPGA module, specifically the Tang 20k, that handles the graphics, giving the Atari access to higher resolutions, HDMI output, and GPU-like acceleration with the right code. The FPGA also contains a ‘synthetic’ transputer core, compatible with the Inmos T425, with 6Mb of RAM to play with. Additionally, the board contains an original Inmos C011 link adapter chip and a pair of size-1 TRAM slots to install two physical transputer cards. This allows a total of two transputers, each with its dedicated RAM, to be installed and networked with the synthetic transputer and the host system. The FPGA is configured to allow the host CPU and any of the transputers direct access to the video RAM, so with proper coding, the same display can mix 68K and parallel computing applications simultaneously. The original ATW800 couldn’t do that!

In addition to the transputer support and boosted graphics, the card also provides a ROM big enough to switch between multiple Atari TOS versions, USB loop-through ports to hook up to a lightning-ST board, and a MicroSD slot for extra local storage. What a project!

If you don’t know what the transputer is (or was), read our quick guide. Of course, forty-year-old silicon is rare and expensive nowadays, so if you fancy playing with some hardware, might we suggest using a Pi Pico instead?

Thanks to [krupkaj] for the tip!

Simulating High-Side Bootstrap Circuits With LTSpice

Hackaday

Dave Rowntree

13 Mayo 2025 at 08:00

LTSpice is a tool that every electronics nerd should have at least a basic knowledge of. Those of us who work professionally in the analog and power worlds rely heavily on the validity of our simulations. It’s one of the basic skills taught at college, and essential to truly understand how a circuit behaves. [Mano] has quite a collection of videos about the tool, and here is a great video explanation of how a bootstrap circuit works, enabling a high-side driver to work in the context of driving a simple buck converter. However, before understanding what a bootstrap is, we need to talk a little theory.

Bootstrap circuits are very common when NMOS (or NPN) devices are used on the high side of a switching circuit, such as a half-bridge (and by extension, a full bridge) used to drive a motor or pump current into a power supply.

A simple half-bridge driving illustrates the high-side NMOS driving problem.

From a simplistic viewpoint, due to the apparent symmetry, you’d want to have an NMOS device at the bottom and expect a PMOS device to be at the top. However, PMOS and PNP devices are weaker, rarer and more expensive than NMOS, which is all down to the device physics; simply put, the hole mobility in silicon and most other semiconductors is much lower than the electron mobility, which results in much less current. Hence, NMOS and NPN are predominant in power circuits.

As some will be aware, to drive a high-side switching transistor, such as an NPN bipolar or an NMOS device, the source end will not be at ground, but will be tied to the switching node, which for a power supply is the output voltage. You need a way to drive the gate voltage in excess of the source or emitter end by at least the threshold voltage. This is necessary to get the device to fully turn on, to give the lowest resistance, and to cause the least power dissipation. But how do you get from the logic-level PWM control waveform to what the gate needs to switch correctly?

The answer is to use a so-called bootstrap capacitor. The idea is simple enough: during one half of the driving waveform, the capacitor is charged to some fixed voltage with respect to ground, since one end of the capacitor will be grounded periodically. On the other half cycle, the previously grounded end, jumps up to the output voltage (the source end of the high side transistor) which boosts the other side of the capacitor in excess of the source (because it got charged already) providing a temporary high-voltage floating supply than can be used to drive the high-side gate, and reliably switch on the transistor. [Mano] explains it much better in a practical scenario in the video below, but now you get the why and how of the technique.

We see videos about LTSpice quite a bit, like this excellent YouTube resource by [FesZ] for starters.

Rebooting An 1973 Art Installation Running On A Nova

Hackaday

Dave Rowntree

12 Mayo 2025 at 08:00

Electronics-based art installations are often fleeting and specific things that only a select few people who are in the right place or time get to experience before they are lost to the ravages of ‘progress.’ So it’s wonderful to find a dedicated son who has recreated his father’s 1973 art installation, showing it to the world in a miniature form. The network-iv-rebooted project is a recreation of an installation once housed within a departure lounge in terminal C of Seattle-Tacoma airport.

You can do a lot with a ‘pi and a fistful of Teensies!

The original unit comprises an array of 1024 GE R6A neon lamps, controlled from a Data General Nova 1210 minicomputer. A bank of three analog synthesizers also drove into no fewer than 32 resonators. An 8×8 array of input switches was the only user-facing input. The switches were mounted to a floor-standing pedestal facing the display.

For the re-creation, the neon lamps were replaced with 16×16 WS2811 LED modules, driven via a Teensy 4.0 using the OctoWS2811 library. The display Teensy is controlled from a Raspberry Pi 4, hooked up as a virtual serial device over USB. A second Teensy (you can’t have too many Teensies!) is responsible for scanning a miniature 8×8 push button array as well as running a simulation of the original sound synthesis setup. Audio is pushed out of the Teensy using a PT8211 I²S audio DAC, before driving a final audio power amp.

Attempting to reproduce accurately how the original code worked would be tricky, if downright impossible, but fear not, as the network-iv-rebooted is running the original code. Since the artist was astute enough to keep not only the engineering drawings and schematics, but also the original paper tape of the Nova 1210 program, it could be successfully run using the SIMH Nova emulator. The simulator needed to be modified to support the optional ‘device 76’ GPIO device added to the Nova 1210 for handling the extra connectivity. This was a small price to pay compared to the alternative. That said, most of the heavy lifting on the I/O side is performed by the pair of Teensies, with modern coding methods making life a lot easier.

Mechanics and code for the reproduction are being collected on this GitHub repo for those interested in building a clone. The opus20 page has a few photos and details of the original installation, but many more pieces can be found on the sculptures page, complete with a neat video tour, which we also include below. Check out those circuit sculptures! Groovy!

We’ve recently featured some retro electronic art, drooled over some circuit sculptures, and swooned at some PCB art. We just can’t get enough!

A short video about James Seawright’s other pieces: