Close
0%
0%

Poetroid - Poetry Capturing Camera

Morph the mundane memory-catcher into a mosaic maker of metaphors, marshaling mementos not in megapixels but in mellifluous meter.

Public Chat
Similar projects worth following
Welcome to the whimsical world of Poetroid, where technology waltzes with words! Morph the mundane memory-catcher into a mosaic maker of metaphors, marshaling mementos not in megapixels but in mellifluous meter.

🎚 Dial a Poet: Twist the top-side dial to select your poet.

🖨 Print Poetic Visions: Each poem is instantly printed out on a thermal prinr for you to keep, scrapbook, or pin to the refrigerator.
🔧 Off-the-shelf Heart: Built with accessible, DIY-friendly parts and a whole lot of love for the whole lot of you.

Poetroid is powered by a Single Board Computer (SBC) running Linux, a thermal printer, screen, and mechanical keyboard controls. It is powered by a Bezos-Barn battery pack of questionable legitimacy. 

Check it out: 

The image recognition and poetry-writing is powered by self-hosted and open source models and libraries. They can be run on the SBC itself, slowly, or on a server with a GPU. 

Check the sidebar for code repo. Server code coming soon.

  • 1 × Single Board Computer Any SBC that runs linux with a GUI and 3x USB ports. This build used the Orange Pi Zero 3 + USB extension board. See build details for considerations.
  • 1 × Webcam A standard USB webcam. This build is using Logitech C270 potato.
  • 1 × Screen A raspberry-pi compatible screen of whatever size.
  • 1 × Mechanical Macro Keyboard A programmable usb macro keypad with a clickable dial and three buttons.
  • 1 × Battery Any USB battery pack with enough USB ports. Preferably with an on/off switch.

View all 7 components

  • Part: The SBC

    stopsendingmejunk01/30/2024 at 21:47 0 comments

    Ideally, Poetroid runs the language models on board. Currently easier said than done. 

    I initially wanted to find a low-powered board that could run the language model itself. While it is possible to run the models on many SBCs, according to the information I could find and my own testing on the few boards I own, processors are not powerful enough and language models are not small enough yet to do this well, quickly, and easily. 

    Some of the options I considered included Rockchip's latest, the zimaboards, intel N100 (tested), the Pocket AI from ADLink, AMD mobile GPUs (tested), and many others advertised with AI or NPU capabilities. 

    There are various issues in the current generation of things. AI-accelerated embeddable devices seem to be optimized for vision-related tasks, which need different compute than the matrix multiplication required by language models. 

    AMD's mobile 7000 or 8000 series (I have tested on the Win Mini and the Steam Deck) could possibly be fast enough and efficient enough, but running language models on AMD mobile GPUs is not as of last week straightforward. Projects like MLC exist to run LLMs accelerated on AMD hardware, but are currently limited in multimodal capabilities.

    Surprisingly, the best performance-per-watt board that I found is the m1 mac mini. 5-7 watts standby and 39 watts at full load, and it's possible to run multimodal models with ollama at acceptable speeds. And don't think I didn't take the possibility seriously. The M1 mini is an impressively small SBC:

    Ultimately, however, none of these boards meet the right matrix of features needed for this project. I believe there are AI-accelerated boards focused on matrix-multiplication in the works and we will quickly see drastic improvements in this area. 

    So for now, I decided not to invest in any of the current solutions while I wait for a better SBC/LLM combo that can meet the needs of the project. To that end, the server software is written separately from the client software so it can be installed on another machine on the network (ideally one with cuda cores). While you can technically run it right on your camera's SBC, it will be slow for now. 

    The Poetroid is currently powered by an Orange Pi Zero 3 to run the client software. It has an add-on board with two USB ports and an audio jack.

    The USB ports are a neccessity and I was particularly interested in the audio jack. Even offloading the processing to an RTX 3090 there is a decent wait. The instant camera magic is instant, even though the image appears after a wait. With this, the wait is first, so it's more like a photo-booth. The idea of including a small speaker is useful, not just for the click noise of the "shutter", but to play a bunch of other random sounds like 90s dial up modem noises during the 20-30 second wait. 

    More 

  • Goals and Motivations

    stopsendingmejunk01/30/2024 at 20:52 0 comments

    The goal is to create a portable and self-contained instant-style camera that prints poems instead of pictures. Some of the motivations for this project:

    - A desire to create a fun art project to share with my friends while playing with my favorite things: impossibly tiny computers, language models, and usb cables. 

    Goals:

    - Self-contained. Meaning it requires no external servers or devices to run. Everything is on board from power to processing.

    - Simple to maintain. Unlike many of my projects which require 15 minute of tinkering and troubleshooting to run once before immediately breaking and become forgotten forver, I'd like this to be simple enough that I can loan it to a friend and they could use it without needing to jack into the matrix and hack the Gibson first. And then I can forget about it.

    - Off-the-porch parts. I'm not gonna lie, most of this stuff came from the Bezos Barn. Much of it can be found cheaper on ali-express and other Chinese suppliers if you are willing to wait longer. 

View all 2 project logs

Enjoy this project?

Share

Discussions

Similar Projects

Does this project spark your interest?

Become a member to follow this project and never miss any updates