#linux Bot Logged User list

Network: Rizon
Modes: +CNRntz
Last Seen: 3 hours ago
Topic: Welcome to #linux ! | Channel Rules: https://wiki.rizon.net/index.php?title=Linux | Ask your question(s) and be patient as it is different time(s) around the globe | NUchat IRC client: https://github.com/lord3nd3r/NUchat
Rank: #21
Users: 123

Channel Log Archive for #linux


* All times are UTC
Filtering by user: brothchild
Friday, April 10, 2026
[00:26:40] brothchild happy moment of existence, everyone
[00:27:09] brothchild are there any clip interrogators that are good
[00:27:44] brothchild that's the one i tried and it's not good
[00:28:40] brothchild i'll try that
[00:29:11] brothchild https://x0.at/LuhN.png gives "a woman walking down a street next to a yellow bus, a picture by Makoto Shinkai, cg society contest winner, conceptual art, reflections. by makoto shinkai"
[00:29:48] brothchild this was done by flux.2 klein, which can generate in about 18 minutes on cpu only
[00:29:53] brothchild at this res
[00:29:54] brothchild ty
[00:30:13] brothchild i have llava on cpu here too
[00:30:18] brothchild somewhere, where is it
[00:30:45] brothchild better shed than dead
[00:31:19] brothchild ah this hm, this needs about 9GB RAM to run
[00:32:26] brothchild antirez happens to be a pretty cool dude. He did the uh, silent nmap syn/ack scan. long ago.
[00:33:09] brothchild yeah cpu only here
[00:33:29] brothchild either single thread, or using openblas-mp to get all cores working
[00:35:31] brothchild about 18 minutes for 800x600 4-iterations on ryzen 3500u, slow 33GB/s dual-channel RAM
[00:36:09] brothchild was even slower before i dropped in the second 8GB ddr4 sodimm
[00:36:31] brothchild which i got for $13, (now $71)
[00:37:28] brothchild you know what else is painful. Last august before llama.cpp supported amd well, you could get 32GB AMD MI50s for 150-180$, before shipping and import duties. Now you're lucky to find one for $500 and some ask $1000+
[00:38:10] brothchild i should have taken out a loan and bought them all as soon as I saw they would be supported
[00:38:50] brothchild life just gives me more and more reasons to be angry at myself
[00:39:55] brothchild btw, that's the full precision flux or zimage running on cpu
[00:39:58] brothchild not a quant
[00:42:13] brothchild oh for llms i tested 87 small models <13GB on a coding test from spring 2025 reddit, that only claude solved
[00:42:21] brothchild and i found two models that one-shotted it
[00:42:58] brothchild pretty fast too
[00:43:05] brothchild also cpu-only on ryzen laptop
[00:43:22] brothchild well pretty fast vs expectations for cpu-only inference
[00:43:48] brothchild enabling vulkan works but barely yields speedup with the vega8 iGPU
[00:46:22] brothchild rotating heptagon challenge: https://x0.at/69zr.txt -- ByteShape's amazing quant wen3-Coder-30B-A3B-Instruct-Q3_K_S-2.69bpw.gguf https://x0.at/SAQ3.py
[00:46:31] brothchild +Q
[01:00:40] brothchild ++
[01:01:15] brothchild iirc this one also complains when i use -ffast-math
[01:01:37] brothchild just -O3 and -mcpu-native so far. not tried profiling it yet
[01:02:11] brothchild or -mtune native whatever it is nowadays, i can't remember shit anymore
[01:04:46] brothchild lots of mutant anatomy with klein.. may be resolution / aspect dependent
[01:05:29] brothchild whenever i see someone 'test' a model with a standing pose i want to slap them
[01:05:57] brothchild the tough part is still giving actions like 'throwing a brick like a baseball pitcher'
[01:06:14] brothchild ah yeah i should try zimage too
[01:06:51] brothchild no idea how they are censored, not in my tests
[01:07:36] brothchild good skin tones, hand a bit too big, right arm occluded by leg
[01:08:32] brothchild when i start specifying multiple actors, their poses and spatial relationships i'm between 60 and 95% deformities
[01:08:59] brothchild i'm not into this enough to do the editing and inpainting and fixing
[01:09:23] brothchild but i seem to get less monstrosities at 800x600
[01:10:49] brothchild ./iris -m -s 4 --blas-threads 8 -d flux-klein-4b -W 800 -H 600 -o myfile.png -p $PROMPT
[01:11:26] brothchild so far i think 800x600 is least monstrous but generation is too slow to do a real test
[01:11:36] brothchild can specify seed for testing
[01:12:08] brothchild i got the hip and rocm stuff installed but they dont support iGPU
[01:12:28] brothchild but vulkan accel works on ryzen laptop at least for llama.cpp
[01:12:45] + brothchild so i think vulkan could possibly be hacked-in
[01:12:56] + brothchild if someone feels like spending some claude money on that
[01:21:14] + brothchild might also be interesting to use this as an upscaler, so sick of realesrgan-vulkan
[01:24:26] + brothchild thinking of writing a patch to include the prompt in the .png metadata
[01:24:55] + brothchild exiftool image.png |grep iris
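Editor's note: the patch idea above (storing the prompt in the .png metadata) can be sketched with a standard PNG tEXt chunk. This is only an illustration, not iris's code: the keyword `iris:prompt` and every function name below are made up for the example, and a 1x1 image is generated in memory so nothing depends on a real output file.

```python
import struct
import zlib

PNG_SIGNATURE = b"\x89PNG\r\n\x1a\n"


def make_chunk(chunk_type: bytes, data: bytes) -> bytes:
    """Serialize one PNG chunk: length, type, data, CRC-32 over type+data."""
    crc = zlib.crc32(chunk_type + data) & 0xFFFFFFFF
    return struct.pack(">I", len(data)) + chunk_type + data + struct.pack(">I", crc)


def iter_chunks(png: bytes):
    """Yield (type, data) for every chunk in a PNG byte string."""
    if not png.startswith(PNG_SIGNATURE):
        raise ValueError("not a PNG file")
    pos = len(PNG_SIGNATURE)
    while pos < len(png):
        (length,) = struct.unpack(">I", png[pos:pos + 4])
        chunk_type = png[pos + 4:pos + 8]
        data = png[pos + 8:pos + 8 + length]
        yield chunk_type, data
        pos += 12 + length  # 4 length + 4 type + data + 4 CRC


def add_text(png: bytes, keyword: str, text: str) -> bytes:
    """Return a copy of the PNG with a tEXt chunk inserted before IEND."""
    # tEXt payload is keyword, a NUL separator, then Latin-1 text.
    payload = keyword.encode("latin-1") + b"\x00" + text.encode("latin-1")
    out = [PNG_SIGNATURE]
    for chunk_type, data in iter_chunks(png):
        if chunk_type == b"IEND":
            out.append(make_chunk(b"tEXt", payload))
        out.append(make_chunk(chunk_type, data))
    return b"".join(out)


def read_text(png: bytes) -> dict:
    """Collect all tEXt chunks as a {keyword: text} dictionary."""
    result = {}
    for chunk_type, data in iter_chunks(png):
        if chunk_type == b"tEXt":
            keyword, _, text = data.partition(b"\x00")
            result[keyword.decode("latin-1")] = text.decode("latin-1")
    return result


def tiny_png() -> bytes:
    """Build a valid 1x1 grayscale PNG so the example needs no input file."""
    ihdr = struct.pack(">IIBBBBB", 1, 1, 8, 0, 0, 0, 0)
    idat = zlib.compress(b"\x00\x00")  # filter byte 0 + one black pixel
    return (PNG_SIGNATURE + make_chunk(b"IHDR", ihdr)
            + make_chunk(b"IDAT", idat) + make_chunk(b"IEND", b""))


# "iris:prompt" is a hypothetical keyword chosen for this sketch.
tagged = add_text(tiny_png(), "iris:prompt", "A woman wearing sunglasses")
print(read_text(tagged))
```

tEXt only holds Latin-1 text, so a prompt with other characters would need an iTXt chunk (UTF-8) instead. Tools that list PNG text chunks should then be able to show the stored prompt, which is what the exiftool/grep check above is aiming for.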
[01:26:33] + brothchild or send your stuff to github just for discoverability
[01:26:43] + brothchild make main project page on your preferred host
[01:27:17] + brothchild would be glad to test / review
[01:30:31] + brothchild getting real performance wins will be difficult as there's lots of moving parts, can't just swap out cpu matmul for gpu, needs careful design with knowledge of the machine, the memory management etc
[01:31:28] + brothchild if it even gets some parts somewhat right it could be worth publishing
[01:32:28] + brothchild if it can look at other projects i think llama.cpp's vulkan implementation would be a good place to start
[01:32:47] + brothchild also stable-diffusion.cpp might be worth investigating
[01:34:25] + brothchild with flux i've seen very interesting 'speckles' when zoomed-in btw
[01:34:48] + brothchild should check the reference images
[01:46:11] + brothchild whaa
[02:02:22] + brothchild k i'll clone
[02:03:08] + brothchild ah it's no good as a bigpic upscaler, window for attention limited to like 640x480
[02:03:32] + brothchild End3r: do you have realesrgan-vulkan? can do pretty fast bigimage on your AMD
[02:03:54] + brothchild works well but i'm *sick* of realesrgan-look
[02:07:45] + brothchild End3r iris compiled fine here
[02:08:06] + brothchild what's a standard prompt and seed to use
[02:08:24] + brothchild will look at repo, maybe there's one he benchmarked
[02:10:09] + brothchild from SPEED.md he gives results for: ./flux -d flux-klein-4b -p "A woman wearing sunglasses" -o /tmp/bench.png -W 256 -H 256 -v --seed 42
[02:10:43] + brothchild well ./iris
[02:14:06] + brothchild what res does that make
[02:14:24] + brothchild probably 256
[02:14:51] + brothchild can also use finetuned qwen3 encoders but i don't know if that really helps
[02:15:16] + brothchild amazed this seems to be working
[02:19:20] + brothchild well i think i was already using openMP in old version. radeontop not showing any GPU use
[02:20:23] + brothchild i think we can do better with a 8bpp gguf
[02:20:27] + brothchild int8 o
[02:20:29] + brothchild or something
[02:21:19] + brothchild End3r version: time ./iris -d flux-klein-4b -p "A fluffy orange cat sitting on a windowsill" -o /tmp/bench.png -v --seed 42 user 27m1.734s
[02:21:49] + brothchild oh that's multiplying cores
[02:21:55] + brothchild real is 6m 33 seconds
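Editor's note: the "multiplying cores" remark refers to `user` time being CPU time summed over all threads, while `real` is wall-clock time. Dividing one by the other gives a rough average of how many cores were busy. A small sketch using the two figures quoted above (the `sys` time was not pasted, so it is ignored here; the helper name is invented for the example):

```python
import re


def parse_time(field: str) -> float:
    """Convert a `time`-style duration such as '27m1.734s' to seconds."""
    match = re.fullmatch(r"(?:(\d+)m)?\s*([\d.]+)s", field.strip())
    if match is None:
        raise ValueError(f"unrecognized duration: {field!r}")
    minutes = int(match.group(1) or 0)
    return minutes * 60 + float(match.group(2))


user_s = parse_time("27m1.734s")  # CPU seconds summed across threads
real_s = parse_time("6m33s")      # wall-clock seconds ("6m 33 seconds" in the log)
busy_cores = user_s / real_s

print(f"user={user_s:.1f}s real={real_s:.1f}s -> ~{busy_cores:.2f} cores busy on average")
```

With these numbers the ratio comes out at roughly 4.1, so the run was using several cores in parallel but well short of eight fully loaded threads.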
[02:22:12] + brothchild trying old version
[02:22:34] + brothchild i'm vaping tobacco:P
[02:22:41] + brothchild lemme show
[02:31:01] + brothchild 858d heat gun allows precise temp
[02:31:18] + brothchild used a broken flashlight housing for the outer shell (keeps the whole bowl surrounded by heat)
[02:31:51] + brothchild i've spent over 1k on various vapes over the years
[02:31:54] + brothchild this works better
[02:32:06] + brothchild $30 total
[02:32:35] + brothchild you can spend like $500 and get an e-shisha that does the same thing pretty much
[02:46:46] + brothchild alright that was interesting
[02:47:45] + brothchild the text encoding on some prompts is faster End3r
[02:47:53] + brothchild thanks!
[02:48:26] + brothchild time machine me pls systemd and kernel-dbus
[02:48:46] + brothchild let's convince poettering to become a painter
[04:49:37] + brothchild okay so far my optimization has taken Encoding Text... from 91 seconds to 1459 seconds
[04:50:04] + brothchild that was disabling gcc vectorization
[06:54:49] + brothchild well that's a bit unsettling
[11:24:05] + brothchild End3r: i got embeddings export going. can now gen different images with a cached embedding, saves the initial text parsing (about 90 seconds here)
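Editor's note: the embeddings export described above amounts to caching the text encoder's output so repeated generations skip the slow encoding step. The log does not show how iris stores them, so the following is only a generic sketch of the idea: `cached_embedding`, `fake_encode`, and the JSON-on-disk format are all assumptions made for the example.

```python
import hashlib
import json
import tempfile
from pathlib import Path


def cached_embedding(prompt, encode, cache_dir):
    """Return the embedding for `prompt`, running `encode` only on a cache miss."""
    key = hashlib.sha256(prompt.encode("utf-8")).hexdigest()
    path = Path(cache_dir) / f"{key}.json"
    if path.exists():
        return json.loads(path.read_text())
    embedding = encode(prompt)
    path.write_text(json.dumps(embedding))
    return embedding


calls = []


def fake_encode(prompt):
    """Placeholder for the slow text encoder; returns a dummy vector."""
    calls.append(prompt)
    return [float(len(prompt)), 0.5]


with tempfile.TemporaryDirectory() as cache_dir:
    first = cached_embedding("A fluffy orange cat", fake_encode, cache_dir)
    second = cached_embedding("A fluffy orange cat", fake_encode, cache_dir)

print(f"encoder calls: {len(calls)}, same result: {first == second}")
```

A real cache would probably also want the encoder or model version in the key, since an embedding from one text encoder is not valid for another.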