Roaster
Other
Kloak, A secret manager that keeps K8s workload away from secrets

Show HN: Kloak, A secret manager that keeps K8s workload away from secrets

Revenue N/A
Developer Tools
Tiao, A two-player turn-based board game

Hi HN, I built this digital version of Tiao, a two-player turn-based strategy board game. Think Checkers meets Go. It's free, runs in the browser, and has multiplayer, AI, an over-the-board mode, and a lot of other neat things. The source is on GitHub (AGPL). The game was originally designed by my friend Andreas Edmeier. He created the rules and has been playtesting and refining the game design for years. I built the website for it, writing the core in about 2 weeks using TypeScript, Next.js, Express, WebSockets, and MongoDB. It is fully dockerized and deployed on a Hetzner VPS with Coolify, with authentication handled by better-auth. It has real-time gameplay, Elo matchmaking, OpenPanel analytics, and a fully functional achievements system. Play it: https://playtiao.com Source: https://github.com/trebeljahr/tiao Happy to answer questions about the tech, the game design, or anything else. My hope is that more people will play this game, because I think it is genuinely fun, and it would be cool to one day see people play it on a Go board or on their phones/computers. Have a good one.

Revenue N/A
Design
Startup Equity Adventure Game

I put this together (with Claude) as a semi-gamified way for folks to learn about startup equity. Take a look, and share your scorecard :)

Revenue N/A
AI Tools
Time Pin

Hi! Any history nerds here? I made Time Pin, a little game inspired by GeoGuessr but history-themed. You can play it here (it works on both desktop and mobile). Any feedback is appreciated: https://www.crazygames.com/game/time-pin Now some details: The goal is to guess the time and place that a character is from. You base your guess on some environmental photos and on questions that you can ask the character (there are 12 questions, but you can only ask 5, so you have to choose carefully). The closer you are, the more points you get. At the end, a portrait of the character is revealed, along with educational resources for learning more about their culture and era (articles, videos, podcasts, etc.). The game only has 5 levels currently, but I hope to have over 100 someday. It’s tough to create levels because each one requires some research, plus generating photos with AI (AI is necessary; otherwise we’d only have photos starting from the 19th century, when the camera was invented). My goal for the game was to create a challenge, and also maybe spark some curiosity about history.

Revenue N/A
Developer Tools
Utilyze

The standard GPU utilization metric reported by nvidia-smi, nvtop, Weights & Biases, Amazon CloudWatch, Google Cloud Monitoring, and Azure Monitor is highly misleading. It reports the fraction of time that any kernel is running on the GPU, which means a GPU can report 100% utilization even if only a small portion of its compute capacity is actually being used. In practice, we've seen workloads with ~1–10% real compute throughput while dashboards show 100%. This becomes a problem when teams rely on that metric for capacity planning or optimization decisions, because it can make underutilized systems look saturated. We're releasing an open-source (Apache 2.0) tool, Utilyze, to measure GPU utilization differently. It samples hardware performance counters and reports compute and memory throughput relative to the hardware's theoretical limits. It also estimates an attainable utilization ceiling for a given workload. GitHub link: https://github.com/systalyze/utilyze We'd love to hear your thoughts!
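The gap between the two kinds of metric can be illustrated with a toy calculation. This is only a sketch with made-up numbers, not Utilyze's implementation: the `Sample` type, the `flops_achieved` field, and the 100 TFLOP/s peak are all hypothetical stand-ins for whatever hardware counters and limits a real tool would read.

```python
from dataclasses import dataclass


@dataclass
class Sample:
    """One sampling interval (hypothetical structure, for illustration only)."""
    kernel_active: bool    # was any kernel running during this interval?
    flops_achieved: float  # compute throughput observed in the interval, FLOP/s


def time_based_utilization(samples):
    """Fraction of intervals in which any kernel was running."""
    return sum(s.kernel_active for s in samples) / len(samples)


def throughput_utilization(samples, peak_flops):
    """Mean achieved throughput as a fraction of the theoretical peak."""
    return sum(s.flops_achieved for s in samples) / (len(samples) * peak_flops)


# Assumed numbers: a kernel is always resident, but it only reaches
# 5 TFLOP/s on a device with an assumed 100 TFLOP/s theoretical peak.
PEAK_FLOPS = 100e12
samples = [Sample(kernel_active=True, flops_achieved=5e12)] * 10

print(time_based_utilization(samples))              # 1.0
print(throughput_utilization(samples, PEAK_FLOPS))  # 0.05
```

With these assumed inputs the time-based figure reads 100% while the throughput-based figure reads 5%, which is the kind of discrepancy the post describes.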

Revenue N/A
Productivity
Plate

Show HN: Plate – The fastest way to run projects without becoming a PM tool

Revenue N/A
Developer Tools
zot

Why I built another coding agent harness: https://dev.to/patriceckhart/zot-why-i-built-another-coding-... GitHub repo: https://github.com/patriceckhart/zot

Revenue N/A
SaaS
Live Sun and Moon Dashboard with NASA Footage

Show HN: Live Sun and Moon Dashboard with NASA Footage

Revenue N/A
AI Tools
Figma alternative where AI works with vector primitives, not code

Show HN: Figma alternative where AI works with vector primitives, not code

Revenue N/A
AI Tools
SyncVibe

Show HN: SyncVibe – Code with friends in the terminal, each with your own AI

Revenue N/A
Other
Rip.so

Show HN: Rip.so – a graveyard for dead internet things

Revenue N/A
AI Tools
A new benchmark for testing LLMs for deterministic outputs

When building workflows that rely on LLMs, we commonly use structured output for programmatic use cases like converting an invoice into rows, meeting transcripts into tickets, or even complex PDFs into database entries. The model may return the schema you want, but with hallucinated values, such as an `invoice_date` that is off by 2 months or a transcript array in the wrong order. The JSON is valid, but the values are not. Structured output today is a big part of using LLMs, especially when building deterministic workflows. Current structured output benchmarks (e.g., JSONSchemaBench) only validate the pass rate for JSON schema and types, not the actual values within the produced JSON. So we designed the Structured Output Benchmark (SOB), which fixes this by measuring the JSON schema pass rate, the types, and the value accuracy across all three modalities: text, image, and audio. For our test set, every record is paired with a JSON Schema and a ground-truth answer that was verified against the source context manually by a human and cross-checked by an LLM, so a missing or hallucinated value is counted as wrong. Open source is doing pretty well, with GLM-4.7 coming in at number 2, right after GPT-5.4. We noticed that the rankings shift across modalities: GLM-4.7 leads text, Gemma-4-31B leads images, and Gemini-2.5-Flash leads audio. For example, GPT-5.4 ranks 3rd on text but 9th on images. Model size is not a predictor, either: Qwen3.5-35B and GLM-4.7 beat GPT-5 and Claude-Sonnet-4.6 on Value Accuracy. Phi-4 (14B) beats GPT-5 and GPT-5-mini on text. Structured hallucinations are the hardest bug. Such values are type-correct, schema-valid, and plausible, so they slip through most guardrails. For example, in one audio record, the ground truth is "target_market_age": "15 to 35 years", and a model returns "25 to 35". This is invisible without field-level checks.
Our goal is to be the best general model for deterministic tasks, and a key aspect of determinism is a controllable and consistent output structure. The first step to making structured output better is to measure it and hold ourselves against the best.
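The difference between a schema-level check and a field-level check can be sketched in a few lines. This is a minimal illustration, not SOB's scoring code: exact-match comparison is an assumption, the helper names are hypothetical, and the `product_name` field is invented to pad out the record. Only the `target_market_age` values come from the example above.

```python
def types_valid(record, schema_types):
    """Schema-style check: every expected key is present with the expected type."""
    return all(
        key in record and isinstance(record[key], expected)
        for key, expected in schema_types.items()
    )


def value_accuracy(predicted, truth):
    """Fraction of ground-truth fields reproduced exactly (assumed metric)."""
    correct = sum(1 for key, value in truth.items() if predicted.get(key) == value)
    return correct / len(truth)


def mismatched_fields(predicted, truth):
    """Names of fields whose predicted value differs from the ground truth."""
    return [key for key, value in truth.items() if predicted.get(key) != value]


schema_types = {"target_market_age": str, "product_name": str}
truth = {"target_market_age": "15 to 35 years", "product_name": "Example"}
predicted = {"target_market_age": "25 to 35", "product_name": "Example"}

print(types_valid(predicted, schema_types))  # True
print(value_accuracy(predicted, truth))      # 0.5
print(mismatched_fields(predicted, truth))   # ['target_market_age']
```

The predicted record passes the type check but gets only half of the values right, which is the failure a schema-only benchmark would not surface.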

Revenue N/A