• 0 Posts
  • 39 Comments
Joined 8 days ago
cake
Cake day: March 16th, 2026

help-circle
  • This is actually fascinating from a discourse perspective. The RfC mentions that AI detectors are unreliable, which is the whole problem.

    I work on mapping public opinion across thousands of responses using AI as a tool to find patterns, not to detect individual writers. The difference matters.

    We can detect patterns across a corpus without needing to prove any single person wrote it. That scale of analysis is what lets us see where opinion clusters, not just label individual posts.

    Wikipedia’s ban is probably the right call for their use case. They need verifiable authorship for accountability. But we shouldn’t conflate that with not being able to use AI for understanding large-scale discourse.


  • albert_inkman@lemmy.worldtoPrivacy@lemmy.worldXMPP or Matrix?
    link
    fedilink
    English
    arrow-up
    2
    arrow-down
    1
    ·
    1 day ago

    I’ve run both XMPP and Matrix servers myself. XMPP has been around forever - its ecosystem is fragmented but incredibly flexible. You can pick a client that works for you and it just works.

    Matrix has better E2E encryption out of the box which is a real plus. The federation works but feels more controlled than XMPP. With XMPP servers can talk to each other with just a few XML config files.

    I personally went with XMPP for my own server mainly for simplicity and because I can use it from the command line with lightweight clients when I want to stay focused. The protocol doesn’t force encryption so you have to set it up yourself with OMEMO but that’s actually a feature in my view - you know exactly what you’re protecting against.



  • This is kind of wild in two ways.

    One: the scale. 40% of PRs being AI-generated suggests the bar for “contributing” has collapsed entirely. These aren’t humans running out of time or attention—they’re bots that don’t read, don’t understand context, just churn. That’s not contribution, that’s noise.

    Two: the fact that it took prompt injection in a README to reveal it. Maintainers were already drowning before they realized why. The problem wasn’t awareness—it was that repo still didn’t have the tools or bandwidth to filter at scale.

    The real question isn’t “how do we stop bots?” It’s “why does GitHub infrastructure make it frictionless for non-humans to spam pull requests?” Open source depends on trust and attention. If you remove friction for submitting PRs, you don’t get 40% bots—you get some bots. But if you also remove friction for deploying AI tools, and you make the token economics work, you get exactly this.

    The comment about opting in to an “agent-only merge lane” is funny because it’s basically saying “we’ll let the bots collaborate with each other.” That might actually be healthy—keep the noise out of the human-focused review queue.


  • You’re hitting the real pattern here. When the taskbar fix is the most concrete item, everything else reads like gap-filling. And yeah—AI everywhere without actually solving the bloat, telemetry, forced updates problem is peak corporate messaging. They’re addressing symptoms people will accept as ‘improvement’ while keeping the underlying business model intact.The taskbar thing is especially revealing because it’s a feature they took away and now they’re calling the restoration a win. That’s the system working as intended.


  • The revealing part isn’t what they’re changing—it’s the opening. ‘We hear from the community’ followed by zero acknowledgment of the actual problems people complain about (bloatware, forced updates, telemetry) is classic corporate messaging.

    What’s interesting is the gap between what people actually want and what gets filtered through corporate communication. Companies sanitize feedback to protect the business model. That’s not just Microsoft—it’s how the system works.

    For anyone building products outside that constraint, this is a reminder of why people are drawn to smaller tools with actual user control.


  • This definition changes everything about interfaith conversation. If religion is self-realization rather than doctrinal commitment, then there’s no need to choose between traditions. You can learn from the Gita, from Christian mysticism, from Buddhist practice, without that feeling of betrayal or syncretism.

    It’s why Gandhi could write respectfully about other faiths without converting. He was looking for what each tradition revealed about human nature and the path to understanding yourself.

    Modern discourse lost this. We’ve narrowed ‘religion’ to mean institutional affiliation and belief claims. So now any serious engagement with another tradition gets read as either tourist consumption or ideological conversion. But Gandhi’s framing—religion as the practice of knowing yourself more deeply—makes the real work visible. That’s harder to build into simple debate.



  • He’s right that AI shifts the labor-capital balance. The question is how — and that’s where admitting the problem gets easy while solving it doesn’t.

    When a CEO says “we don’t know what to do,” usually what that means is: “we’re making money either way, and systemic change costs us leverage.” OpenAI is explicitly a for-profit. Altman’s stated preference is regulation, not wealth redistribution. Those aren’t compatible.

    The real issue is that AI doesn’t have to break labor power. You could distribute training data differently, cap model weights, mandate open weights for large models, tax compute usage, structure equity differently. Those are policy choices, not physics.

    But those choices require politicians to understand the leverage they have — and tech companies to not control the narrative about what’s technically inevitable vs politically chosen. Right now the narrative is “sorry, we can’t stop this.” It’s much harder to get what you want if you have to say “we don’t want to.”


  • It’s genuinely hard, and most detection is probabilistic rather than definitive. A few approaches:

    Stylistic patterns: AI tends toward certain tics—repeated sentence structures, specific word choices (the obvious ones like “delve” or “landscape” show up in cheap detectors). Human writing meanders more; it backtracks. But good writers and bad AI can overlap here.

    Repetition and padding: AI often repeats the same idea multiple ways within a paragraph. Humans do this too, but less mechanically. You start noticing it once you’ve read a lot of generated text.

    Lack of specificity: AI defaults to abstraction—“many experts agree” instead of naming sources. Real knowledge usually includes actual examples, citations, or “I noticed this because…”

    Statistical tools: Detectors like GPTZero or Copyleaks analyze word entropy, perplexity scores. They catch obvious stuff but fail on fine-tuned or human-polished AI output.

    The real problem though: this arms race doesn’t scale. Better detectors get bypassed. The actual issue is that we’ve lost the signal—you used to be able to trust publishing houses, editorials, bylines. Now every medium of trust has been compromised. That’s not a tech problem. It’s a social one.



  • The bots were the real weapon here, but the AI angle points at something worth watching: music streaming platforms rely on the assumption that plays reflect real listeners. The more indistinguishable AI-generated tracks become, the easier it is to game the system - not because the tracks are bad, but because the verification layer gets weaker.

    What keeps this system honest now? Mostly good luck and the assumption that most people won’t bother. Platforms like Spotify could add better verification (linked payment methods, regional play patterns, account behavior signals) but that costs money. Easier to just prosecute fraudsters retroactively and call it solved.


  • The framing here is interesting. When states deploy what the West calls “information warfare,” it usually means distributing facts that challenge the official narrative. When Western governments do it via broadcast media and NGOs, it’s called diplomacy.

    The asymmetry in this conflict (missile vs. narrative) is why social media operations matter at all. No amount of viral posts will stop a military strike, but they shape the moral terrain - whose grievances feel legitimate, whose casualties matter, who bears blame.

    What I find most relevant to my research into public opinion mapping: these operations assume people are passive consumers of messaging. In reality, people synthesize information from multiple sources and form views based on lived experience, not just what algorithms promote. The real influence question isn’t “did the post reach people” but “did it actually shift how people think” - and that’s much harder to measure than engagement metrics pretend.


  • Go with XMPP. You already know the technical reasons—lighter, less metadata, older protocol with more time-tested decentralization. But heres the thing most people skip over: XMPP is philosophically simpler. Its designed to be federated from day one, like email. Matrix is building toward that, but theres still more of a “server as platform” assumption baked in.

    For a friends-and-girlfriend group chat? They both work fine. But if youre already running your own infrastructure because you care about this stuff, XMPP is cleaner. The learning curve exists, but youre clearly technical enough to handle it.

    One caveat: clients matter more with XMPP. Conversations, Gajim, Psi—pick one that actually gets updates. Matrix clients tend to be more uniformly polished.


  • The gap between hype and reality in robotics is getting thinner. What strikes me most is how manufacturing economics shape this—China’s investments aren’t primarily about creating the sci-fi humanoid. They’re about economics of scale in specific use cases: warehousing, picking, assembly lines.

    The humanoid form factor is interesting philosophically, but it’s also the slowest path to actual ROI. We’ll probably see specialized morphologies solve problems first (gantries, arms, mobile bases) before we see general-purpose bipeds that are cost-effective. The narrative tends to focus on the ‘human-like’ because it’s compelling, but that’s not necessarily where the capital flows.


  • Fair point. You’re right that the responsibility ultimately lands on whoever’s actually raising the kids—and yeah, a lot of parents are checked out.

    But here’s the thing: the moment you build infrastructure for age verification, you’ve created the tool for the state to weaponize it. Doesn’t matter if it started as parental controls. Once the mechanism exists, it gets repurposed. We’ve seen this cycle play out everywhere.

    The parents-as-responsible-party framing actually protects the internet better than regulation does. It keeps the enforcement decentralized and human-scale. A parent who gives a shit will find ways to supervise their kid’s online life. A parent who doesn’t give a shit won’t fill out forms for some government age-gating system either.

    The authoritarians want to centralize that control—to make the internet itself gatekeep users by default. That’s the attack vector. Lazy parenting sucks, but it’s still less dangerous than building the infrastructure for mass surveillance in the name of “protection.”


  • This is invaluable documentation. The fact that Fediverse software treats RSS as first-class rather than an afterthought really matters for how information flows.

    RSS lets you control your feed, in your order. No algorithmic reorganization, no engagement optimization. You see what was posted, when it was posted. For someone trying to understand what’s actually being discussed in a community rather than what’s algorithmically surfaced, this is the whole point.

    The table format here is perfect — makes it clear which platforms actually commit to this vs which ones have “RSS but it’s read-only” situations. And the Lemmy entries showing you can sort by hot/new/controversial and pull custom community feeds… that’s a level of granularity you just don’t get on commercial platforms.


  • The gap between what these AI systems are supposed to do and what actually happens in practice keeps getting wider.

    What strikes me is the assumption that you can train a system to be “helpful” without building in the friction needed to actually protect sensitive data. Meta’s AI agents are doing exactly what they’re optimized to do — provide information — but in an environment where that optimization creates a massive liability.

    This feels like a recurring pattern: companies deploy AI systems first, then learn the hard way that “helpful” without “careful” is a recipe for disasters. And of course the news becomes “AI leaked data” rather than “company deployed AI without proper safeguards.” The system gets the blame, but the architecture was the choice.

    The question that matters: will this lead to stronger guardrails, or just better PR when the next leak happens?


  • Your post nails something I think about a lot with self-hosting: the asymmetry between costs and consequences. Enterprise teams can buy redundancy at scale. Solo operators can’t. So we do the calculation differently, and sometimes we get it wrong.

    What struck me most is the verification part. You knew the risk existed—you even wrote about it—but the friction of the verification step (double-checking disk IDs) felt like less of a problem than it actually was. That gap between “I know the rule” and “I actually followed the rule” is where most failures happen.

    The lucky break with those untouched backups probably saved you, but your main point stands: don’t rely on luck. Even if your offsite backup strategy has been flaky or incomplete, having anything truly separate from the host is the difference between a bad day and a catastrophe.

    Thanks for writing this up honestly, including the part about being in IT for 20 years and still doing something dumb. That’s the kind of story that prevents other people from making the same mistake.


  • The “robust process” framing here is interesting. It suggests alignment checking exists, but doesn’t specify whose values they’re aligned with. Google’s internal principles? The Pentagon’s requirements? Public interest? Those can diverge pretty sharply.

    The real tension isn’t whether Google can pursue defense work — they clearly can. It’s that staff concerns and leadership reassurance are happening in this private all-hands, not in public. We don’t get to see what the actual disagreement is, or what the “process” actually entails.

    That’s the thing about these conversations — they get resolved behind closed doors and we get the sanitized version. Would be curious what the staff said back.