• 0 Posts
  • 226 Comments
Joined 2 years ago
cake
Cake day: July 5th, 2024

help-circle







  • but there were obviously going to be either a number of false positives or negatives that make it less valuable as such.

    It really comes down to the specifics of the technology and how it’s implemented. The system I worked on was astonishingly good. Even the local police of a particular installment wanted to do a test of the system (basically walk around with various guns of all different sizes) and they were stunned at how well it worked.

    The funny thing was that we built the system to work 100% locally, and we even insisted on air-gapped networks (but wasn’t a requirement). The amount of people and companies who asked if we could connect the system to the internet for easier access was worrying. Whenever we tried to explain the basics of data security and the potential issues we were just looked at like we were nuts.


  • The cheap system I have with a Google Coral and FOSS software

    I’m guessing you’re using Frigate?

    Having such systems as a later if defense is good. As the only defense, not so much.

    Agreed. The system I had developed was built explicitly as a human-in-the-loop system. It never made any decisions on its own. It was just a tool to enable the existing security staff to have better visibility. That’s it.

    You can make whatever argument you want about viability and efficacy. The only point I’m making is that our system was just an additional tool for security to use; not the only one.


  • There are several examples of exactly what I said

    No one is denying the existence of vision based LLM models. The issue is performance. It takes in the order of double (or even triple) digit seconds to process an image through an LLM. Even if it took a single second to process an image using decent server-grade hardware (which starts at about $10k per card), that’s way too much and still not fast enough.

    On just 10 cameras at a facility it would require north of $100k on just GPUs alone.

    Whereas a specialized computer vision model could process several dozen camera streams, in real-time, on just one of those $10k cards.

    An LLM would process an image in 10 seconds (generous) whereas a computer vision model operates in the milliseconds. We’re talking about a 1000x difference in required processing power.

    That’s why you’re wrong and have zero clue what you’re talking about.

    You’re arguing that that family uses a fully loaded semi-trailer to go 200m to the local park. It’s a clueless and asinine argument.




  • you could paint marker “not a gun” on the side of a gun and guess what would happen.

    It would flag it as a gun. How do I know? I worked on and developed a similar system at one point. It worked extremely well. We weren’t an American company and ultimately covid killed us (it was US American orgs that were the most interested in our stuff).

    It has some uses, but 95% of what is being used for and 100% of the data centers aren’t it.

    Do you think LLMs are being used for this sort of thing? Putting aside the sheer technical mountain of a hurdle that slapping an LLM vision model on top of dozens and dozens of real-time camera streams, the hardware requirements would put the company out of business before they made their first sale.

    Computer vision models, which are NOT LLMs, have been around for quite a while now and are very good at doing one thing and one thing only. And they’ll do it well for a miniscule fraction of what it takes to run an LLM.

    No, datacentres are not being used for real-time gun detection. The company might have other kinds of infrastructure located in a DC, but not the main video processing hardware.