11 Matching Annotations
- Jul 2025
-
-
In today’s fast-moving, AI-powered era, autonomous agents are playing a bigger role than ever. They are helping businesses run smoother and making decisions affecting millions of lives every day. While these systems are designed to make our lives easier and unlock new opportunities, we can’t get carried away—we need to implement proper AI Agent Evaluation frameworks and best practices to ensure these systems actually work as intended and follow ethical AI principles.
Explore the key metrics, tools, and frameworks used for AI agent evaluation. Learn how to assess performance, reliability, and efficiency of AI agents in real-world scenarios.
-
- Oct 2024
-
burtlo.github.io burtlo.github.io
-
There are two things that I enjoy: a test framework written in my own Domain Specific Language (DSL) that is easily understood by all those on a project and the ability for all participants to easily read, search, and view the tests.
-
- Dec 2023
-
developer.chrome.com developer.chrome.com
- Feb 2023
-
www.edwinwenink.xyz www.edwinwenink.xyz
-
What we ultimately should care about is being able to use our knowledge to produce something new, whatever that may be. To not merely reproduce you must understand the material. And understanding requires application, a hermeneutic principle that particularly Gadamer worked out extensively. If you really want to measure your level of understanding, you should try to apply or explain something to yourself or someone else.
-
- Jan 2023
-
optimize.google.com optimize.google.com
-
www.inspectlet.com www.inspectlet.comHome1
-
I've used this briefly before, but it's also something Scott Scheper swears by.
alternative: https://optimize.google.com/
Tags
Annotators
URL
-
- Jun 2021
-
docs.gitlab.com docs.gitlab.com
-
QueryRecorder is a tool for detecting the N+1 queries problem from tests.
-
- Jul 2020
- Feb 2020
-
github.com github.com
- Nov 2019
-
www.npmjs.com www.npmjs.comts-mocha1
- Jun 2015
-
news.ycombinator.com news.ycombinator.com
-
If you want a DIY version of this, try ImageMagick's compare command: compare bag_frame1.gif bag_frame2.gif compare.gif Documentation: http://www.imagemagick.org/Usage/compare/ reply no_gravity 2 hours ago Interesting. And to render the html you can use cutycapt. Which is in the Debian repos too. I just tried it, and this rendered a nice diff of two pages for me: cutycapt --url=http://www.xyz.com/1 --out=1.png cutycapt --url=http://www.xyz.com/2 --out=2.png compare 1.png 2.png diff.png reply easytiger 2 hours ago Now make it a webservice and charge $200/month! reply programmernews3 1 hour ago First part already done: http://archive.is/ reply prottmann 1 hour ago apt-get install the services and use them on my server ;-) reply
Plain and simple way to report visual degradation of website or webapp
Tags
Annotators
URL
-