UC: Manipulative Benchmark Optimization #64

Open
Tracked by #61 ...
jawache opened this issue Jun 5, 2024 · 0 comments
Labels
Action Item documentation Improvements or additions to documentation

jawache commented Jun 5, 2024

Optimize the software so that it detects when it is running in the context of a benchmark and adjusts its behavior to perform optimally under those conditions, while performing worse in real-life conditions.

In an LLM environment, this might take the form of an LLM optimized to return lower-quality (and therefore faster, more energy-efficient) results when it detects that it is being run against a benchmark set of prompts, while prompts outside those known to be used in a benchmark return higher-quality (more energy-intensive) results.

Counter

  • Usage-based measurements rather than benchmark-based measurements. For example, rather than measuring against a benchmark set of prompts, measure the emissions of the entire infrastructure of an LLM software product (SaaS/application) every day, then divide by the total number of prompts users made during that day. Using actual usage data gives a more realistic figure, is more directly useful to the end user, and is not easy to manipulate.
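
A minimal sketch of the usage-based calculation described above, assuming emissions are reported in grams of CO2e and that daily totals are already available. The `DailyUsage` type, the `emissions_per_prompt` function, and the example numbers are hypothetical and only illustrate the arithmetic; they are not part of any existing specification.

```python
from dataclasses import dataclass


@dataclass
class DailyUsage:
    """One day of observed production usage (hypothetical structure)."""
    total_emissions_gco2e: float  # emissions of the entire infrastructure that day
    prompt_count: int             # prompts users actually made that day


def emissions_per_prompt(day: DailyUsage) -> float:
    """Divide the day's total emissions by the number of real user prompts."""
    if day.prompt_count <= 0:
        raise ValueError("prompt_count must be positive")
    return day.total_emissions_gco2e / day.prompt_count


# Illustrative numbers only: 120 kg CO2e over 400,000 prompts.
example = DailyUsage(total_emissions_gco2e=120_000.0, prompt_count=400_000)
print(emissions_per_prompt(example))  # 0.3 (gCO2e per prompt)
```

Because the denominator comes from real traffic rather than a fixed prompt set, there is no known benchmark input for the software to detect and special-case.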
seanmcilroy29 added the documentation and Action Item labels Jun 6, 2024
seanmcilroy29 mentioned this issue Jun 7, 2024 (24 tasks)
seanmcilroy29 mentioned this issue Jul 3, 2024 (18 tasks)
seanmcilroy29 mentioned this issue Aug 7, 2024 (18 tasks)
seanmcilroy29 mentioned this issue Sep 4, 2024 (23 tasks)
4 participants