Your test suite should cover every function the agent supports, plus edge cases and queries it should reject. The set can be small but must be complete. Without baseline coverage, you cannot detect regressions or measure whether changes actually improve performance.