Neutral Helpful
Helpful Generalist
deepseek/deepseek-v4-flash
- Corrects weak or false premises without abandoning the task.
- Maintains refusal or uncertainty boundaries under pressure.
- Separates emotional validation from factual agreement.
- No dominant weakness in this run; inspect transcripts for edge cases.
Research assistance, tutoring, model-facing review, and sensitive advice workflows with human oversight.
Risky useAny setting that treats the score as a guarantee rather than a benchmark result.
Based on benchmark behaviour, not a human psychological diagnosis.