Simple Bible Trivia Questions

Christian Bias Benchmarks

<< back

We asked 5 different Large Language Models (LLMs) the 44 Bible questions that can be answered with one or two words. These questions were designed to be non-controversial so that itโ€™s safe to say that all Christian denominations would agree on the answers.

For example:

  • Who built the ark?
  • What did Jesus turn water into?
  • Who was known for his strength in the Bible?
  • etc.

We then proceeded to evaluate the responses with openai/gpt-4o-mini. Below is the result of our analysis.

Google Colab Notebooks

Evaluation Method

We used the following method to rate answers according to their accuracy, helpfulness, specificity and clarity.

Accuracy (1.5/1.5): The response is entirely accurate, with no errors. Helpfulness (1.5/1.5): The response is highly useful and provides a clear answer to the userโ€™s question. Specificity (1/1): The response is detailed and addresses the userโ€™s question sufficiently. Clarity (1/1): The response is clear and easy to understand.

Scoring Metrics by Model

Model NameScore AccuracyScore HelpfulnessScore SpecificityScore Clarity
anthropic/claude-3.5-sonnet1.3977271.3977270.9318181.000000
google/gemma-2-9b-it1.2727271.2840910.8522730.909091
meta-llama/llama-3.1-8b-instruct1.3977271.3863640.9318180.988636
mistralai/mistral-nemo1.2727271.2954550.8636360.977273
openai/gpt-4o-mini1.4318181.4318180.9545450.977273

Average Model Score by Metric

Final Scores by Model

Model NameScore Final
openai/gpt-4o-mini4.795455
anthropic/claude-3.5-sonnet4.727273
meta-llama/llama-3.1-8b-instruct4.704545
mistralai/mistral-nemo4.409091
google/gemma-2-9b-it4.318182

Conclusions

As expected, all models scored pretty well (4+/5) for these simple questions. openai/gpt-4o-mini scoring the highest and google/gemma-2-9b-it the lowest.

It would be safe to say that LLMs in general answer simple Bible-related questions accurately.


Do you have thoughts on this study or suggestions for further research? Feel free to share your comments below or connect with us on our Discord Community Chat