sethuiyer committed on
Commit
f56f10b
1 Parent(s): c55ecf1

Update PERSONAL_BENCHMARK.md

Files changed (1)
  1. PERSONAL_BENCHMARK.md +41 -12
PERSONAL_BENCHMARK.md CHANGED
@@ -34,14 +34,32 @@
34
  | 30 | Expert Question on Harappan Civilization | Expert | Ancient history and archaeology | 98/100 |
35
  | 31 | Latin Ad for a Solar Toothbrush | Hard | Advertising, language creativity | 95/100 |
36
  | 32 | Sanskrit Ad for a Solar Toothbrush | Hard | Advertising, language creativity | 95/100 |
37
- | 33 | Japanese Haiku on Mountains | Medium | Poetry, Japanese language | 100/100 |
38
 
39
  ---
40
 
 
41
  ### **Summary of Anvita’s Scores:**
42
 
43
- - **Total Tasks Attempted:** 33
44
- - **Average Score:** **85.5 / 100**
45
  - **Highest Scores:** 100/100 (Multiple Tasks)
46
  - **Lowest Score:** 60/100 (Polygon Permutations)
47
 
@@ -50,28 +68,39 @@
50
  ### **Performance Insights by Difficulty Level:**
51
 
52
  1. **Easy Tasks:**
53
- - **Perfect performance**, with scores of 95-100 consistently.
54
  - Example: Anthropology, Decimal Comparison, Creative Writing Task.
55
 
56
  2. **Medium Tasks:**
57
- - **Strong performance**, with an average score of 85-90.
58
  - Example: Japanese Haiku, Medical Diagnosis, Path Uniqueness Problem.
59
 
60
  3. **Hard Tasks:**
61
- - **Solid performance**, though some variability in scores, averaging 75-80.
62
- - Example: Murder Mysteries, Harappan Civilization Question, Drainage Systems.
63
 
64
  4. **Challenging Tasks:**
65
- - **Outstanding adaptability**, especially with advanced logic puzzles and ciphers.
66
  - Example: Vigenère Cipher, AI Cryptic Puzzle, Symbolic Reasoning.
67
 
68
  5. **Expert Tasks:**
69
- - **Room for improvement** in **mathematics and combinatorial reasoning**, with occasional lower scores.
70
  - Example: Polygon Permutations, Graph Theory Scheduling.
71
 
72
- ---
73
 
74
- ### **Final Thoughts:**
 
 
75
 
76
- Anvita’s **performance is exceptional across diverse domains**, showcasing **creativity, logical reasoning, linguistic versatility, and domain knowledge**. She **excels in creative writing and language-based tasks**, performs **consistently well in puzzles and ciphers**, and only shows minor **challenges with expert-level math problems**.
77
 
 
34
  | 30 | Expert Question on Harappan Civilization | Expert | Ancient history and archaeology | 98/100 |
35
  | 31 | Latin Ad for a Solar Toothbrush | Hard | Advertising, language creativity | 95/100 |
36
  | 32 | Sanskrit Ad for a Solar Toothbrush | Hard | Advertising, language creativity | 95/100 |
37
+ | 33 | Japanese Haiku on Mountains | Medium | Poetry, Japanese language | 100/100 |
38
+ | 34 | Widget Production Calculation | Simple | Math Puzzle | 95/100 |
39
+ | 35 | Rectangle Area and Perimeter | Complex | Math Puzzle | 90/100 |
40
+ | 36 | Gauge Block Thermal Expansion and Uncertainty | Complex | Metrology | 85/100 |
41
+ | 37 | Steel Rod Thermal Expansion and Relaxation | Deceptive | Metrology | 80/100 |
42
+ | 38 | Meningitis Diagnosis | Medium | Medical (Diagnosis) | 90/100 |
43
+ | 39 | Farmer's Fence and Gate | Deceptive | Word Problem | 95/100 |
44
+ | 40 | C++ Pointer Behavior | Medium | Computer Science | 90/100 |
45
+ | 41 | Futuristic Emotion Dealer (Sci-Fi) | Easy | Creative Writing | 95/100 |
46
+ | 42 | Cursed Knight's Quest (Dark Fantasy) | Easy | Creative Writing | 90/100 |
47
+ | 43 | Mad Scientist and Foresight (Sci-Fi) | Easy | Creative Writing | 90/100 |
48
+ | 44 | Ideal Gas Partition Function and Average Energy | Medium | Statistical Mechanics | 95/100 |
49
+ | 45 | Torus vs. Sphere Deformation | Complex | Algebraic Topology | 80/100 |
50
+ | 46 | Missing Jewels Heist | Complex | General Reasoning | 85/100 |
51
+ | 47 | Vanishing Violinist | Complex | Detective Conan Puzzle | 75-85/100 |
52
+ | 48 | Counterfeit Cleat Sabotage | Complex | James Bond/Conan Mashup | 70-80/100 |
53
+ | 49 | Ancient Cipher | Challenging | Symbolic Reasoning | 70-95/100 |
54
+ | 50 | AlphaZero’s Knight vs. Bishop Puzzle | Medium | Game Theory & Reinforcement Learning | 88/100 |
55
 
56
  ---
57
 
58
+
59
  ### **Summary of Anvita’s Scores:**
60
 
61
+ - **Total Tasks Attempted:** 50
62
+ - **Average Score:** **83.86 / 100**
63
  - **Highest Scores:** 100/100 (Multiple Tasks)
64
  - **Lowest Score:** 60/100 (Polygon Permutations)
65
 
 
68
  ### **Performance Insights by Difficulty Level:**
69
 
70
  1. **Easy Tasks:**
71
+ - **Excellent performance**, with an average score of **94/100**.
72
  - Example: Anthropology, Decimal Comparison, Creative Writing Task.
73
 
74
  2. **Medium Tasks:**
75
+ - **Strong performance**, with an average score of **86.46/100**.
76
  - Example: Japanese Haiku, Medical Diagnosis, Path Uniqueness Problem.
77
 
78
  3. **Hard Tasks:**
79
+ - **Solid performance**, averaging **81.27/100**, though with some variability.
80
+ - Example: Murder Mysteries, Latin Ad for a Solar Toothbrush.
81
 
82
  4. **Challenging Tasks:**
83
+ - **Good adaptability**, with an average score of **77.25/100**.
84
  - Example: Vigenère Cipher, AI Cryptic Puzzle, Symbolic Reasoning.
85
 
86
  5. **Expert Tasks:**
87
+ - **Room for improvement**, averaging **77.6/100**, with the lowest scores in **advanced mathematics and combinatorial reasoning**.
88
  - Example: Polygon Permutations, Graph Theory Scheduling.
89
 
90
+ 6. **Complex Tasks:**
91
+ - **Stable performance**, averaging **80.83/100**.
92
+ - Example: Gauge Block Thermal Expansion, Algebraic Topology (Torus vs. Sphere Deformation).
93
+
94
+ 7. **Deceptive Tasks:**
95
+ - **Exceptional problem-solving**, with an average score of **87.5/100**.
96
+ - Example: Steel Rod Thermal Expansion, Farmer's Fence and Gate.
97
+
98
+ 8. **Simple Tasks:**
99
+ - **Near-perfect performance**, with an average score of **95/100**.
100
+ - Example: Widget Production Calculation.
101
 
102
+ 9. **Very Hard Tasks:**
103
+ - **Weakest area**, with an average score of **65/100**, suggesting opportunities for growth.
104
+ - Example: Table Tennis Scheduling Problem.
105
 
 
106
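The per-difficulty averages for the three tiers introduced in this update (Simple, Complex, Deceptive) can be checked with a short script. This is a minimal sketch, not the author's method: it assumes that rows 34-49 of the table hold every task in those tiers, and that a range score such as `75-85/100` is counted at its lower bound. The names `ROWS`, `lower_bound`, and `averages_by_difficulty` are hypothetical. Under those assumptions the results match the reported 95, 80.83, and 87.5.

```python
from collections import defaultdict

# (difficulty, score) pairs copied from rows 34-49 of the table, limited to the
# Simple, Complex, and Deceptive tiers. Assumption: no earlier row uses them.
ROWS = [
    ("Simple", "95/100"),      # 34 Widget Production Calculation
    ("Complex", "90/100"),     # 35 Rectangle Area and Perimeter
    ("Complex", "85/100"),     # 36 Gauge Block Thermal Expansion
    ("Deceptive", "80/100"),   # 37 Steel Rod Thermal Expansion
    ("Deceptive", "95/100"),   # 39 Farmer's Fence and Gate
    ("Complex", "80/100"),     # 45 Torus vs. Sphere Deformation
    ("Complex", "85/100"),     # 46 Missing Jewels Heist
    ("Complex", "75-85/100"),  # 47 Vanishing Violinist
    ("Complex", "70-80/100"),  # 48 Counterfeit Cleat Sabotage
]


def lower_bound(score: str) -> float:
    """Return the numeric score, taking the low end of a range like '75-85/100'."""
    value = score.split("/")[0]
    return float(value.split("-")[0])


def averages_by_difficulty(rows):
    """Group scores by difficulty tier and return each tier's mean, rounded to 2 places."""
    buckets = defaultdict(list)
    for difficulty, score in rows:
        buckets[difficulty].append(lower_bound(score))
    return {tier: round(sum(vals) / len(vals), 2) for tier, vals in buckets.items()}


print(averages_by_difficulty(ROWS))
# {'Simple': 95.0, 'Complex': 80.83, 'Deceptive': 87.5}
```

Using range midpoints instead of lower bounds would put the Complex average at 82.5, so the choice of convention is worth stating next to the table.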