I need a freelancer to prepare benchmark questions and answers for testing a custom LLM’s reasoning ability.
Scope:

Question Set:
- Collect 500–600 LLM benchmark questions with correct answers.
- Focus areas: logical, mathematical, commonsense, analytical, and multi-step reasoning.
- Deliver as JSON or CSV.

Python Script:
- Load questions and send them to an LLM (I'll handle API integration).
- Compare model answers to correct ones.
- Output a simple accuracy report.

Requirements:
- Knowledge of LLMs, reasoning datasets, or NLP is preferred.
- Clean, documented code.
- Use only open or original questions.
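To make the script requirement concrete, here is a rough sketch of the kind of flow I have in mind. It is only an illustration under assumptions: the field names `question`, `answer`, and `category`, the helper names, and the exact-match comparison are all placeholders, and `ask_model` stands in for the API call that I will supply myself.

```python
import csv
import json
from collections import defaultdict
from pathlib import Path


def load_questions(path):
    """Load question records from a .json or .csv file.

    Assumed (hypothetical) schema: each record has 'question' and 'answer',
    plus an optional 'category' such as 'logical' or 'mathematical'.
    """
    path = Path(path)
    if path.suffix.lower() == ".json":
        with path.open(encoding="utf-8") as f:
            return json.load(f)
    with path.open(newline="", encoding="utf-8") as f:
        return list(csv.DictReader(f))


def normalize(text):
    """Lowercase, collapse whitespace, and drop trailing periods."""
    return " ".join(str(text).lower().split()).rstrip(".")


def evaluate(questions, ask_model):
    """Score answers by normalized exact match.

    ask_model is a placeholder callable (question text -> answer text);
    the real API integration is outside this sketch.
    Returns (correct, total, per_category) where per_category maps
    a category name to [correct, total].
    """
    per_category = defaultdict(lambda: [0, 0])
    for item in questions:
        category = item.get("category") or "uncategorized"
        predicted = ask_model(item["question"])
        per_category[category][1] += 1
        if normalize(predicted) == normalize(item["answer"]):
            per_category[category][0] += 1
    correct = sum(c for c, _ in per_category.values())
    total = sum(t for _, t in per_category.values())
    return correct, total, dict(per_category)


def format_report(correct, total, per_category):
    """Build a plain-text accuracy report, overall and per category."""
    if total == 0:
        return "No questions evaluated."
    lines = [f"Overall accuracy: {correct}/{total} ({correct / total:.1%})"]
    for category in sorted(per_category):
        c, t = per_category[category]
        lines.append(f"  {category}: {c}/{t} ({c / t:.1%})")
    return "\n".join(lines)
```

Exact match after normalization is the simplest comparison and will undercount correct answers that are phrased differently, so a freelancer may want to propose something more tolerant (for example, numeric comparison for math questions).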