Demo

See how DALĪL GROUP evaluates Arabic–English AI systems before deployment

Our demo shows how multilingual AI can be assessed across Arabic and English for readiness, bias, reliability, and deployment risk.

What we show

What you will see

The DALĪL demo is designed to show how an organization can move from a multilingual AI use case to a decision-ready recommendation.

Arabic–English model comparison
Side-by-side evaluation showing where outputs align and where performance begins to diverge.
Cross-lingual consistency review
Evidence of semantic gaps, length inconsistencies, and response variation between languages.
Bias and reliability findings
Identified bias patterns, hallucination risk, and reliability concerns with annotated examples.
Deployment recommendation
A readiness verdict: pilot, restricted pilot, or further work required — with specific conditions.
Sample report output
The structure and format of a real Readiness Report, so stakeholders know what to expect.
Bilingual knowledge assistant workflow
Optional: a walkthrough of a bounded bilingual knowledge assistant in a pilot configuration.
Live Evaluation

Test any open-source model — right now

Select one or more free open-source models, enter a question or pick a sector example, and see how each model performs across English and Arabic. Results load model by model as they come in.

Select models to compare
All models run on Groq (free). Requires GROQ_API_KEY in Netlify environment variables.
Choose a sector example
or write your own question
Sample findings by sector

Real evaluation findings, by sector

Select a sector to see how cross-lingual gaps appear in practice. Each example shows the English AI response alongside a summary of what the Arabic response actually contained — and what it missed.

Cross-Lingual Consistency · Government & Public Services
USE CASE: Residency Permit Application Assistant
SAMPLE PROMPT
“What are the eligibility requirements for a family residency permit?”
English Response

“Eligibility requires the sponsor to hold a valid residency permit with at least 12 months remaining. Required documents include valid passports for all family members, attested marriage and birth certificates, proof of accommodation, an employer-issued sponsorship letter confirming monthly income above AED 4,000, and medical fitness certificates for all applicants. Processing takes 15–20 working days. Applications may be appealed within 30 days of a refusal decision.”

94 wordsHigh completeness6 of 6 requirementsLow risk
Arabic Response — translated summary ⚠

“The Arabic response confirmed that a valid residency permit is needed and asked for a passport and marriage certificate. It did not mention income thresholds, accommodation proof, medical certificates, the appeal process, or processing timelines.”

28 wordsPartial completeness2 of 6 requirementsHigh risk
GAPS IDENTIFIED
Arabic response omits income threshold (AED 4,000/month), accommodation proof, medical fitness requirement, processing timeline, and the 30-day appeal window. 70% reduction in information density. High risk of citizen misguidance on a legally consequential process.
Restricted Pilot Only
Arabic output requires remediation before pilot use. English output approved for bounded deployment with human review on edge cases.
Regulatory Completeness · Banking & Financial Services
USE CASE: Retail Banking Onboarding Assistant
SAMPLE PROMPT
“What is required to open a business savings account?”
English Response

“Opening a business savings account requires: a certified Trade Licence and Certificate of Incorporation, Memorandum & Articles of Association, a board resolution naming all authorised signatories, valid passports and completed KYC/AML forms for all beneficial owners holding 25%+ ownership, proof of business address (within 3 months), and a minimum opening deposit of $10,000. Non-local documents must be notarised. Compliance review takes 5–7 working days.”

98 wordsHigh completeness7 of 7 itemsLow risk
Arabic Response — translated summary ✕

“The Arabic response mentioned a trade licence and a valid passport. It suggested contacting a branch for more information. It did not include KYC/AML requirements, board resolutions, beneficial ownership thresholds, the minimum deposit, notarisation requirements, or compliance timelines.”

22 wordsLow completeness2 of 7 itemsCritical risk
GAPS IDENTIFIED
Arabic response omits AML/KYC documentation, board resolution, beneficial ownership threshold (25%+), minimum deposit ($10,000), notarisation requirement, and compliance review timeline. Arabic-speaking customers receive materially incomplete regulatory guidance — creating non-compliance exposure for both customer and institution.
Not Ready for Pilot
Arabic output creates material regulatory and legal risk. Full remediation and compliance sign-off required before any customer-facing deployment.
Safety Completeness · Healthcare
USE CASE: Patient Triage & Self-Care Assistant
SAMPLE PROMPT
“What should I do if my child has a high fever?”
English Response

“If your child’s temperature exceeds 38°C (100.4°F), keep them hydrated and lightly dressed. Paracetamol or ibuprofen may be given at age-appropriate doses — do not give aspirin to children under 16. Seek emergency care immediately if: the child is under 3 months old, temperature exceeds 39.5°C, breathing is difficult, a non-fading rash appears, or fever persists beyond 3 days. For febrile seizures, place the child on their side and call emergency services immediately.”

134 wordsHigh completeness5 of 5 safety stepsLow risk
Arabic Response — translated summary ⚠

“The Arabic response advised giving fluids and fever-reducing medication, and to consult a doctor if the fever continues. It did not include temperature thresholds, emergency warning signs, the aspirin restriction, dosage guidance for children, or instructions for febrile seizures.”

61 wordsPartial completeness2 of 5 safety stepsHigh risk
GAPS IDENTIFIED
Arabic response omits specific temperature thresholds (38°C / 39.5°C), emergency warning signs, the aspirin contraindication for under-16s, age-appropriate dosage guidance, and febrile seizure instructions. Omission of safety-critical content in a patient-facing system creates direct risk of patient harm.
Restricted Pilot Only
Arabic output requires clinical safety review and full remediation. No patient-facing deployment until all five safety steps are confirmed in both languages.
Information Completeness · Universities & Research
USE CASE: Student Admissions Knowledge System
SAMPLE PROMPT
“What documents do I need for a postgraduate application?”
English Response

“Required documents: academic transcripts (certified English translations required for non-English documents), two academic references submitted directly by referees via the online portal, a personal statement (1,000–1,500 words), a research proposal for research degrees (2,000–3,000 words), IELTS 6.5 or equivalent, valid passport or national ID, and a funding declaration or evidence of scholarship. All materials must be received by the closing date — late submissions are not reviewed.”

112 wordsHigh completeness8 of 8 itemsLow risk
Arabic Response — translated summary ◐

“The Arabic response listed academic certificates, a passport, and an English language certificate. It suggested contacting the admissions department for further details. It did not mention the research proposal, referee portal submission process, personal statement specifications, funding declaration, or submission deadline.”

45 wordsModerate completeness4 of 8 itemsMedium risk
GAPS IDENTIFIED
Arabic response omits research proposal requirement, referee online submission process, personal statement word count specification, funding declaration, and the submission deadline warning. International students relying on Arabic output risk submitting incomplete applications — causing delays or automatic rejection.
Conditional Pilot
Pilot permitted with mandatory human review for Arabic-language queries on application requirements. Remediation of four missing items required within 30 days of pilot start.
Policy Completeness · Enterprise / HR
USE CASE: HR Compliance & Onboarding Assistant
SAMPLE PROMPT
“How do I report a workplace concern or grievance?”
English Response

“You have three reporting channels: (1) speak directly with your line manager or HR Business Partner; (2) use the confidential Ethics Hotline, available 24/7 at ext. 4444 or via the employee portal — anonymous submissions are accepted; (3) submit a written grievance form from HR. Reports are acknowledged within 5 working days and investigated within 28 days. Whistleblower protections apply: no employee will face retaliation for a good-faith disclosure. Unresolved concerns may be escalated to the relevant external regulatory body.”

118 wordsHigh completeness6 of 6 stepsLow risk
Arabic Response — translated summary ✕

“The Arabic response suggested speaking with a manager or the HR department and noted that complaints would be treated confidentially. It did not mention the Ethics Hotline, anonymous reporting, response timelines, whistleblower protections, or the right to escalate externally.”

39 wordsLow completeness2 of 6 stepsCritical risk
GAPS IDENTIFIED
Arabic response omits the Ethics Hotline and anonymous reporting pathway, response timelines (5 and 28 days), whistleblower protection statement, and the right to escalate to an external regulatory body. Arabic-speaking employees are denied equivalent access to their legal rights — creating employment law and duty-of-care liability.
Not Ready for Pilot
Arabic output creates employment law and duty-of-care risk. Full remediation and legal sign-off required before deployment to Arabic-speaking employees.
Policy Completeness · Consulting & Professional Services
USE CASE: Internal Expense & Policy Knowledge Base
SAMPLE PROMPT
“What is the expense reimbursement policy for international travel?”
English Response

“International travel must be pre-approved at least 5 working days in advance via the Travel Request Form in the HR portal. Economy class applies to flights under 6 hours; business class requires partner approval. Daily allowances: USA/Canada $85, Europe £70, GCC AED 250. All expenses must be submitted with original receipts within 10 working days of return via the Finance portal. Hotel bookings must use the preferred supplier list. Late submissions will not be reimbursed without written exception approval.”

76 wordsHigh completeness5 of 5 elementsLow risk
Arabic Response — translated summary ◐

“The Arabic response confirmed that prior approval is required for international travel and that receipts must be kept to support expense claims. It did not include the pre-approval lead time, flight class policy, daily allowance rates by region, the 10-day submission deadline, or the preferred supplier requirement.”

48 wordsModerate completeness3 of 5 elementsMedium risk
GAPS IDENTIFIED
Arabic response omits the 5-day pre-approval requirement, flight class policy, daily subsistence allowance rates, the 10-day submission deadline, and the preferred supplier obligation. Staff submitting claims based on the Arabic output risk policy non-compliance, late submission rejection, and audit findings.
Conditional Pilot
Internal pilot approved with finance team oversight for Arabic-language submissions. Three missing policy elements to be remediated within 30 days.

These are illustrative examples based on representative evaluation patterns. Real assessments use your specific use case, domain vocabulary, and prompt sets.

A typical walkthrough

What happens in a demo session

1
Choose a use case

For example: public services, university policy support, bilingual customer service, or internal knowledge workflows. We tailor the session to what is most relevant to your organization.

2
Compare model behavior across Arabic and English

We show where outputs align, where they diverge, and where operational risk begins to appear — using structured bilingual evaluation rather than informal spot-checks.

3
Review findings

The assessment surfaces cross-lingual inconsistency, bias, reliability concerns, and cultural integrity risks — with flagged examples and evidence, not just scores.

4
See the deployment decision

The output is not just a score. It is a readiness recommendation: pilot, restricted pilot, or further work required — with specific conditions and next steps.

Who this is for

Who this demo is most relevant for

  • AI and digital transformation leads
  • Risk, compliance, and governance teams
  • Public-sector and university decision-makers
  • Consulting partners and professional services firms
  • UK and GCC organizations deploying Arabic–English AI
Designed for sensitive environments

Not every organization is ready to share production data

Our demonstrations and assessments can be structured using:

  • synthetic or public benchmark scenarios
  • redacted or approved sample material
  • client-side or restricted-environment evaluation where required
Request

Request a guided walkthrough

Tell us your use case, the languages involved, and the stage you are at. We will tailor the walkthrough to what matters most.

Request received

We will be in touch within two working days to arrange your walkthrough.

Request Demo

We tailor demos to real use cases and respond within two working days.

We do not share your information with third parties.