How We Rank AEO Agencies in Dubai: Our Methodology

Q: What if I disagree with a score?

Email hello@northquery.com with the agency, the pillar and the evidence you believe was missed. Verifiable corrections are applied on the next refresh and noted in a public changelog.

The framework

Four pillars, weighted by outcome

Each agency gets scored out of ten on every pillar. The final number is a weighted blend, not a flat average. Coverage and citation wins carry the most weight because they are closest to the result a real buyer feels. Editorial judgment is applied where evidence is borderline or where the recency of underlying citations changes the picture, so a published score may sit a little above or below a pure arithmetic combination.

Pillar	Weight	What it measures
Citation wins	35%	Verifiable proof that the agency moved a real client from unnamed to recommended inside a live AI answer.
Answer engine coverage	30%	Whether the agency works across ChatGPT, Perplexity, Gemini and Google AI Overviews with a distinct method for each, not one playbook copy pasted across all four.
Speed to first result	20%	How fast change appears in live answers, measured from kickoff to first new citation.
Reporting clarity	15%	Whether a non technical owner can read the monthly report and know exactly which AI questions they now appear in.

Pillar deep dive

Exactly what we test, and what earns each score

Each pillar has a public rubric. A ten is reserved for agencies that prove the standard on live evidence. A five is competent but not differentiated. A zero means the agency is missing the basics for that pillar.

01

Citation wins

Weight 35%

This is the single closest signal to the outcome you care about. An agency that cannot show a real client being named inside a real AI answer, with a date attached, is selling a story, not a service.

What we test

Whether the agency can show before and after screenshots of a real client question, dated, with the client's name visible inside the answer.
Whether the citation lives across more than one answer engine, not only the easiest one.
Whether they are willing to open their own dashboard during a sales call and run a live test on your category.
Whether the wins are recent. AI answers shift, so a citation from twelve months ago carries less weight than one from this quarter.

10 / 10

Multiple verifiable, dated citation wins across more than one engine. Will run a live test on your category in the call.

5 / 10

Shows traffic charts and case studies that read like SEO outcomes. Talks around the citation question rather than proving it.

0 / 10

No client examples. Theory only, or screenshots that come from the agency's own blog instead of a real client.

02

Answer engine coverage

Weight 30%

A buyer in Dubai might ask ChatGPT one day, Perplexity the next, and Gemini on a phone the day after. Single engine optimisation leaves most of the question unanswered. We score whether the agency understands that the four engines behave differently and has separate work for each.

What we test

Whether the agency has an active method for ChatGPT, Perplexity, Gemini and Google AI Overviews, or only one or two.
Whether they can explain, in concrete terms, why each engine treats a category differently. Perplexity weighting source reputation, ChatGPT favouring entity strength, Gemini drawing from the Google index, AI Overviews leaning on established authority pages.
Whether they can name which engines tend to move fastest for a given category, and which lag.
Whether the same playbook is being copy pasted across all four engines, or genuinely differentiated.

10 / 10

Distinct, documented method for all four major engines. Can describe how the work shifts for each, with examples.

5 / 10

One or two engines covered well. Other engines acknowledged but with no clear theory of how to move them.

0 / 10

Talks about "AI search" as if it were one thing. No engine specific method anywhere on the site.

03

Speed to first result

Weight 20%

AEO is not a twelve month dark room. Real movement should appear in weeks, not quarters, for most categories. We score how quickly an agency commits to a measured first checkpoint and whether they hit it.

What we test

Typical time from kickoff to first new citation appearing in any answer engine.
Whether the agency commits, in writing, to a thirty day or sixty day checkpoint, with a defined measurement.
Whether they are honest about which engines move quickly and which lag. Perplexity often moves within weeks. AI Overviews can take months.
Whether the engagement structure rewards early proof rather than burying it inside a long contract.

10 / 10

First citation typically appears within four to six weeks for low and mid competition categories. Thirty day checkpoint is in writing.

5 / 10

"Give it a quarter" with no early proof points. Commitment to a measured checkpoint is vague.

0 / 10

Twelve month engagement before anything is measured. No early checkpoint at all.

04

Reporting clarity

Weight 15%

If the monthly report cannot be understood by the person paying the invoice, it is not a report, it is a smokescreen. We weight this lower than the outcome pillars, but it is what makes the difference between a relationship that lasts and one that quietly dies after the third invoice.

What we test

Whether a clinic owner or restaurant manager with no marketing background could read a sample report and act on it.
Whether the report names the actual questions the brand now appears in, not just aggregated metrics.
Whether competitor citations are tracked beside the client's, so you see who else the engine is naming for the same questions.
Whether month over month change is shown in plain language at the top of the document.

10 / 10

Plain language summary at the top, query level detail underneath, side by side competitor view. A non marketer could read it in two minutes.

5 / 10

Generic SEO style PDF with positions and traffic. AI visibility is folded in as a section rather than treated as a distinct discipline.

0 / 10

No reporting beyond an invoice and a vague summary email. No record of which questions you appear in.

The process

How a score is assembled

Five steps, applied to every agency in the same order. We do not contact agencies before scoring and we do not accept payment for inclusion.

1

Run live buyer questions

For each agency's stated category, we run a sample of five real buyer questions through ChatGPT, Perplexity, Gemini and Google AI Overviews. We record which brands appear and how often, before any scoring begins.

2

Audit public materials

Every case study, methodology page and service description on the agency's own site gets read in full. We are looking for specifics, dates and named clients, not adjectives.

3

Score each pillar independently

Each of the four pillars is scored from zero to ten against the public rubric, before any of them are combined. This stops a strong showing on one pillar from softening a weak score on another.

4

Compute the weighted total

The published weights are applied, the scores are summed and rounded to one decimal place. Editorial judgment is applied where evidence is borderline or where the recency of underlying citations changes the picture. The blended number is the score on the ranking page.

5

Re run the same questions thirty days later

The original buyer questions get run again a month later. If an agency's clients have moved up or down inside the answers, we flag it and feed that into the next quarterly refresh.

What stays out of the score

What we ignore, and what disqualifies

A lot of agency marketing leans on signals that sound impressive and predict nothing. Here is what we deliberately leave out of the score, and the small set of red lines that take an agency off the list entirely.

Deliberately ignored

Sounds important. Predicts nothing.

Years in business. A two year old specialist usually beats a fifteen year old generalist on this work.
Number of staff. Citation is technical, not headcount driven.
Industry awards. Most marketing awards predate the answer engine era and do not signal AEO capability.
Retainer pricing. Price is reported but does not move the score. This is not a "cheapest wins" comparison.
Office location. An agency in Dubai with no AEO method ranks below a remote specialist who delivers citations.
Logo walls. A big client name without a verifiable AI citation tells us nothing about the work.

Hard disqualifiers

If any of these are true, the agency is off the list.

Cannot show a single verifiable AI citation, not even on their own brand.
Sells AEO as a checkbox inside an SEO retainer with no separate method, scope or measurement.
Refuses to run a live test on the call when asked.
Uses "AI SEO" and traditional SEO interchangeably, with the same deliverables under both labels.
Cannot name a single engine specific behaviour that affects their work.

Updates and revisions

How often the ranking changes

The ranking is refreshed quarterly. The next scheduled refresh is September 2026. Between refreshes, agencies can move only if a material change occurs.

A "material change" means one of: the agency closes, ownership changes hands, the published method shifts in a substantive way, or a verifiable factual error is brought to us. In those cases the entry is re scored outside the cycle and the page is updated with a dated changelog note.

We do not move an agency up the ranking in response to email pressure, advertising offers or threats. We will, gladly, correct a factual error.

Conflict of interest

Who publishes this and why that matters

We do not pretend to be a neutral third party. The honest version is more useful.

The full disclosure

This site is published by Northquery, an AEO agency that appears in the ranking and is placed first on the homepage. That is a real conflict of interest, and we name it openly here, on the homepage, and inside the page schema.

We do not present ourselves as an independent reviewer with no skin in the game. The scores are an honest editorial evaluation against the framework on this page. The framework rewards specialist focus, citation evidence and live testing, and Northquery scores highest on all three. We think that is the correct outcome. We also accept that any reader is entitled to apply the framework themselves and reach a different conclusion.

Two things we do to keep the ranking defensible despite the conflict:

The framework is public on this page. Anyone can apply it for themselves and arrive at their own conclusion.
Competitor descriptions on the homepage are written without disparagement. Where an agency is marked lower on a pillar, we say why in terms of the framework, not in terms of personal critique.

If you find a factual error, write to [email protected] with the agency, the pillar and the evidence you believe was missed. Verifiable corrections are applied on the next refresh and noted in a public changelog.

Common questions

Methodology, answered

Why is Northquery placed first when you publish the ranking?

Because the framework rewards specialist focus, citation evidence and live testing, and Northquery scores highest on all three. The conflict of interest is real, the framework is public, and anyone can apply it and reach their own conclusion.

Why only four pillars?

Because every other factor people ask about, including agency age, headcount, awards and office address, fails to predict whether a client gets cited in an AI answer. Four pillars hold the signal. Adding more would dilute it without making the score more accurate.

Do you score Dubai based agencies separately from international ones?

No. The framework applies equally to a local team and a remote specialist. An office in Dubai does not create AI citations. The work does. If a Dubai based agency outscores a remote one on the four pillars, it would rank higher. Right now, none does.

Can I see the raw scoring sheet?

The pillar rubrics are public on this page. The internal sheet is not published, but the inputs are reproducible. Run the same buyer questions, read the same agency pages, apply the same rubric, and you will land in the same range. If you arrive somewhere very different, we want to hear about it.

Do you take payment for inclusion in the ranking?

No. There is no paid placement, no sponsored entry, and no agency on the list has paid to appear. Northquery owns and publishes this site, and that disclosure runs at the top of the homepage and on this page.

How often is the ranking refreshed?

Quarterly. The next refresh is September 2026. If an agency closes, changes ownership, or makes a substantial change to its method, it is re scored outside the cycle and noted with a dated changelog entry.

What if I disagree with a score?

Email [email protected] with the agency, the pillar and the evidence you believe was missed. Verifiable corrections are applied on the next refresh.

Does this methodology apply outside Dubai?

The four pillars are universal. Whether an agency serves Dubai, Riyadh, London or New York, the same things decide whether a client gets named in an AI answer. The Dubai edition of the ranking applies the framework to agencies serving this market.

Now go and check the ranking

You have read the framework. The same framework, applied to the agencies serving Dubai right now, produces this list.

See the 2026 ranking Or email a correction

How we rank AEO agencies in Dubai

We score the work that puts a brand inside an answer, not the work that looks good in a deck

Four pillars, weighted by outcome

Exactly what we test, and what earns each score

Citation wins

What we test

Answer engine coverage

What we test

Speed to first result

What we test

Reporting clarity

What we test

How a score is assembled

Run live buyer questions

Audit public materials

Score each pillar independently

Compute the weighted total

Re run the same questions thirty days later

What we ignore, and what disqualifies

Deliberately ignored

Hard disqualifiers

How often the ranking changes

Who publishes this and why that matters

The full disclosure

Methodology, answered

Why is Northquery placed first when you publish the ranking?

Why only four pillars?

Do you score Dubai based agencies separately from international ones?

Can I see the raw scoring sheet?

Do you take payment for inclusion in the ranking?

How often is the ranking refreshed?

What if I disagree with a score?

Does this methodology apply outside Dubai?

Now go and check the ranking