Question 1

What kinds of Vietnamese AI training data do you produce?

Accepted Answer

SFT / instruction datasets, RLHF and preference / DPO annotation, prompt engineering and red-teaming, transcription and speech, and model or MT evaluation, all in native Vietnamese with linguistic QA.

Question 2

How is native annotation different from crowdsourced labelling?

Accepted Answer

Crowdsourced labels optimise for speed and surface fluency and miss register, tone and false friends, with no rationale, so errors repeat. As a native linguist I optimise for accuracy and naturalness, attach a written reason to every label, and hold one standard across the project.

Question 3

What formats do you deliver?

Accepted Answer

JSONL, CSV, XLIFF, CoNLL, and ELAN / TextGrid for speech, or your own schema. Preference data ships as prompt / chosen / rejected / reason.

Question 4

Do you cover Vietnamese dialects?

Accepted Answer

Yes. Northern, Central and Southern register and lexicon, plus the formal / workplace / street axis, with conventions agreed up front.

Question 5

Can you work inside our platform and guidelines?

Accepted Answer

Yes. I work in your annotation platform and label spec and run a small calibration batch first so the standard is locked in before scale.

Question 6

How is pricing handled?

Accepted Answer

Hourly or per-item depending on the task, agreed after a short calibration batch. NDA before any data; USD via Upwork, bank, PayPal or Wise.

Question 7

Do you handle Vietnamese-English code-switching and mixed data?

Accepted Answer

Yes. Real Vietnamese mixes English tech and brand terms; I decide per label when to keep, gloss or localize a borrowed term and keep that policy consistent across the set.

Question 8

How do you measure quality and inter-rater agreement?

Accepted Answer

Against your rubric: error type, severity and a written rationale per item, plus a calibration batch and spot re-checks. I can also act as the gold / adjudication pass for multi-annotator projects.

Question 9

Can you build gold or reference sets for evaluation?

Accepted Answer

Yes. Expert-verified gold responses and evaluation sets with rubrics and edge cases, to benchmark a model or score other labellers against a native standard.

Question 10

What volume and turnaround can you handle?

Accepted Answer

Calibration batches in a day or two; sustained production scoped per project. I would rather ship a smaller, clean, rationale-backed set than a fast and noisy one.

Question 11

Why not just use machine translation or synthetic Vietnamese data?

Accepted Answer

Synthetic and scraped Vietnamese is fluent and wrong in ways that compound: stripped diacritics, flattened register, hallucinated facts and amplified false friends, with no rationale. Native gold data is what a model needs to learn real Vietnamese.

Quality signal	Native expert (me)	Crowdsourced	Synthetic / scraped
Register & honorifics	Controlled	Often wrong	Flattened
False friends	Caught	Missed	Amplified
Factuality	Verified	Varies	Hallucinated
Diacritic integrity	Intact	Varies	Often stripped
Rationale per label	Every item	None	None
Consistency at scale	One standard	Inter-rater drift	Uniform but wrong

Your model is only as good as its

Fluent Vietnamese is easy. Knowing when it's wrong is the job.

Why Vietnamese is hard for AI

Every rejection comes with a reason.

One message, four registers.

Six tones on one syllable.

Which one would a native ship?

Seven years of reading Vietnamese closely.

Native expert vs crowd vs synthetic

From spec to graded data.

Scope & guidelines

Calibration batch

Production with rationale

QA & delivery

Send a task spec, get a plan in a day.

Frequently asked.

The terms, in plain words.

Send me a sample. I'll grade it and tell you what your labellers missed.