Will an LLM be able to match the ground truth >85% of the time when performing PII detection by 2024 end?
84% chance · Ṁ4723 · Dec 31
PII: personally identifiable information
This covers things like people's names, identifying numbers and codes (SSN, phone number, passport number, etc.), places and locations, names of orgs, and any other attributes that can be used to identify a person.
GPT-4 outperforms Presidio, Microsoft's custom-built tool for PII detection. GPT-4 matches the ground truth ~77.4% of the time, and it misses a single PII element ~13% of the time.
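The question does not spell out how "matching the ground truth" is scored, so the following is only a rough sketch of one plausible reading: a document counts as a match when the set of detected PII elements equals the labeled set exactly, and as a "single miss" when exactly one labeled element is missing and nothing extra was flagged. The function names, the `(type, text)` tuple representation, and the sample data are all hypothetical and do not come from Microsoft's evaluation.

```python
def pii_match_rates(predicted, ground_truth):
    """Return (exact_match_rate, single_miss_rate) over paired documents.

    Each item in `predicted` and `ground_truth` is an iterable of hashable
    PII elements, e.g. ("NAME", "Alice"). This scoring rule is an assumption.
    """
    if len(predicted) != len(ground_truth):
        raise ValueError("predicted and ground_truth must have the same length")
    if not ground_truth:
        raise ValueError("need at least one document to score")

    exact = 0
    single_miss = 0
    for pred, truth in zip(predicted, ground_truth):
        pred_set, truth_set = set(pred), set(truth)
        missed = truth_set - pred_set   # labeled elements the model did not find
        extra = pred_set - truth_set    # elements the model flagged incorrectly
        if not missed and not extra:
            exact += 1
        elif len(missed) == 1 and not extra:
            single_miss += 1

    n = len(ground_truth)
    return exact / n, single_miss / n


def resolves_yes(exact_match_rate, threshold=0.85):
    """The question asks for strictly more than 85%."""
    return exact_match_rate > threshold


# Made-up sample: four documents, one of which misses a single element.
sample_truth = [
    {("NAME", "Alice"), ("PHONE", "555-0100")},
    {("SSN", "000-00-0000"), ("NAME", "Bob")},
    set(),
    {("ORG", "Acme")},
]
sample_pred = [
    {("NAME", "Alice"), ("PHONE", "555-0100")},
    {("SSN", "000-00-0000")},
    set(),
    {("ORG", "Acme")},
]

exact_rate, miss_rate = pii_match_rates(sample_pred, sample_truth)
print(f"exact match: {exact_rate:.1%}, single miss: {miss_rate:.1%}")
print("would resolve YES:", resolves_yes(exact_rate))
```

Under this reading, the reported ~77.4% figure would fall short of the threshold, and a looser rule (for example, per-element recall) could give a noticeably different number, so the scoring choice matters for resolution.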
This question is managed and resolved by Manifold.
@PatrickDelaney I think Microsoft tested against their in-house system, which does detect PII on real data.
Related questions
LLM Hallucination: Will an LLM score >90% on SimpleQA before 2026?
60% chance
Will the most interesting AI in 2027 be a LLM?
52% chance
What will be true of OpenAI's best LLM by EOY 2025?
By 2027, will it be generally agreed upon that LLM produced text > human text for training LLMs?
62% chance
Will LLMs' loss function achieve the level of entropy of human text by the end of 2030?
61% chance
Will any widely used LLM be pre-trained with abstract synthetic data before 2030?
74% chance
Will an LLM improve its own ability along some important metric well beyond the best trained LLMs before 2026?
58% chance
By 2028 will we be able to identify distinct submodules/algorithms within LLMs?
75% chance
By 2025 end, will it be generally agreed upon that LLM produced text/code > human text/code for training LLMs?
20% chance
Will a LLM-based AI be used for a law enforcement decision before 2025?
18% chance