Some participants argued that more data is necessary to capture the full range of cultural expressions, while others contended that the focus should be on developing thicker development pipelines that incorporate expertise and context. They discussed the limitations of current models, which often operate on crude metrics and may not adequately represent the richness of cultural data.
This more vs. thicker data debate is a good one, but it also raises a prior question: do we want to model all cultural variation? See, e.g., https://aclanthology.org/2025.naacl-long.273.pdf