In our implementation, metadata is expressed using the Resource Description Framework (RDF), a W3C standard that structures information as a set of triples—subject, predicate, and object. This model enables precise, machine-readable assertions about datasets, including their title, publisher, license, themes, spatial and temporal coverage, and distribution formats. RDF metadata assertions facilitate linking and federating datasets across different institutions and domains, thereby supporting the principles of Linked Open Data and the Semantic Web.
Considering the Brazilian context and national open data policies—especially those outlined on the Plataforma de Dados Abertos (dados.gov.br)— DCAT Version 3 was implemented. DCAT-BR, a specialization of DCAT, is a Brazilian vocabulary and standard/ecosystem used to describe public sector and linked data sets.
DCAT, or Data Catalog Vocabulary, is an RDF vocabulary developed by the W3C (World Wide Web Consortium) to facilitate interoperability between data catalogs published on the Web. DCAT provides a standard model and vocabulary for describing datasets and data services, enabling people and machines to find, access, and use them more effectively.
This approach is fully aligned with Brazil’s legislative framework for transparency and open government, particularly:
By adopting RDF-based metadata assertions and adhering to DCAT Version 3, our datasets ensure semantic interoperability with both national and international frameworks.
To improve interoperability and semantic accessibility, all datasets used in this project were described using RDF (Resource Description Framework). We followed W3C standards such as DCAT and Dublin Core to create machine-readable metadata for each dataset and catalog.
Two Turtle (.ttl) files were created:
These serializations enable integration into open data portals, support for SPARQL queries, and visualization using RDF tools.