Semi-structured data is a type of data that doesn’t conform to the strict structure of relational databases (tables, rows, columns) but still has some organizational properties, like tags or markers, to separate data elements and enforce hierarchies.
It sits between structured data (SQL) and unstructured data (raw text, images), offering more flexibility while retaining enough metadata for parsing and querying.
Characteristics
Flexible schema: The structure may vary from record to record.
Self-describing: Contains metadata along with the data (e.g., tags, attributes).
Hierarchical or nested: Often organized in tree-like structures.
Easily parse-able: Can be interpreted by software without rigid database schemas.