Take it one step further: you can skip generating the structured data entirely. With LLMs, you can query unstructured data directly, and they'll only get better at this over time.
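Something like this, as a rough sketch (assuming the OpenAI Python client; the model name and the example notes are just illustrative):

import os
from openai import OpenAI

client = OpenAI()  # reads OPENAI_API_KEY from the environment

notes = """Apple is good. Let's invest in it.
The supplier missed two delivery deadlines last quarter."""

response = client.chat.completions.create(
    model="gpt-4o-mini",  # illustrative model name
    messages=[
        {"role": "system", "content": "Answer questions using only the provided notes."},
        {"role": "user", "content": f"Notes:\n{notes}\n\nWhich statements are positive, and about what?"},
    ],
)

# The model answers the query against the raw text, no intermediate schema needed.
print(response.choices[0].message.content)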
The key aspect is "ability to understand". Depending on how the LLM is trained and on the context, the structured data might be misleading. For example, I asked ChatGPT to create a JSON structure based on these two statements: "Apple is good. Let's invest in it." It came up with the following:
{
  "statements": [
    {
      "text": "Apple is good.",
      "sentiment": "positive"
    },
    {
      "text": "Let's invest in it.",
      "sentiment": "positive"
    }
  ]
}
Which is nice. But because my input lacked context, I still need to provide additional metadata.
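In practice I'd push that context into the prompt itself. A rough sketch, again assuming the OpenAI Python client (the context line and the schema fields are made up for illustration):

import json
from openai import OpenAI

client = OpenAI()

prompt = """Context: analyst chat about publicly traded companies.
Statements: "Apple is good. Let's invest in it."

Return a JSON object with one entry per statement containing:
text, sentiment, entity, entity_type (company, fruit, ...), and topic."""

response = client.chat.completions.create(
    model="gpt-4o-mini",  # illustrative model name
    response_format={"type": "json_object"},  # ask for well-formed JSON back
    messages=[{"role": "user", "content": prompt}],
)

# With the context supplied up front, "Apple" can be tagged as a company
# rather than left ambiguous.
print(json.dumps(json.loads(response.choices[0].message.content), indent=2))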
This is what Thomson Reuters' Document Intelligence product does.