After the deal struck between Google and Reddit earlier this year, OpenAI has now struck a deal with Reddit to get real time access to posts there and feed them into ChatGPT. Personally, I have delved deep into the comments in Reddit when doing research on a subject matter (ask me about the time I spent last year researching the exotic lumber market and the details around milling and working with woods like genuine mahogany). Had ChatGPT been able to access those comments and data last year, it would have saved days' worth of time in my research (I tried using it last year and noticed that it was hallucinating a lot and producing false information, which I sometimes only discovered days later when I delved deeper into the subject matter, and went back to correct my initial information gathered from ChatGPT). So the access of LLMs to niche commentary and discussions on Reddit will certainly enrich the content.
I do, though, wonder whether it will be able to parse the garbage and intentional misinformation that will be inevitably planted to try to poison the well. Will we access information aggregated and spit out by ChatGPT and assume it's correct when it isn't? Without specialized knowledge already about a specific topic, one won't be able to know whether the information being read as part of learning or an information gathering exercise is correct or not.
https://www.theverge.com/2024/5/16/24158529/reddit-openai-chatgpt-api-access-advertising