What ChatGPT Knows about You: OpenAI’s Journey Towards Data Privacy | by Andrea Valenzuela

Companies that allow their users to request their personal data comply with the aforementioned GDPR regulation. However, there’s a catch: the file format can make the data unreadable for most of the population. In this case, we received both html and json files. While html can be read directly, json files can be harder to interpret. I personally think that new regulations should also enforce a readable format for the data. But for the time being…

Let’s explore the files one by one to get the most out of this new feature!

The first file is chat.html, which contains my entire chat history with ChatGPT. Conversations are stored with their corresponding title. The user’s questions and ChatGPT’s answers are labeled as user and assistant, respectively.

If you have ever trained an AI model yourself, this labeling system will sound familiar to you.

Let’s look at a sample conversation from my history:

Self-made screenshot from my ChatGPT history. The conversation title is highlighted in blue. User/Assistant labels are highlighted in purple and green, respectively.

Have you ever noticed the thumbs-up and thumbs-down icons (👍👎) next to any ChatGPT answer?

ChatGPT treats this information as feedback on a given answer, which then helps in training the chatbot.

This information is stored in the message_feedback.json file, which contains any feedback you provided to ChatGPT using the thumbs icons. The information is stored in the following format:

[{"message_id": <MESSAGE ID>, "conversation_id": <CONVERSATION ID>, "user_id": <USER ID>, "rating": "thumbsDown", "content": "{\"tags\": [\"not-helpful\"]}"}]

The thumbsDown rating marks wrongly generated answers, while the thumbsUp rating marks correctly generated ones.
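To get a quick overview of the feedback you have given, the records can be tallied with a few lines of Python. This is a minimal sketch: it assumes the file is a list of records shaped like the one above and that the content field is itself a JSON-encoded string. The function name summarize_feedback and the identifiers in the sample record are made up for illustration.

```python
import json
from collections import Counter


def summarize_feedback(records):
    """Count ratings and tags across a list of feedback records."""
    ratings = Counter()
    tags = Counter()
    for record in records:
        ratings[record.get("rating", "unknown")] += 1
        content = record.get("content")
        # Assumption: "content" holds a JSON-encoded string such as
        # '{"tags": ["not-helpful"]}'. Anything unparsable is skipped.
        if isinstance(content, str) and content:
            try:
                content = json.loads(content)
            except json.JSONDecodeError:
                content = {}
        if isinstance(content, dict):
            tags.update(content.get("tags", []))
    return ratings, tags


# Illustrative record with made-up identifiers, not real export data.
sample = [
    {
        "message_id": "msg-1",
        "conversation_id": "conv-1",
        "user_id": "user-1",
        "rating": "thumbsDown",
        "content": json.dumps({"tags": ["not-helpful"]}),
    }
]

ratings, tags = summarize_feedback(sample)
print(ratings)
print(tags)
```

To run it on your own export, replace sample with the result of json.load() on the opened message_feedback.json file.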

There is also a file (user.json) containing the following personal data about the user:

false], "phone_number": <USER PHONE>

Some platforms are known for building a model of the user based on their usage of the platform. For example, if a user’s Google searches are mostly about programming, Google is likely to infer that the user is a programmer and use this information to show personalized advertisements.

ChatGPT could do the same with the information from the conversations, but they are currently obliged to include this inferred information in the exported data.

⚠️ FYI, you can see what Google knows about you from Gmail by clicking on Account >> Data & Privacy >> Personalized Ads >> My Ad Center.

There is another file that contains the conversation history along with some metadata. This file is called conversations.json and includes information such as the creation time, several identifiers, and the model behind ChatGPT, among others.

⚠️ Metadata provides information about the main data. It may include information such as the origin of the data, its meaning, its location, its ownership, and its creation. Metadata is information related to the main data, but it is not part of it.
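Since conversations.json is harder to read than the html file, a short script can pull out just the metadata. The sketch below assumes each conversation is a JSON object with a title key and a create_time key holding a Unix timestamp in seconds. Both key names and the helper list_conversations are assumptions, so check them against your own export before relying on them.

```python
import json
from datetime import datetime, timezone


def list_conversations(conversations):
    """Return (title, creation time) pairs, leaving out the message bodies."""
    rows = []
    for conv in conversations:
        # Assumed key names: "title" and "create_time".
        title = conv.get("title", "<untitled>")
        created = conv.get("create_time")
        # Assumption: the creation time is a Unix timestamp in seconds.
        if isinstance(created, (int, float)):
            created = datetime.fromtimestamp(created, tz=timezone.utc).isoformat()
        rows.append((title, created))
    return rows


# Illustrative record: the title comes from the article, the timestamp is a placeholder.
sample = [{"title": "A320 Hydraulic System Failure", "create_time": 0}]

for title, created in list_conversations(sample):
    print(f"{created}  {title}")
```

For a real export, pass the result of json.load() on the opened conversations.json file instead of sample.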

Let’s explore the same conversation about the A320 Hydraulic System Failure shown in the first example, this time in json format. The conversation itself consists of the following Q&A: