An Introduction to OpenAI Perform Calling | by David Hundley

No extra unstructured information outputs; flip ChatGPT’s completions into structured JSON!

A number of months in the past, OpenAI launched their API to most people, which excited many builders who needed to utilize ChatGPT’s outputs in a scientific approach. As thrilling has this has been, it’s equally been a little bit of a nightmare since we programmers are likely to work within the realm of structured information varieties. We like integers, booleans, and lists. The unstructured string may be unwieldy to take care of, and with a purpose to get constant outcomes, a programmer is required to face their worst nightmare: creating a daily expression (Regex) for correct parsing. 🤢

In fact, immediate engineering can truly assist fairly a bit right here, nevertheless it’s nonetheless not good. For instance, if you wish to have ChatGPT analyze the sentiment of a film evaluate for positivity or negativity, you may construction a immediate that appears like this:

immediate = f'''
Please carry out a sentiment evaluation on the next film evaluate:
{MOVIE_REVIEW_TEXT}
Please output your response as a single phrase: both "Optimistic" or "Unfavourable".
'''

This immediate truthfully does fairly decently, however the outcomes aren’t exactly constant. For instance, I’ve seen ChatGPT produce outputs that seem like the next when it comes to the film sentiment instance:

Optimistic
constructive
Optimistic.

This won’t appear to be a giant deal, however on this planet of programming, these are NOT equal. Once more, you will get round an easier instance like this with a little bit of Regex, however past the truth that most individuals (together with myself) are horrible at writing common expressions, there are merely some cases the place even Regex can’t parse the knowledge accurately.

As you possibly can inform, programmers have been hoping that OpenAI would add performance to help structured JSON outputs, and OpenAI has delivered within the type of perform calling. Perform calling is strictly because it sounds: it permits ChatGPT to provide arguments that may work together with a customized perform in a way that makes use of structured information varieties. Yup, no extra fancy immediate engineering and Regex to cross your fingers and hope you get the precise end result. On this publish, we’ll cowl easy methods to make use of this new performance, however first, let’s begin with an instance of how we used to aim to provide structured information outputs with immediate engineering and Regex.

Earlier than we bounce into the majority of our publish, please permit me to share a hyperlink to this Jupyter pocket book in my GitHub. This pocket book incorporates all of the code I might be working (and extra) as a part of this weblog publish. Moreover, I’d encourage you to take a look at OpenAI’s official perform calling documentation for something that I’ll not cowl right here.

To display what we used to do within the “pre-function calling days”, I wrote a small little bit of textual content about myself, and we’ll be utilizing the OpenAPI to extract bits of data from this textual content. Right here is the “About Me” textual content we’ll be working with:

Hey! My title is David Hundley. I’m a principal machine studying engineer at State Farm. I get pleasure from studying about AI and educating what I be taught again to others. I’ve two daughters. I drive a Tesla Mannequin 3, and my favourite online game sequence is The Legend of Zelda.

Let’s say I need to extract the next bits of data from that textual content:

Identify
Job title
Firm
Variety of youngsters as an integer (That is necessary!)
Automobile make
Automobile mannequin
Favourite online game sequence

Right here’s how I’d engineer a few-shot immediate with a purpose to produce a structured JSON output:

# Engineering a immediate to extract as a lot info from "About Me" as a JSON object
about_me_prompt = f'''
Please extract info as a JSON object. Please search for the next items of data.
Identify
Job title
Firm
Variety of youngsters as a single integer
Automobile make
Automobile mannequin
Favourite online game sequenceThat is the physique of textual content to extract the knowledge from:
{about_me}
'''
# Getting the response again from ChatGPT (gpt-3.5-turbo)
openai_response = openai.ChatCompletion.create(
mannequin="gpt-3.5-turbo",
messages = [{'role': 'user', 'content': about_me_prompt}]
)
# Loading the response as a JSON object
json_response = json.hundreds(openai_response['choices'][0]['message']['content'])
json_response

Let’s try how ChatGPT returned this completion to me:

The “Pre-Perform Calling” Days (Captured by the writer)

As you possibly can see, this truly isn’t dangerous. However it’s not ideally suited and will show to be dangerous for the next causes:

We’re not assured that OpenAI’s response will present a clear JSON output. It may have produced one thing like “Right here is your JSON:” adopted by the JSON output, which means that with a purpose to use json.hundreds() to parse the string right into a JSON object, we’d first should strip out that little little bit of textual content that opens the response.
We’re not assured that the keys within the key-value pairs of the JSON object might be constant from API name to API name. Recall the instance from above of the three cases of the phrase Optimistic. That is exactly the identical danger you run attempting to have ChatGPT parse out keys by means of few-shot immediate engineering. The one approach you could possibly perhaps lock this down is with Regex, which comes with its personal baggage as we already mentioned.
We’re not assured to obtain our responses within the correct information sort format. Whereas our immediate engineering to extract variety of youngsters did parse into a correct integer, we’re on the mercy of crossing our fingers and hoping we get that constant outcome for each API name.

We may summarize these points right into a single assertion: With out perform calling, we’re not assured to get constant outcomes which are necessary for the precision required for systematic implementation. It’s a nontrivial challenge that may be very difficult to treatment by means of immediate engineering and common expressions.

Now that we’ve constructed an instinct round why getting structured outputs from ChatGPT was previously problematic, let’s transfer into wanting on the new perform calling functionality launched by OpenAI.

Perform calling is definitely a little bit of a misnomer. OpenAI is just not truly working your code in a real perform name. Somewhat, it’s merely establishing the structured arguments you’d have to execute your personal customized capabilities, and I’d argue that is most well-liked conduct. Whilst you is likely to be considering that it doesn’t make sense that the OpenAI API isn’t executing your customized perform, contemplate that with a purpose to do this, you’d should go that perform code into ChatGPT. This perform code in all probability incorporates proprietary info that you’d NOT need to expose to anyone, therefore why it’s good that you just don’t truly should go this code to utilize OpenAI’s perform calling.

Let’s bounce into an instance of easy methods to allow perform calling with a single customized perform. Utilizing our “About Me” pattern textual content from the earlier part, let’s create a customized perform known as extract_person_info. This perform wants simply three bits of data: individual title, job title, and variety of youngsters. (We’ll revisit extracting the remainder of the knowledge within the subsequent part; I simply need to begin less complicated for now.) This practice perform is deliberately quite simple and can merely take our arguments and print them collectively in a single string. Right here’s the code for this:

def extract_person_info(title, job_title, num_children):
'''
Prints primary "About Me" infoInputs:
- title (str): Identify of the individual
- job_title (str): Job title of the individual
- num_chilren (int): The variety of youngsters the guardian has.
'''
print(f'This individual's title is {title}. Their job title is {job_title}, they usually have {num_children} youngsters.')

With a purpose to make use of perform calling, we have to arrange a JSON object in a selected approach that notes the title of our customized perform and what information parts we hope ChatGPT will extract from the physique of the textual content. Due to the specificity on how this JSON object ought to look, I’d encourage you reference OpenAI’s developer documentation if you wish to know any particulars that I don’t cowl right here.

(Observe: Within the OpenAI documentation, I observed one factor within the JSON object known as required that seemingly signifies {that a} parameter should be current for ChatGPT to correctly acknowledge the perform. I attempted testing this out, and both this isn’t how this performance works or I did one thing improper. Both approach, I transparently do not know what this required parameter signifies. 😅)

Right here is how we have to construction our JSON object to utilize our customized perform:

my_custom_functions = [
{
'name': 'extract_person_info',
'description': 'Get "About Me" information from the body of the input text',
'parameters': {
'type': 'object',
'properties': {
'name': {
'type': 'string',
'description': 'Name of the person'
},
'job_title': {
'type': 'string',
'description': 'Job title of the person'
},
'num_children': {
'type': 'integer',
'description': 'Number of children the person is a parent to'
}
}
}
}
]

You’re in all probability already conversant in JSON syntax, though let me draw consideration for a second to the info sort related to every property. If you’re a Python developer like myself, bear in mind that the info typing for this JSON construction is NOT straight equal to how we outline information constructions in Python. Usually talking, we are able to discover equivalencies that work out alright, however if you wish to know extra concerning the particular information varieties related to this JSON construction, try this documentation.

Now we’re able to make our API name to get the outcomes! Utilizing the Python consumer, you’ll discover the syntax is similar to how we acquire completions on the whole. We’re simply going so as to add some further arguments into this name that signify our perform calling:

# Getting the response again from ChatGPT (gpt-3.5-turbo)
openai_response = openai.ChatCompletion.create(
mannequin="gpt-3.5-turbo",
messages = [{'role': 'user', 'content': about_me}],
capabilities = my_custom_functions,
function_call="auto"
)print(openai_response)

As you possibly can see, we merely go in our record of customized capabilities (or in our case for now, our singular customized perform) because the capabilities parameter, and also you’ll additionally discover a further parameter known as function_call that we’ve set to auto. Don’t fear about this for now as we’ll revisit what this auto piece is doing within the subsequent part.

Let’s run this code and try the total API response from ChatGPT

Perform calling with a single perform (Captured by the writer)

For probably the most half, this response seems the identical as a non-function name response, however now there’s a further area within the response known as function_call, and nested underneath this dictionary are two further objects: title and arguments. title signifies the title of our customized perform that we’ll be calling with ChatGPT’s output, and arguments incorporates a string that we are able to load utilizing json.hundreds() to load our customized perform arguments as a JSON object.

Discover now that we’re getting far more consistency than we had been in our pre-function calling methodology. Now we may be assured that the keys of the key-value pairs WILL be constant, and the info varieties WILL be constant. No want for fancy immediate engineering or common expressions!

That’s the core of OpenAI’s perform calling! In fact, this was a really simplistic instance to get you going, however you in all probability have further questions. Let’s cowl these on this subsequent part.

The earlier part coated a quite simple instance of easy methods to allow perform calling, however in the event you’re like me, you in all probability have some further questions past this level. Naturally, I can’t cowl all these questions, however I do need to cowl two huge ones which are barely extra superior than what we coated within the earlier part.

What if the immediate I submit doesn’t comprise the knowledge I need to extract per my customized perform?

In our authentic instance, our customized perform sought to extract three very particular bits of data, and we demonstrated that this labored efficiently by passing in my customized “About Me” textual content as a immediate. However you is likely to be questioning, what occurs in the event you go in another immediate that doesn’t comprise that info?

Recall that we set a parameter in our API consumer name known as function_call that we set to auto. We’ll discover this even deeper within the subsequent subsection, however what this parameter is basically doing is telling ChatGPT to make use of its greatest judgment in determining when to construction the output for one in every of our customized capabilities.

So what occurs once we submit a immediate that doesn’t match any of our customized capabilities? Merely put, it defaults to typical conduct as if perform calling doesn’t exist. Let’s check this out with an arbitrary immediate: “How tall is the Eiffel Tower?”

Perform calling however with a immediate that doesn’t match the perform (Captured by the writer)

As you possibly can see, we’re getting a typical “Completions” output although we handed in our customized perform. Naturally, this is smart since this arbitrary Eiffel Towel immediate incorporates not one of the particular info we’re in search of.

What if I need to go a number of customized capabilities and a few of them have overlapping parameters?

In brief, ChatGPT intelligently handles this with out a downside. The place we beforehand handed in a single customized perform as primarily an inventory of Python dictionaries, we simply have to hold including further Python dictionaries to this similar record, every representing its personal distinct perform. Let’s add two new capabilities: one known as extract_vehicle_info and one other known as extract_all_info. Right here’s what our adjusted syntax seems like:

# Defining a perform to extract solely car info
def extract_vehicle_info(vehicle_make, vehicle_model):
'''
Prints primary car infoInputs:
- vehicle_make (str): Make of the car
- vehicle_model (str): Mannequin of the car
'''
print(f'Car make: {vehicle_make}nVehicle mannequin: {vehicle_model}')
# Defining a perform to extract all info supplied within the authentic "About Me" immediate
def extract_vehicle_info(title, job_title, num_children, vehicle_make, vehicle_model, company_name, favorite_vg_series):
'''
Prints the total "About Me" info
Inputs:
- title (str): Identify of the individual
- job_title (str): Job title of the individual
- num_chilren (int): The variety of youngsters the guardian has
- vehicle_make (str): Make of the car
- vehicle_model (str): Mannequin of the car
- company_name (str): Identify of the corporate the individual works for
- favorite_vg_series (str): Individual's favourite online game sequence.
'''
print(f'''
This individual's title is {title}. Their job title is {job_title}, they usually have {num_children} youngsters.
They drive a {vehicle_make} {vehicle_model}.
They work for {company_name}.
Their favourite online game sequence is {favorite_vg_series}.
''')
# Defining how we wish ChatGPT to name our customized capabilities
my_custom_functions = [
{
'name': 'extract_person_info',
'description': 'Get "About Me" information from the body of the input text',
'parameters': {
'type': 'object',
'properties': {
'name': {
'type': 'string',
'description': 'Name of the person'
},
'job_title': {
'type': 'string',
'description': 'Job title of the person'
},
'num_children': {
'type': 'integer',
'description': 'Number of children the person is a parent to'
}
}
}
},
{
'name': 'extract_car_info',
'description': 'Extract the make and model of the person's car',
'parameters': {
'type': 'object',
'properties': {
'vehicle_make': {
'type': 'string',
'description': 'Make of the person's vehicle'
},
'vehicle_model': {
'type': 'string',
'description': 'Model of the person's vehicle'
}
}
}
},
{
'name': 'extract_all_info',
'description': 'Extract all information about a person including their vehicle make and model',
'parameters': {
'type': 'object',
'properties': {
'name': {
'type': 'string',
'description': 'Name of the person'
},
'job_title': {
'type': 'string',
'description': 'Job title of the person'
},
'num_children': {
'type': 'integer',
'description': 'Number of children the person is a parent to'
},
'vehicle_make': {
'type': 'string',
'description': 'Make of the person's vehicle'
},
'vehicle_model': {
'type': 'string',
'description': 'Model of the person's vehicle'
},
'company_name': {
'type': 'string',
'description': 'Name of the company the person works for'
},
'favorite_vg_series': {
'type': 'string',
'description': 'Name of the person's favorite video game series'
}
}
}
}
]

Discover particularly how the extract_all_info covers a number of the similar parameters as our authentic extract_person_info perform, so how does ChatGPT know which one to pick out? Merely put, ChatGPT seems for one of the best match. If we go in a immediate that incorporates all of the arguments wanted for the extract_all_info perform, that’s the one it’ll choose. But when we simply go in a immediate that incorporates both simply easy details about me or a immediate about my car, it’ll leverage the respective capabilities that do this. Let’s execute that in code right here with just a few samples:

Pattern 1: The unique “About Me” textual content. (See above.)
Pattern 2: “My title is David Hundley. I’m a principal machine studying engineer, and I’ve two daughters.”
Pattern 3: “She drives a Kia Sportage.”

Pattern #1’s Outcomes (Captured by the writer)

Pattern #2’s Outcomes (Captured by the writer)

With every of the respective prompts, ChatGPT chosen the proper customized perform, and we are able to particularly notice that within the title worth underneath function_call within the API’s response object. Along with this being a helpful technique to determine which perform to make use of the arguments for, we are able to programmatically map our precise customized Python perform to this worth to run the proper code appropriately. If that doesn’t make sense, maybe taking a look at this in code would make this extra clear:

# Iterating over the three samples
for i, pattern in enumerate(samples):print(f'Pattern #{i + 1}'s outcomes:')
# Getting the response again from ChatGPT (gpt-3.5-turbo)
openai_response = openai.ChatCompletion.create(
mannequin="gpt-3.5-turbo",
messages = [{'role': 'user', 'content': sample}],
capabilities = my_custom_functions,
function_call="auto"
)['choices'][0]['message']
# Checking to see {that a} perform name was invoked
if openai_response.get('function_call'):
# Checking to see which particular perform name was invoked
function_called = openai_response['function_call']['name']
# Extracting the arguments of the perform name
function_args = json.hundreds(openai_response['function_call']['arguments'])
# Invoking the correct capabilities
if function_called == 'extract_person_info':
extract_person_info(*record(function_args.values()))
elif function_called == 'extract_vehicle_info':
extract_vehicle_info(*record(function_args.values()))
elif function_called == 'extract_all_info':
extract_all_info(*record(function_args.values()))

Remaining programmatic outcomes! (Captured by the writer)

**Beware one factor**: Within the spirit of full transparency, I needed to run that code there a number of instances to get it to provide like that. The difficulty is that as a result of the extract_person_info and extract_all_info are extra comparable in nature, ChatGPT saved complicated these for each other. I assume the lesson to be realized right here is that your capabilities ought to be extracting distinct info. I additionally solely examined utilizing gpt-3.5-turbo, so it’s potential {that a} extra highly effective mannequin like GPT-4 may have dealt with that higher.

I hope you possibly can see now why perform calling may be so highly effective! Relating to constructing functions that leverage Generative AI, this sort of perform calling is a godsend for programmers. By not having to fret a lot now concerning the output JSON construction, we are able to now focus our time on constructing out different components of the applying. It’s an superior time to be working on this house 🥳