What Is an AI Coding Agent?

een koppeling hebt gedeeld

2025-06-19 17:01:56

BLOG.JETBRAINS.COM

AI has quickly become one of the most discussed subjects globally and it now seems to be able to do just about everything for us. Students are asking it to help them with their homework, and lawyers are even using it for case research. AI agents have grown rapidly in popularity due to the widespread usage of large language models (LLMs) like OpenAIs ChatGPT. As a result, developers have also felt pressured to start using AI coding agents.In light of all this, it has become imperative for us to understand how AI coding agents work and how we can come up with workable prompts to get the most out of AI in our software development or data science projects. At JetBrains, we have our own coding agent for JetBrains IDEs we call it Junie. We invest a lot of effort in high performance of Junie explaining the reasoning and logic of the coding agent to make it clearer for all of you.LLMs and AI coding agents whats the connection?Without LLMs, AI agents as we know them would be totally different. As a rough analogy, LLMs are to AI agents what engines are to cars. Without engines, there would be no cars. However, not all machines with engines are cars.The work of an AI agent comprises several different stages:Perceiving the relevant informationAt this stage, the agent processes data in your project, including your code and any supporting files, together with your prompt, and sends that data to the LLM that it is using for processing.Reasoning with the LLMCommunication with the LLM is usually carried out according to a specific protocol. This protocol makes processing easier by specifying a format that the agent has to adhere to when sending information and prompts to the LLM.Putting the plan into actionAfter the LLM has processed the information, it will provide some suggested actions or generate some code. In the next step, the agent will take these instructions for various actions and perform them.Evaluation and feedbackIn the last stage, there are options to perform various tests and checks to evaluate the correctness of the result and make adjustments if needed.AI coding agents arent the only type of AI agent that is driven by LLMs. So what makes them special? How are they tailored specifically to the needs of coders? Lets find out!AI coding agents are designed to perform coding tasks with less user supervision. They can formulate an action plan that can potentially achieve the goal given by the user and execute them. An AI coding agent like Junie can also evaluate code and run tests to catch any errors that might crop up. Here is a simplified workflow of an AI coding agent:Set the scene for the LLMBefore the agent knows what to do, some basic information needs to be provided to the LLM in order to ensure a useful output. For example, we need to tell the model what tools are available and what format we want the action plan to be in. In terms of tools, the action plan might consist of functions that we created and that can be executed to perform a task, such as creating a file or listing a directory.Generate an action planNext, when a users prompt about the task is received, the agent asks the LLM to generate an action plan in the desired format. Optionally, we can also ask the LLM to act out a given thought process, which is the logic according to which this action plan is formulated. Once the action plan is received, it gets parsed into a format that can be followed and executed. In our example, we ask for an action plan in JSON format, and by using Pythons JSON library, we translate that action plan into a Python dictionary.Execute the action planNow that we have information about the action plan, the predefined tool functions can be executed according to the plan. The result of executing each step is noted and used in the evaluation stage.Evaluate the resultFinally, we ask the LLM to evaluate the result. In case of failure, the action plan can be updated (or a follow-up plan generated) to fix the error. Once we have an updated plan (or follow-up plan), we can attempt to execute the plan once again and evaluate the result. Keep in mind that there should be a maximum number of attempts or timeouts for this iteration, otherwise we could get stuck in an infinite loop.Converting your conversation with LLM into coding actionsAs an example of how an AI coding agent works, lets build a simplified version of an AI coding agent that works with Python code.Set up your LLM modelEarlier we reasoned that an AI coding agent without an LLM would be like a car without an engine. For this reason, its crucial that we start by securing access to an LLM model. While you can choose to use proprietary services from companies like OpenAI and Anthropic, for demonstration purposes, well use an open-source model.In this example, well use the DeepSeek-R1 model llamafile here on Hugging Face. Follow the instructions here to get the llamafile for the model.Why llamafile? What is it?llamafile is a single-file executable that runs locally on most computers, with no installation required.If youre using macOS, Linux, or BSD, make sure you grant permission for your computer to execute this new file.chmod +x DeepSeek-R1-Distill-Qwen-7B-Q6_K.llamafileNow we can launch the model as a local server by running the llamafile. Make sure you pass in the flag `-ngl 999` to offload to your available GPUs../DeepSeek-R1-Distill-Qwen-7B-Q6_K.llamafile -ngl 999Voil! The model is now available on your local server.2. Testing the LLM model server with Python codeThe llamafile server is also compatible with `openai` APIs. We can provide a prompt and get a result by using a Python script. For example:#!/usr/bin/env python3from openai import OpenAIclient = OpenAI(base_url="http://127.0.0.1:8080/v1", # "http://<Your api-server IP>:port"api_key = "sk-no-key-required")completion = client.chat.completions.create(model="DeepSeek-R1-Distill-Qwen-7B-Q6_K",messages=[{"role": "assistant", "content": "You are an AI coding agent. Your top priority is perform coding task requested by the user."},{"role": "user", "content": "create a Python function that returns the square of a number."},])content = completion.choices[0].message.contentprint(content)When you run this script, heres what you might get back from the LLM:<think>Okay, so the user asked me to create a Python function that returns the square of a number. Hmm, I need to figure out how to approach this. Let me start by recalling what a function in Python looks like. I remember that functions are defined using the def keyword, followed by the function name and parentheses for parameters.Alright, the function needs to take a number as input. Lets name it num for clarity. So the function definition would be something like def square(num):. Now, inside the function, I need to calculate the square of this number. Squaring a number means multiplying it by itself, so that would be num * num.Wait, is there another way to square a number in Python? Oh right, theres the exponentiation operator **, so I could also write num ** 2. Either way works, but using multiplication is straightforward and easy to understand for someone new.Next, I need to make sure the function returns this squared value. So after calculating, Ill use the return statement. Putting it all together, the function should look like:def square(num):return num * numI should test this function to ensure it works correctly. Let me try a few examples. If I pass 3, it should return 9. 3 squared is indeed 9, so thats good. What about a negative number, like -4? Squaring it should give 16, which the function should handle correctly since multiplying two negatives gives a positive.Also, zero is a good test case. 0 squared is 0, so the function should return 0 in that case. I think these test cases cover the basic scenarios and confirm that the function is working as intended.Are there any edge cases I should consider? Maybe very large numbers, but since Python handles big integers well, that shouldnt be a problem. What about non-integer inputs? The function will still work because multiplying floats is valid, but the user might expect an integer if they pass one. However, the function doesnt enforce input type, so its up to the user to handle that as needed.Overall, the function seems simple enough and should meet the users requirements. I think this is a solid solution.</think>To create a Python function that returns the square of a number, you can follow these steps:1. **Define the function**: Use the `def` keyword to create a function named `square` that takes one parameter, `num`.2. **Calculate the square**: Inside the function, multiply `num` by itself (`num * num`) to get the square.3. **Return the result**: Use the `return` statement to return the calculated square.Heres the code:`pythondef square(num):return num * num`**Example usage**:`pythonprint(square(3)) # Output: 9print(square(-4)) # Output: 16print(square(0)) # Output: 0`This function takes a number as input and returns its square..<endofsentence>Wow, thats a lot. And the thought process of the LLM is also there (the text within `<think>` and `</think>`). We need to be aware of that when handling the response from the LLM. We should also make sure the LLM responds with an action plan. Later on, well see how we can prompt the LLM to do this.If your curiosity has been piqued and you want to try out a coding agent, I recommend installing Junie. It is compatible with IntelliJ IDEA, PyCharm, WebStorm, GoLand, PhpStorm, RubyMine and RustRover. To learn more about Junie, head over to its official webpage: https://www.jetbrains.com/junie/.

0 Reacties 0 aandelen 26 Views