python tutorial - Python Subprocess Module - learn python - python programming
- A running program is called a process. Each process has its own system state, which includes memory, lists of open files, a program counter that keeps track of the instruction being executed, and a call stack used to hold the local variables of functions.
- Normally, a process executes statements one after the other in a single sequence of control flow, which is sometimes called the main thread of the process. At any given time, the program is only doing one thing.
- A program can create new processes using library functions such as those found in the os or subprocess modules such as os.fork(), subprocess.Popen(), etc. However, these processes, known as subprocesses, run as completely independent entities-each with their own private system state and main thread of execution.
- Because a subprocess is independent, it executes concurrently with the original process. That is, the process that created the subprocess can go on to work on other things while the subprocess carries out its own work behind the scenes.
- The subprocess module allows us to:
- spawn new processes
- connect to their input/output/error pipes
- obtain their return codes
- It offers a higher-level interface than some of the other available modules, and is intended to replace the following functions:
- We cannot use UNIX commands in our Python script as if they were Python code.
- For example, echo name is causing a syntax error because echo is not a built-in statement or function in Python. So, in Python script, we're using print name instead.
- To run UNIX commands we need to create a subprocess that runs the command.
- The recommended approach to invoking subprocesses is to use the convenience functions for all use cases they can handle.
- Or we can use the underlying Popen interface can be used directly.
- The simplest way of running UNIX command is to use os.system().
- As expected, we got $HOME as stdout (to a terminal). Also, we got a return value of 0 which is the result of executing this command, which means there was no error in the execution.
- os.system('command with args') passes the command and arguments to our system's shell.
- By using this can actually run multiple commands at once and set up pipes and input/output redirections. :
- If we run the code above os.system('echo $HOME') in the Python IDLE, we only see the 0 because the stdout means a terminal.
- To see the command output we should redirect it to a file, and the read from it:
- Open a pipe to or from command. The return value is an open file object connected to the pipe, which can be read or written depending on whether mode is 'r' (default) or 'w'.
- The bufsize argument has the same meaning as the corresponding argument to the built-in open() function.
- The exit status of the command (encoded in the format specified for wait()) is available as the return value of the close() method of the file object, except that when the exit status is zero (termination without errors), None is returned.
- os.popen() does the same thing as os.system except that it gives us a file-like stream object that we can use to access standard input/output for that process. There are 3 other variants of popen that all handle the i/o slightly differently.
- If we pass everything as a string, then our command is passed to the shell; if we pass them as a list then we don't need to worry about escaping anything.
- This is basically just like the Popen class and takes all of the same arguments, but it simply wait until the command completes and gives us the return code.
- Run the command described by args. Wait for command to complete, then return the returncode attribute.
- The command line arguments are passed as a list of strings, which avoids the need for escaping quotes or other special characters that might be interpreted by the shell.
- The check_call() function works like call() except that the exit code is checked, and if it indicates an error happened then a CalledProcessError exception is raised.
- The standard input and output channels for the process started by call() are bound to the parent's input and output.
- That means the calling program cannot capture the output of the command.
- To capture the output, we can use check_output() for later processing.
- This function was added in Python 2.7.
- The underlying process creation and management in this module is handled by the Popen class.
- It offers a lot of flexibility so that developers are able to handle the less common cases not covered by the convenience functions.
- subprocess.Popen() executes a child program in a new process.
- On Unix, the class uses os.execvp()-like behavior to execute the child program. On Windows, the class uses the Windows CreateProcess() function.
args of subprocess.Popen()
- Should be a sequence of program arguments or else a single string.
- By default, the program to execute is the first item in args if args is a sequence. If args is a string, the interpretation is platform-dependent.
- It is recommended to pass args as a sequence.
- shell argument (which defaults to False) specifies whether to use the shell as the program to execute. If shell is True, it is recommended to pass args as a string rather than as a sequence.
- On Unix with shell=True, the shell defaults to /bin/sh.
- If args is a string, the string specifies the command to execute through the shell. This means that the string must be formatted exactly as it would be when typed at the shell prompt. This includes, for example, quoting or backslash escaping filenames with spaces in them.
- If args is a sequence, the first item specifies the command string, and any additional items will be treated as additional arguments to the shell itself. That is to say, Popen does the equivalent of:
- if given, has the same meaning as the corresponding argument to the built-in open() function:
- 0 means unbuffered
- 1 means line buffered
- any other positive value means use a buffer of (approximately) that size
- A negative bufsize means to use the system default, which usually means fully buffered
- The default value for bufsize is 0 (unbuffered)
- specifies a replacement program to execute. It is very seldom needed.
- stdin, stdout and stderr:
- specify the executed program's standard input, standard output and standard error file handles, respectively.
- Valid values are PIPE, an existing file descriptor (a positive integer), an existing file object, and None.
- PIPE indicates that a new pipe to the child should be created.
- With the default settings of None, no redirection will occur; the child's file handles will be inherited from the parent.
- Additionally, stderr can be STDOUT, which indicates that the stderr data from the child process should be captured into the same file handle as for stdout.
- Is set to a callable object, this object will be called in the child process just before the child is executed. (Unix only)
- close_fds is true, all file descriptors except 0, 1 and 2 will be closed before the child process is executed. (Unix only). Or, on Windows, if close_fds is true then no handles will be inherited by the child process. Note that on Windows, we cannot set close_fds to true and also redirect the standard handles by setting stdin, stdout or stderr.
- cwd is not None the child's current directory will be changed to cwd before it is executed. Note that this directory is not considered when searching the executable, so we can't specify the program's path relative to cwd.
- env is not None, it must be a mapping that defines the environment variables for the new process; these are used instead of inheriting the current process' environment, which is the default behavior.
- universal_newlines is True, the file objects stdout and stderr are opened as text files in universal newlines mode. Lines may be terminated by any of '\n', the Unix end-of-line convention, '\r', the old Macintosh convention or '\r\n', the Windows convention. All of these external representations are seen as '\n' by the Python program.
- startupinfo will be a STARTUPINFO object, which is passed to the underlying CreateProcess function.
- can be CREATE_NEW_CONSOLE or CREATE_NEW_PROCESS_GROUP. (Windows only)
os.popen vs. subprocess.Popen()
- This is intended as a replacement for os.popen, but it is more complicated. For example, we use
- instead of
- But it is comprehensive and it has all of the options in one unified class instead of different os.popen functions.
subprocess.Popen() - stdout and stderr
- Note that the communicate() method returns a tuple (stdoutdata, stderrdata) : ('"Hello world!"\n' ,None). If we don't include stdout=subprocess.PIPE or stderr=subprocess.PIPE in the Popen call, we'll just get None back.
- Popen.communicate() interacts with process: Send data to stdin. Read data from stdout and stderr, until end-of-file is reached.
- Wait for process to terminate. The optional input argument should be a string to be sent to the child process, or None, if no data should be sent to the child.
- So, actually, we could have done as below:
- or we can explicitly specify which one we want from proc.communicate():
- The simplest code for the example above might be sending the stream directly to console:
- The code below is to test the stdout and stderr behaviour:
- If we run it:
- Note that the message to stderr gets displayed as it is generated but the message to stdout is read via the pipe. This is because we only set up a pipe to stdout.
- Then, let's make both stdout and stderr to be accessed from Python:
- The communicate() method only reads data from stdout and stderr, until end-of-file is reached. So, after all the messages have been printed, if we call communicate() again we get an error:
- If we want messages to stderr to be piped to stderr, we do: stderr=subprocess.STDOUT.
- As we see from the output, we do not have stderr because it's been redirected to stderr.
subprocess.Popen() - stdin
- Writing to a process can be done in a very similar way. If we want to send data to the process's stdin, we need to create the Popen object with stdin=subprocess.PIPE.
- To test it let's write another program (write_to_stdin.py) which simply prints Received: and then repeats the message we send it:
- To send a message to stdin, we pass the string we want to send as the input argument to communicate():
- Notice that the message created in the write_to_stdin.py process was printed to stdout and then the return value (None, None) was printed. That's because no pipes were set up to stdout or stderr.
- Here is another output after we specified stdout=subprocess.PIPE and stderr=subprocess.PIPE just as before to set up the pipe.
subprocess.Popen() - shell pipeline
- The p1.stdout.close() call after starting the p2 is important in order for p1 to receive a SIGPIPE if p2 exits before p1.
- Here is another example for a piped command. The code gets window ID for the currently active window. The command looks like this: