Date: | 2019-08-05 |
---|---|
Author: | Gábor Nyers |
tags: | python |
category: | python_workshop |
summary: | The things you need to get started with developing in Python |
licence: | CC BY-NC 4.0 https://creativecommons.org/licenses/by-nc/4.0/ |
Agenda
Prep
Get Material: URL: https://github.com/gnyers/python-tuesday
git clone [email protected]:gnyers/python-tuesday.git
Install Editor, e.g.: VSCode: https://code.visualstudio.com/
What is a Python distribution?
A software collection made available by the Python project or some other organization to execute Python programs. It contains a Python run-time environment, the Python Standard Library and (optionally) additional Python modules.
How many Python distros are there?
The Python Wiki lists more than a dozen of Python distributions, though several of the pages are unavailable.
We will be discussing the two most well-known Python distributions:
- The official Python distro, and
- The Anaconda product
The canonical Python run-time environment developed, managed and distributed by the Python Software Foundation the official custodian of the Python programming language, its reference implementation and the well-known Python trademark. To find out more visit the website at http://python.org.
It contains:
- Python CLI interpreter, which is used for both running programs and for interactive work.
- The IDLE GUI interpreter, which can be used for interactive work.
- Required (binary) libraries
- The Python Standard Library, which is mostly written in Python, but lower level operating system related functions are written in C.
- The official documentation of this minor version
- The Language Reference
- Standard Library
- Official HOWTO's and Tutorials
- C-API documentation (extending Python itself, OR extending an application with Python!)
The official Python distribution does not contain:
- An editor or IDE
- Python modules not part of the official Standard Library, such as:
The Anaconda distribution is made up of the official Python distro and a list of 1500+ selected Python modules. This product is aimed mainly at data science and scientific computing customers and is being developed and managed by a commercial company. While the product itself is completely open source, the company provides commercial services around it.
Anaconda contains the following major components:
- The official Python distribution (both v2 and v3)
- Anaconda Navigator: a convenient GUI application to start Anaconda applications, manage virtual environments and search for- and update modules.
- Conda package manager (a more advanced version of
pip
) - Some 1500+ Python modules (see Anaconda Pkg List), of which ~200 are automatically installed.
- Spyder, an open source IDE (Integrated Development Environment) specializing in scientific use-cases.
- Jupyter Notebooks and the IPython interpreter
- etc...
Further links to get started with Anaconda:
Depending on you specific case there might be server consideration when selecting a Python distribution:
- Download and Install size:
- What kind of applications will you be developing:
- The level of your Programming/Python knowledge:
One of the most important tool when you developing software is the actual application which you use to edit your code. This application goes by different names, such as an "editor" or an "IDE" (Integrated Development Environment). The difference between an editor and an IDE has to do with the level of specialization the tool offers for the language.
- Generic editors, such as: VIM (**) or Emacs (**)
- Light-weight IDEs: Spyder (**), Sublime (*), Atom (**), Geany (**)
- Heavy-duty development environments, like: PyCharm (*), Eclipse's PyDev (**) or VisualStudio (*)
- Honorable mention: Jupyter Notebooks (**)
where:
- (*): Proprietary product
- (**): Open source project
Support for the Python syntax and best practices, i.e.:
<Tab>
to 4x``<Space>`` coversion, auto-indentationSyntax Highlighting: colorize the code with multiple colors, which highlight different parts of the code, e.g.: comment, strings, reserved words
Tab completion
Variable overview: list of names of already defined variables, functions and class names.
Snippets: inserting small pieces of often used code with a few key-strokes
Automatically perform static code analytics: verify code quality with tools such as pep8, pylint, pyflakes , black, mypy
Things as: unused, unrecognized or mistyped variables, usage of <Space>s, un-imported modules, etc...
Terminal support, i.e.: ability to run the program being developed and watch its output
Support for virtual environments (related to previous point)
Is it possible to use your coding tool in a production environment?
Support for refactoring: consistently renaming variables, function- or class names throughout a code base
- "Python IDEs and Code Editors (Guide)" https://realpython.com/python-ides-code-editors-guide/
- "Which Python static analysis tools should I use?" https://www.codacy.com/blog/which-python-static-analysis-tools-should-i-use/
Based on the "Getting Started with Python in VS Code" article above.
- Download and install VSCode from https://code.visualstudio.com/Download
- Install the "Python extension for VSCode" by Microsoft using the built-in package manager
- Select your Python interpreter:
- Activate the Command Palette with the keystroke
<Ctrl>+<Shift>+P
- Start typing "Select Python Interpreter", which should automatically discover the installed Python run-time.
- Activate the Command Palette with the keystroke
- Create a new
helloworld
application and run it by right-clicking on the code and selecting the "Run Python File in Terminal"
Further read:
- "Getting Started with Python in VS Code" https://code.visualstudio.com/docs/python/python-tutorial
VIM is famous (or infamous) of its steep learning curve, yet it still has a huge number of users. So there must be something to it, right? Anyway, regardless if you're an experienced VIM person, or just curious about the fuss, this is how to get started with Python development in VIM. It is a more involved process that getting started with VSCode, yet people seem to thing it's worth the effort.
- Download and Install VIM
- Install a plug-in manager for VIM, such as Vundle
- Follow one of the more detailed guides below.
Further read:
- "VIM and Python – A Match Made in Heaven" https://realpython.com/vim-and-python-a-match-made-in-heaven/
- Setting up VIM as an IDE for Python https://medium.com/@hanspinckaers/setting-up-vim-as-an-ide-for-python-773722142d1d
Typically there are 3 types of modules you'll need to work with on a daily bases:
- Modules of the Standard Library
- Modules which are part of the application being developed
- 3rd party modules, e.g. available in PyPI or Anaconda Repository.
The availability of the first two are trivial, but finding and installing 3rd party modules is a concern.
There are several sources of to install Python modules from, such as:
- PyPI: the Python Package Index, which is the official 3rd party repository. The site is managed by the Python Software Foundation.
- Anaconda Repository_: the repository managed by Anaconda, the commercial entity behind the Python distro of the same name.
Tools to manage 3rd party modules:
- pip: the de-facto module management tool.
- conda: a much more sophisticated manager for modules developed primarily for the Anaconda Python distribution
- pipenv: similarly sophisticated capabilities as conda, but generic
Shared features:
- basic package management: searching, downloading, installing, up- and downgrading and removing of modules.
- support for the several different Python package formats (i.e.: the distributable, (usually) platform-independent archive containing stuff needed to install or compile a module
Differences:
pipenv
andconda
can do everythingpip
can, with support of more sophisticated dependency management and securitypipenv
andconda
manage Python virtual environments,pip
does not.
Let's install the requests
module using pip
:
1 $ pip install requests
2 Collecting requests
3 Downloading https://files.pythonhosted.org/packages/...
4 100% |████████████████████████████████| 61kB 2.1MB/s
5 Collecting idna<2.9,>=2.5 (from requests)
6 Downloading https://files.pythonhosted.org/packages/...
7 100% |████████████████████████████████| 61kB 4.0MB/s
8 Collecting certifi>=2017.4.17 (from requests)
9 Downloading https://files.pythonhosted.org/packages/...
10 100% |████████████████████████████████| 163kB 1.9MB/s
11 Collecting urllib3!=1.25.0,!=1.25.1,<1.26,>=1.21.1 (from requests)
12 Downloading https://files.pythonhosted.org/packages/...
13 100% |████████████████████████████████| 153kB 3.4MB/s
14 Collecting chardet<3.1.0,>=3.0.2 (from requests)
15 Downloading https://files.pythonhosted.org/packages/...
16 100% |████████████████████████████████| 143kB 2.4MB/s
17 Installing collected packages: idna, certifi, urllib3, chardet, requests
18 Successfully installed certifi-2019.6.16 chardet-3.0.4 idna-2.8 requests-2.22.0 urllib3-1.25.3
19 You are using pip version 9.0.3, however version 19.1.1 is available.
20 You should consider upgrading via the 'pip install --upgrade pip' command.
What just happened:
- lines 2-4:
pip
has downloaded the archive containing therequests
module - lines 5-16: based on information in the archive, it then (recursively) downloaded all other required modules the current one depends on.
- lines 17-18: once all required packages are downloaded it installed them
- lines 19-20:
pip
has thrown a warning about its own outdated version
Get a list of installed modules:
$ pip list
certifi (2019.6.16)
chardet (3.0.4)
idna (2.8)
pip (9.0.3)
requests (2.22.0)
setuptools (39.0.1)
urllib3 (1.25.3)
You are using pip version 9.0.3, however version 19.1.1 is available.
You should consider upgrading via the 'pip install --upgrade pip' command.
Now let's get rid of those annoying warnings about pip
's version by
updating it:
$ pip install --upgrade pip
Cache entry deserialization failed, entry ignored
Collecting pip
Downloading https://files.pythonhosted.org/packages/.../pip-19.1.1-py2.py3-none-any.whl (1.4MB)
100% |████████████████████████████████| 1.4MB 528kB/s
Installing collected packages: pip
Found existing installation: pip 9.0.3
Uninstalling pip-9.0.3:
Successfully uninstalled pip-9.0.3
Successfully installed pip-19.1.1
Warnings are gone; also note how the latest version presents the same information as above:
$ pip list
Package Version
---------- ---------
certifi 2019.6.16
chardet 3.0.4
clipboard 0.0.4
idna 2.8
pip 19.1.1
pyperclip 1.7.0
requests 2.22.0
setuptools 39.0.1
urllib3 1.25.3
- Python Virtual Environment:
- Used run conflicting Python applications, in terms of modules, module versions or even interpreter versions. For this reason they need to be isolated from one another. It is a best practice to develop each application in its own virtual environment.
Consider the following:
- A virtual environment (or venv) is very simple and very light-weight isolation mechanism to allow the use of conflicting modules or even different Python major (and minor) versions.
- Please note that this technique has nothing to do with virtualization (such
as VMware, VirtualBox or KVM) nor containers create by such tools as
LXC
,Docker
orsystemd-nspawn
. - Python virtual environments are "just" a separate sets of directories and appropriate environment variables resulting in a separate Python run-time environment.
- If the same module is used in multiple
venv
's, it needs to be installed multiple times.
A venv
can be created manually or programmatically (i.e. from code).
To manually create a venv
execute the following steps:
Create a directory which will be the
root
of thevenv
.mkdir -p ~/.virtualenvs/devenv
Populate the
root
directory:python3 -m venv ~/.virtualenvs/devenv
Every time you want to use a venv
you need to activate it either
interactively or as a batch process during the startup of an application.
Use a venv
interactively or with VIM
Activate the
venv
from a directoryOn Linux or MacOS X, if
venv
is installed in~/.virtualenvs/devenv
:source ~/.virtualenvs/devenv/bin/activate
On Windows, from dir :
C:/temp/devenv/bin/activate.bat
This will change the prompt to show the name of the active Python
venv
, e.g.:(devenv) user@host $
Above script will modify environmental variables such that when typing
python
, the interpreter, the libraries and modules in thevenv
will be used.Note: These settings are temporary and only active in the current session! #If the session is closed, you'll need to re-activate.
Run your Python application the usual way:
$ python myprogram.ph
Note: you don't need to specify explicitly
python3
, because the python interpreter of thevenv
will be the first the OS founds.To de-activate the
venv
and restore the system-wide Python settings simply execute:deactivate
Further read:
- The Official Python tutorial "Virtual Environments and Packages" https://docs.python.org/3/tutorial/venv.html
- "Managing environments" - Anaconda documentation https://docs.anaconda.com/anaconda/navigator/tutorials/manage-environments/
Setup VSCode to use venv
Please note that VSCode will automatically discover Python environments of the following types:
- system wide Python environments
- virtual environments located in any of the following special directories:
- $HOME/.virtualenvs
- $HOME/.pyenv
- $HOME/Envs
In case you'd like to use an environment from a non-standard location use the following steps:
Start VSCode
Open your project
Edit your settings by selecting
File
->Preferences
->Settings
menus. This will open a new tab titled: "Settings"or hitting the
<Ctrl> + ,
(comma) keystrokeIn the "Search" field type:
venv
and find the "Python: Venv Folder" parameter.Add the parent folder containing the root of your virtual environment.
For example: if you've created a virtual environment using:
mkdir /tmp/myenv && python3 -m venv /tmp/myenv
You will need to add the
/tmp
directory to thepython.venvPath
VSCode parameter. (btw: this will be stored in the~/.config/Code/User/settings.json
file.After this select the correct environment using the Command Palette -> "Select Python Interpreter" action.
Restart any VSCode terminal to propagate these changes and use the correct interpreter when executing your Python code.
We'll use a simple "echo" program to get started with a few best practices. Let's get started with the following:
1 import sys
2 print(' '.join(sys.argv[1:]))
- line 1: import the Standard Library's sys module, which contains many data structures related to the currently running interpreter
- line 2: in order of how Python evaluates things:
- get a reference to the
argv
variable in the sys module (which is alist
) - get all but the 1st element of the list (see Slicing), which is always the name of the currently running Python program
- take the elements of the list and "join" them together using spaces
(because we asked for spaces in "
' '.join(...)
", which will return astr
- print this
str
out to the terminal
- get a reference to the
Now let's add a few things to our program:
1 #!/usr/bin/env python
2 '''A simple program to echo back all its CLI arguments
3 '''
4
5 import sys
6 print(' '.join(sys.argv[1:]))
- line 1: an instruction to the operating system (Linux, MacOS X and Unix only!) with what interpreter we wish to run this file. That is: provided that our program file is "executable" (see file permissions)
- lines 2-3: a rudimentary "documentation" to our program using a simple docstring
Suppose that before echoing, we wish to modify the input:
1 #!/usr/bin/env python
2 '''A simple program to echo back all its CLI arguments
3 '''
4
5 import sys
6 answer = ' '.join(sys.argv[1:])
7 answer = answer.title()
8 print(answer)
- line 6: instead of printing out, the
str
is stored in the variableanswer
- line 7: by invoking the
.title()
method, we capitalize all words (see also the documentation about other string methods)
Using the pyperclip package we can access the clipboard of our desktop environment.
Install pyperclip
by executing the following in a terminal:
$ pip install pyperclip Collecting pyperclip Using cached https://files.pythonhosted.org/packages/.../pyperclip-1.7.0.tar.gz Installing collected packages: pyperclip Running setup.py install for pyperclip ... done Successfully installed pyperclip-1.7.0
Now create new Python file in your editor with the following content.
Warning: make sure to give this file any other name than clipboard.py
,
otherwise this will conflict with the module's name.
1 #!/usr/bin/env python
2 # -*- coding: utf-8 -*-
3
4 import clipboard
5
6 # Put a string on the clipboard
7 clipboard.copy('Beautiful is better than ugly.')
8
9 print('Retrieve the content of the clipboard:\n', clipboard.paste())
line 4: import the
clipboard
module;NOTE: we've installed the
pyperclip
package, which has created a moduleclipboard
!line 7: put the string "Beautiful is better than ugly." (see The Zen of Python)
line 9: print the content of the clipboard
NOTE: after running the program, the string remains on the clipboard, so you may try to paste this in a different application
In one of the previous example we've already worked with CLI arguments. This example is a significant jump in terms of sophistication, but with this we'll arrive to the definitive solution for this common programming pattern.
CLI arguments are one of oldest and most frequently used mechanism to provide some information to a program. Because it is so old (going back to the '70s) it has a wide variety of appearances depending on:
operating system: e.g. Unix, MacOS X and Linux usually use
-h
or--help
, while Windows traditionally uses/h
style: Unix vs. GNU style CLI arguments, the former being compact, the latter more informative
complexity: all the following forms can mean the same
'-vvvltp 8000'
'-v -v -v -l -t -p 8000'
'--verbose 3 --long --time --port 8000
A practical example showing just how much information can be conveyed using
CLI arguments to a seemingly simple program such as 'ls'
:
$ ls --help Usage: ls [OPTION]... [FILE]... List information about the FILEs (the current directory by default). Sort entries alphabetically if none of -cftuvSUX nor --sort is specified. Mandatory arguments to long options are mandatory for short options too. -a, --all do not ignore entries starting with . -A, --almost-all do not list implied . and .. --author with -l, print the author of each file -b, --escape print C-style escapes for nongraphic characters --block-size=SIZE with -l, scale sizes by SIZE when printing them; e.g., '--block-size=M'; see SIZE format below -B, --ignore-backups do not list implied entries ending with ~ -c with -lt: sort by, and show, ctime (time of last modification of file status information); with -l: show ctime and sort by name; otherwise: sort by ctime, newest first ...
Python provides multiple modules to deal with the varying levels of complexity of parsing command-line arguments. A few examples:
- optparse: a deprecated module in the Standard Library
- argparse: the currently supported module in the Standard Library to solve CLI parsing (see also the argparse tutorial)
- docopt: a very intuitive module, which parses a
docstr
describing the program's usage conforming to the traditional notation for documentation
Right now we'll be checking out a few simpler use-cases of the argparse
module.
The following code is perhaps the simplest first step:
1 #!/usr/bin/env python3
2 # -*- coding: utf-8 -*-
3
4 import argparse
5 p = argparse.ArgumentParser()
6 p.add_argument('-n', '--name',
7 type=str,
8 required=False,
9 default='John Doe',
10 help='Your name')
11 args = p.parse_args()
- line 4: import the
argparse
module from the Standard Library - line 5: create a new argument parser instance, which will be able to parse
a list of strings (typically the CLI arguments from
sys.argv
). This is a container of parser objects for individual arguments, such as '-n' or '--name' etc... - lines 6-10: a multi-line instruction to add the 1st argument parser, which
defines the requirements for the
name
argument:- line 6: the CLI argument designated with this either options
-n
or--name
will interpreted as thename
parameter - line 7: the type of this parameter is
str
- line 8: it is an optional argument, so it may be omitted
- line 9: if omitted, the default value of it will be "John Doe"
- line 10: when the '-h' or '--help' argument is provided, this string elaborates on the meaning of this parameter. The description of the parameter may be multiple lines.
- line 6: the CLI argument designated with this either options
Now let's execute the program in two different ways
without any arguments
$ python3 cliargs1.py -h
No response, which is correct!
with a
'--help'
argument$ python3 cliargs1.py -h usage: cliargs1.py [-h] [-n NAME] optional arguments: -h, --help show this help message and exit -n NAME, --name NAME Your name
Extend the program by adding a new argument '--age'
to the CLI parser and
printing the parsed values:
1 #!/usr/bin/env python3
2 # -*- coding: utf-8 -*-
3
4 import argparse
5 p = argparse.ArgumentParser()
6 p.add_argument('-n', '--name',
7 type=str,
8 required=False,
9 default='John Doe',
10 help='Your name')
11 p.add_argument('-a', '--age',
12 type=int,
13 required=False,
14 default=99,
15 help='Your age')
16 args = p.parse_args()
17 print('Name:', args.name)
18 print('Age :', args.age)
Execute the program in the following ways:
No arguments:
$ python3 cliargs2.py Name: John Doe Age : 99
The program prints out the default values of both parameters.
Only the
'--help'
argument will instruct the program to print out its help information.$ python3 cliargs2.py --help usage: cliargs2.py [-h] [-n NAME] [-a AGE] optional arguments: -h, --help show this help message and exit -n NAME, --name NAME Your name -a AGE, --age AGE Your age
Provide a value for both arguments:
$ python3 cliargs2.py --name Jane --age 32 Name: Jane Age : 32