Python Tutorial
Python Tutorial
Heavily based on presentations by Matt Huenerfauth (Penn State) Guido van Rossum (Google) Richard P. Muller (Caltech) ...
Python
Open source general-purpose language. Object Oriented, Procedural, Functional Easy to interface with C/ObjC/Java/Fortran Easy-ish to interface with C++ (via SWIG) Great interactive environment
You probably want 2.5.x unless you are starting from scratch. Then maybe 3.x
Technical Issues
Installing & Running Python
Binaries
Python comes pre-installed with Mac OS X and Linux. Windows binaries from http://python.org/ You might not have to do anything!
You could make the *.py file executable and add the following #!/usr/bin/env python to the top to make it runnable.
Batteries Included
Large collection of proven modules included in the standard distribution.
http://docs.python.org/modindex.html
numpy
Offers Matlab-ish capabilities within Python Fast array operations 2D arrays, multi-D arrays, linear algebra etc.
matplotlib
High quality plotting library.
#!/usr/bin/env python import numpy as np import matplotlib.mlab as mlab import matplotlib.pyplot as plt mu, sigma = 100, 15 x = mu + sigma*np.random.randn(10000) # the histogram of the data n, bins, patches = plt.hist(x, 50, normed=1, facecolor='green', alpha=0.75) # add a 'best fit' line y = mlab.normpdf( bins, mu, sigma) l = plt.plot(bins, y, 'r--', linewidth=1) plt.xlabel('Smarts') plt.ylabel('Probability') plt.title(r'$\mathrm{Histogram\ of\ IQ:}\ \mu=100,\ \sigma=15$') plt.axis([40, 160, 0, 0.03]) plt.grid(True) plt.show()
Downloads: http://matplotlib.sourceforge.net/
Monday, October 19, 2009
PyFITS
FITS I/O made simple:
>>> import pyfits >>> hdulist = pyfits.open(input.fits) >>> hdulist.info() Filename: test1.fits No. Name Type Cards Dimensions Format 0 PRIMARY PrimaryHDU 220 () Int16 1 SCI ImageHDU 61 (800, 800) Float32 2 SCI ImageHDU 61 (800, 800) Float32 3 SCI ImageHDU 61 (800, 800) Float32 4 SCI ImageHDU 61 (800, 800) Float32 >>> hdulist[0].header[targname] NGC121 >>> scidata = hdulist[1].data >>> scidata.shape (800, 800) >>> scidata.dtype.name float32 >>> scidata[30:40,10:20] = scidata[1,4] = 999
pyds9 / python-sao
Interaction with DS9 Display Python 1-D and 2-D arrays in DS9 Display FITS files in DS9
Custom Distributions
Python(x,y): http://www.pythonxy.com/
Python(x,y) is a free scientific and engineering development software for numerical computations, data analysis and data visualization
Sage: http://www.sagemath.org/
Sage is a free open-source mathematics software system licensed under the GPL. It combines the power of many existing open-source packages into a common Python-based interface.
The Basics
A Code Sample
x = 34 - 23 y = Hello z = 3.45 if z == 3.45 or y == Hello: x = x + 1 y = y + World print x print y # String concat. # A comment. # Another one.
Basic Datatypes
Integers (default for numbers)
z = 5 / 2 # Answer is 2, integer division.
Floats
x = 3.456
Strings
Can use or to specify. abc abc (Same thing.) Unmatched can occur within the string. matts Use triple double-quotes for multi-line strings or strings than contain both and inside of them: abc
Whitespace
Whitespace is meaningful in Python: especially indentation and placement of newlines. Use a newline to end a line of code.
Use \ when must go to next line prematurely.
Often a colon appears at the start of a new block. (E.g. for function and class definitions.)
Comments
Start comments with # the rest of line is ignored. Can include a documentation string as the first line of any new function or class that you define. The development environment, debugger, and other tools use it: its good style to include one.
def my_function(x, y): This is the docstring. This function does blah blah blah. # The code would go here...
Assignment
Binding a variable in Python means setting a name to hold a reference to some object.
Assignment creates references, not copies
You create a name the first time it appears on the left side of an assignment expression:
! x = 3
A reference is deleted via garbage collection after any names bound to it have passed out of scope.
Multiple Assignment
You can also assign to multiple names at the same time.
>>> x, y = 2, 3 >>> x 2 >>> y 3
Naming Rules
Names are case sensitive and cannot start with a number. They can contain letters, numbers, and underscores.
bob Bob _bob _2_bob_ bob_2 BoB
Why??
name list
memory
Assignment 1
So, for simple built-in datatypes (integers, floats, strings), assignment behaves as you would expect:
>>> >>> >>> >>> 3 x = 3 y = x y = 4 print x # # # # Creates 3, name Creates name y, Creates ref for No effect on x, x refers to 3 refers to 3. 4. Changes y. still ref 3.
Assignment 1
So, for simple built-in datatypes (integers, floats, strings), assignment behaves as you would expect:
>>> >>> >>> >>> 3 x = 3 y = x y = 4 print x # # # # Creates 3, name Creates name y, Creates ref for No effect on x, x refers to 3 refers to 3. 4. Changes y. still ref 3.
Assignment 1
So, for simple built-in datatypes (integers, floats, strings), assignment behaves as you would expect:
>>> >>> >>> >>> 3 x = 3 y = x y = 4 print x # # # # Creates 3, name Creates name y, Creates ref for No effect on x, x refers to 3 refers to 3. 4. Changes y. still ref 3.
Assignment 1
So, for simple built-in datatypes (integers, floats, strings), assignment behaves as you would expect:
>>> >>> >>> >>> 3 x = 3 y = x y = 4 print x # # # # Creates 3, name Creates name y, Creates ref for No effect on x, x refers to 3 refers to 3. 4. Changes y. still ref 3.
Assignment 1
So, for simple built-in datatypes (integers, floats, strings), assignment behaves as you would expect:
>>> >>> >>> >>> 3 x = 3 y = x y = 4 print x # # # # Creates 3, name Creates name y, Creates ref for No effect on x, x refers to 3 refers to 3. 4. Changes y. still ref 3.
Assignment 1
So, for simple built-in datatypes (integers, floats, strings), assignment behaves as you would expect:
>>> >>> >>> >>> 3 x = 3 y = x y = 4 print x # # # # Creates 3, name Creates name y, Creates ref for No effect on x, x refers to 3 refers to 3. 4. Changes y. still ref 3.
Assignment 2
For other data types (lists, dictionaries, user-defined types), assignment works differently.
These datatypes are mutable. When we change these data, we do it in place. We dont copy them into a new memory address each time. If we type y=x and then modify y, both x and y are changed.
immutable
>>> >>> >>> >>> 3 x = 3 y = x y = 4 print x
mutable
x = some mutable object y = x make a change to y look at x x will be changed as well
# a now references the list [1, 2, 3] # b now references what a references # this changes the list a references # if we print what b references, # SURPRISE! It has changed
Sequence Types
1. Tuple
A simple immutable ordered sequence of items Items can be of mixed types, including collection types
2. Strings Immutable Conceptually very much like a tuple 3. List Mutable ordered sequence of items of mixed types
Similar Syntax
All three sequence types (tuples, strings, and lists) share much of the same syntax and functionality.
Key difference:
Tuples and strings are immutable Lists are mutable
The operations shown in this section can be applied to all sequence types
most examples will just show the operation performed on one
Sequence Types 1
Tuples are defined using parentheses (and commas).
>>> tu = (23, abc, 4.56, (2,3), def)
Sequence Types 2
We can access individual members of a tuple, list, or string using square bracket array notation. Note that all are 0 based
>>> tu = (23, abc, 4.56, (2,3), def) >>> tu[1] # Second item in the tuple. abc >>> li = [abc, 34, 4.34, 23] >>> li[1] # Second item in the list. 34 >>> st = Hello World >>> st[1] # Second character in string. e
Return a copy of the container with a subset of the original members. Start copying at the first index, and stop copying before the second index.
>>> t[1:4] (abc, 4.56, (2,3))
Omit the first index to make a copy starting from the beginning of the container.
>>> t[:2] (23, abc)
Omit the second index to make a copy starting at the first index and going to the end of the container.
>>> t[2:] (4.56, (2,3), def)
Note the difference between these two lines for mutable sequences:
>>> list2 = list1 # 2 names refer to 1 ref # Changing one affects both
The in Operator
Boolean test whether a value is inside a container:
>>> t >>> 3 False >>> 4 True >>> 4 False = [1, 2, 4, 5] in t in t not in t
Be careful: the in keyword is also used in the syntax of for loops and list comprehensions.
The + Operator
The + operator produces a new tuple, list, or string whose value is the concatenation of its arguments.
>>> (1, 2, 3) + (4, 5, 6) (1, 2, 3, 4, 5, 6) >>> [1, 2, 3] + [4, 5, 6] [1, 2, 3, 4, 5, 6] >>> Hello + + World Hello World
The * Operator
The * operator produces a new tuple, list, or string that repeats the original content.
>>> (1, 2, 3) * 3 (1, 2, 3, 1, 2, 3, 1, 2, 3) >>> [1, 2, 3] * 3 [1, 2, 3, 1, 2, 3, 1, 2, 3] >>> Hello * 3 HelloHelloHello
Tuples: Immutable
>>> t = (23, abc, 4.56, (2,3), def) >>> t[2] = 3.14 Traceback (most recent call last): File "<pyshell#75>", line 1, in -topleveltu[2] = 3.14 TypeError: object doesn't support item assignment
You cant change a tuple. You can make a fresh tuple and assign its reference to a previously used name.
>>> t = (23, abc, 3.14, (2,3), def)
Lists: Mutable
>>> li = [abc, 23, 4.34, 23] >>> li[1] = 45 >>> li [abc, 45, 4.34, 23] We can change lists in place. Name li still points to the same memory reference when were done. The mutability of lists means that they arent as fast as tuples.
# number of occurrences
To convert between tuples and lists use the list() and tuple() functions:
li = list(tu) tu = tuple(li)
Dictionaries
63
Monday, October 19, 2009
You can define, modify, view, lookup, and delete the key-value pairs in the dictionary.
Using dictionaries
>>> d = {user:bozo, pswd:1234} >>> d[user] bozo >>> d[pswd] 1234 >>> d[bozo] Traceback (innermost last): File <interactive input> line 1, in ? KeyError: bozo >>> d = {user:bozo, pswd:1234} >>> d[user] = clown >>> d {user:clown, pswd:1234} >>> d[id] = 45 >>> d {user:clown, id:45, pswd:1234}
>>> d = {user:bozo, p:1234, i:34} >>> del d[user] # Remove one. >>> d {p:1234, i:34} >>> d.clear() # Remove all. >>> d {}
>>> d = {user:bozo, p:1234, i:34} >>> d.keys() # List of keys. [user, p, i] >>> d.values() # List of values. [bozo, 1234, 34] >>> d.items() # List of item tuples. [(user,bozo), (p,1234), (i,34)]
Functions
Functions
def creates a function and assigns it a name return sends a result back to the caller Arguments are passed by assignment Arguments and return types are not declared
def <name>(arg1, arg2, ..., argN): ! <statements> ! return <value> def times(x,y): ! return x*y
Optional Arguments
Can define defaults for arguments that need not be passed
def func(a, b, c=10, d=100): ! print a, b, c, d >>> func(1,2) 1 2 10 100 >>> func(1,2,3,4) 1,2,3,4
Gotchas
All functions in Python have a return value
even if no return line inside the code.
Functions without a return return the special value None. There is no function overloading in Python.
Two different functions cant have the same name, even if they have different arguments.
Functions can be used as any other data type. They can be:
Arguments to function Return values of functions Assigned to variables Parts of tuples, lists, etc
Control of Flow
71
Monday, October 19, 2009
Examples
if x == 3: print X equals 3. elif x == 2: print X equals 2. else: print X equals something else. print This is outside the if. assert(number_of_players < 5)
x = 3 while x < 10: if x > 7: x += 2 continue x = x + 1 print Still in the loop. if x == 8: break print Outside of the loop.
for x in range(10): if x > 7: x += 2 continue x = x + 1 print Still in the loop. if x == 8: break print Outside of the loop.
Modules
Namespace partitioning
Group data together with functions used for that data
Modules
Modules are functions and variables defined in separate files Items are imported using from or import
from module import function function() import module module.function()
What is an Object?
A software item that contains variables and methods Object Oriented Design focuses on
Encapsulation:
dividing the code into a public interface, and a private implementation of that interface
Polymorphism:
the ability to overload standard operators so that they have appropriate behavior based on their context
Inheritance:
the ability to create subclasses that contain specializations of their parents
Example
class ! def !! !! ! def !! ! def !! !! !! atom(object): __init__(self,atno,x,y,z): self.atno = atno self.position = (x,y,z) symbol(self): # a class method return Atno_to_Symbol[atno] __repr__(self): # overloads printing return '%d %10.4f %10.4f %10.4f' % ! (self.atno, self.position[0], ! self.position[1],self.position[2])
>>> at = atom(6,0.0,1.0,2.0) >>> print at 6 0.0000 1.0000 2.0000 >>> at.symbol() 'C'
Atom Class
Overloaded the default constructor Defined class variables (atno,position) that are persistent and local to the atom object Good way to manage shared memory:
instead of passing long lists of arguments, encapsulate some of this data into an object, and pass the object. much cleaner programs result
Overloaded the print operator We now want to use the atom class to build molecules...
Molecule Class
class ! def !! !! ! def !! ! def !! !! !! !! !! molecule: __init__(self,name='Generic'): self.name = name self.atomlist = [] addatom(self,atom): self.atomlist.append(atom) __repr__(self): str = 'This is a molecule named %s\n' % self.name str = str+'It has %d atoms\n' % len(self.atomlist) for atom in self.atomlist: ! str = str + `atom` + '\n' return str
Note that the print function calls the atoms print function
Code reuse: only have to type the code that prints an atom once; this means that if you change the atom specification, you only have one place to update.
Inheritance
class qm_molecule(molecule): def addbasis(self): self.basis = [] for atom in self.atomlist: self.basis = add_bf(atom,self.basis)
__init__, __repr__, and __addatom__ are taken from the parent class (molecule) Added a new function addbasis() to add a basis set Another example of code reuse
Basic functions don't have to be retyped, just inherited Less to rewrite when specifications change
Overloading
! ! ! ! ! ! ! ! class qm_molecule(molecule): def __repr__(self): str = 'QM Rules!\n' for atom in self.atomlist: ! str = str + `atom` + '\n' return str
Now we only inherit __init__ and addatom from the parent We define a new version of __repr__ specially for QM
Anything with one leading underscore is semiprivate, and you should feel guilty accessing this data directly.
_b
Sometimes useful as an intermediate step to making data private
86
Monday, October 19, 2009
>>> a = 1 >>> b = 2.4 >>> c = 'Tom' >>> '%s has %d coins worth a total of $%.02f' % (c, a, b) 'Tom has 1 coins worth a total of $2.40'