Biology concepts – protein, amino acids, non-standard amino acids, peptide bond
The fourth group is what makes each aa different. Called an R group, this side chain can be small or big, neutral or charged, and gives the aa its properties. The R stands for something, but that story is just too long.
Gaston MA, Jiang R, & Krzycki JA (2011). Functional context, biosynthesis, and genetic encoding of pyrrolysine. Current opinion in microbiology, 14 (3), 342-9 PMID: 21550296
Tofu comes in sixth and soybeans are seventh. This is why humans have sharp canine teeth – we're meat eaters. You can live happily (well, somewhat happily) as a vegetarian; you just have to work much harder at it.
So why is protein so important? How about, because it is one of the four major biomolecules and without it you die a horrible death? Sounds like a good reason to me.
Proteins reside in every cell of every living organism, from prokaryotes to your favorite uncle. There isn’t a job in a cell that proteins don’t have their hands in; proteins even perform numerous tasks at the extracellular level. Heck, that spider web hanging from your dusty Stairmaster is made of protein!
From prokaryotes to spiny echidnas to rosebushes, let’s look where proteins are involved in life. Proteins provide the structure from which cells hold their shape and onto which they build a membrane. Proteins do the talking, providing chemical signals and ways to sense chemical signals.
Proteins do the dirty work; as enzymes they put molecules together, cut them apart, and change their parts around. And most times, they make these reactions happen faster than they would otherwise and without being used up in the process.
A typical cell may contain 10 billion protein molecules. However, not every cell has the same proteins. Many proteins are necessary for every cell, but others have specialized functions needed in only some cells. The exception is unicellular organisms. Their one cell must be able to produce every kind of protein they might ever need.
Space is at a premium, so cells can’t waste room on proteins that aren’t needed right now. Therefore, making protein must be efficient, tightly regulated, and fast. Over 2000 new protein molecules are made every second in most cells, while some proteins exist only to destroy unneeded or old proteins.
Humans can make about 2 million different proteins, but we only have about 25,000 genes that code for them. We accomplish this by having some genes produce many different proteins, just by changing the parts of the gene used. These alternative splice variant proteins may have different functions even though they come from the same gene. For example, the cSlo gene is required for hearing, and each one of the 576different splice variants is responsible for sensing a different frequency. Biology is just so dang efficient.
Now that you know how important proteins are, let’s find out what they are. Proteins are polymers (poly = many, and mer= subunit) made up of bonded amino acid mers. Proteins come in many sizes; the TRP-Cage protein of gila monster spit is a polymer of only 20 amino acids, while the titin protein of your connective tissue is over 38,000 amino acids long.
Maybe we'll dig into the degeneracy of the genetic code when we talk about nucleic acids, but for now let’s just accept that DNA triplets code for different amino acids, and the order of the codons determines the order in which amino acids are linked to form a specific protein. The order of the different amino acids is the key. Why? I’m glad you asked.
Amino acids (or aa’s) are all small molecules made up of carbon, hydrogen, oxygen, nitrogen, and sometimes sulfur – five of last week’s “elements of life.” It’s the arrangement of these elements that makes an amino acid. Refer to the picture below for a visual aid. The central carbon is bound to four other things (often called moieities). One is simply a hydrogen. Another is an amino group (contains the nitrogen). The third is a carboxylic acidgroup. Get it? amino acid.
The fourth group is what makes each aa different. Called an R group, this side chain can be small or big, neutral or charged, and gives the aa its properties. The R stands for something, but that story is just too long.
In glycine, the R group is merely another H, but in tryptophan it contains complex rings. We have talked about how tryptophan is the least used amino acid; it is bulky and introduces big bends in the peptide. We’ll show that bends, kinks and other interactions between aa’s are important for the protein function.
Most organisms can make all the amino acids they need, but mammals are the exception. We have abandoned (genetically) pathways for making some aa’s, so we must get them from our diet. These are the essential amino acids, of which there are nine if you are healthy. Tryptophan must acquired by all animals – good thing plants still have the recipe.
Ribosomes (made of proteins and nucleic acids) link the individual aa’s together in the order demanded by DNA via the mRNA. The bond that connects them is called a peptide bond, and is a “dehydration” or “condensation” reaction.
Look at the amino acid picture again; the peptide bonding process kicks out water, ie. dehydration (de = lose, and hydro– water). Water forms from seemingly nowhere, like condensation on your mirror. See how fitting the names are?
When in a protein chain (also called a peptide), the order of aa’s is called the protein’s primary (1˚) structure. The primary structure in turn dictates the secondary (2˚) structure, which is a folding of small regions of the protein based on the interactions of the side chains of closely associated amino acids.
In turn, the folding of small regions brings together aa’s from farther apart, and they fold up based on their interactions. This is the tertiary (3˚) structure of the protein. If a protein needs more than one peptide chain to be functional, the shape that those different chains form when they interact is called the quaternary (4˚) structure.
The hemoglobin that carries oxygen in our red blood cells is made up of four protein subunits. Why is this important – because what the protein does in life is completely dependent on its three dimensional shape. Lots of aa’s means lots of potential shapes. This is in itself one of the greatest exceptions, since one of the basic tenets of biology is “form follows function.” But with proteins, function follows form.
For the greatest number of possible combinations and shapes, it’s lucky that DNA codes for 20 aa’s. Or are there more? Proteinogenic aa’s are those that can be added into a growing peptide chain, and there are actually 22 of them. The two exceptions are selenocysteine (like cysteine with selenium substituting for sulfur) and pyrrolysine(like lysine with a ring structure added to the end).
We talked last week about the functions of selenocysteineand how it can be incorporated into a peptide even though there isn’t a normal mRNA codon dedicated to it. Pyrrolysine is similar in that it becomes coded for after the modification of what is usually a stop codon, in this case UAG (a signal to add pyrrolysine is located after the UAG codon).
Pyrrolysine is used by methanogenic (methane producing) archaea and bacteria. It's important in the active site of the enzymes that actually produce the methane. New research is showing that more organisms than previously believed use pyrrolysine. A 2011 study identified more than 16 archaea and bacteria with pyrrolysine coding mRNA modifications, but it looks like there may be more.
A 2013 study indicates that the typical modification of the mRNA that occurs 100 bp downstream of the UAG stop codon isn't even there in some pyrrolysine-coding genes. One hypothesis is that in genes without the modification, the UAG sometimes acts as a stop codon and sometimes incorporates a pyrrolysine. Therefore, there are truncated (prematurely stopped) and full-length versions of the protein in the cell, and the relative number of each can be affected by local conditions and stressors.
In this paper, the authors have developed a different predictor, which doesn’t rely solely on the presence of the modification. Using it, they have identified many new candidate genes in archaea and bacteria that could be using pyrrolysines. Here’s my question – all organisms use selenocysteine, but it seems only arachaea and a few bacteria use pyrrolysine. Why did it go away in higher organisms? Can it only be used for methane production? Please, no methane production jokes.
Pyrrolysine and selenocysteine are coded for by mRNA and are added to proteins, so we definitely have 22 aa’s, but could there be more? You betcha. There are over 300 non-standard amino acids, but that isn’t such a big deal. Remember the definition of amino acid; a central carbon with a hydrogen, a carboxylic acid, an amino group, and something else attached. It isn’t a wonder there are many of them.
Other non-standard aa’s are produced as intermediates in other pathways and are not used in proteins. The list of them is great and their functions are even greater, but some act as neurotransmitters, others are important in vitamin synthesis, especially in plants. Still think life uses just 20 amino acids?
Next week we can finish up proteins. Life is very selective with the form of its amino acids – except when it isn’t.
Theil Have C, Zambach S, & Christiansen H (2013). Effects of using coding potential, sequence conservation and mRNA structure conservation for predicting pyrrolysine containing genes. BMC bioinformatics, 14 PMID: 23557142
Theil Have C, Zambach S, & Christiansen H (2013). Effects of using coding potential, sequence conservation and mRNA structure conservation for predicting pyrrolysine containing genes. BMC bioinformatics, 14 PMID: 23557142
For more information or classroom activities, see:
Dietary proteins –
Functions of proteins –
http://www.unc.edu/depts/our/hhmi/hhmi-ft_learning_modules/proteinsmodule/proteins/proteinfunxn.html
Standard amino acids –
Peptide bond –
http://www.chem.wisc.edu/deptfiles/genchem/netorial/modules/biomolecules/modules/protein1/prot15.htm
Protein structure –
Non-standard amino acids -