15.02.01 · orgchem / functional-groups

Functional groups and nomenclature

shipped3 tiersLean: none

Anchor (Master): IUPAC — Nomenclature of Organic Chemistry: IUPAC Recommendations and Preferred Names 2013 (Blue Book); Favre & Powell — Nomenclature of Organic Chemistry: IUPAC Recommendations and Preferred Names 2013 (RSC, 2014)

Intuition [Beginner]

An organic molecule can contain dozens or hundreds of atoms, but the chemistry is dominated by a few small structural motifs. These motifs are functional groups: clusters of atoms that confer characteristic reactivity on any molecule that contains them. A hydroxyl group ( $-OH$ ) makes a molecule an alcohol, regardless of whether it has 2 carbons or 20. A carbonyl ( $C=O$ ) bonded to a hydrogen makes it an aldehyde. The rest of the molecule is the carbon backbone -- important for physical properties, but the functional group is what determines the reactions.

Why does this matter? Because learning the behaviour of about a dozen functional groups gives you predictive power over millions of compounds. If you know what alcohols do, you know what ethanol, butanol, cholesterol, and any other alcohol will do in a given reaction. The functional group is the handle.

Naming organic compounds systematically is the other half of this unit. Common names (acetone, formic acid, toluene) are useful but uninformative about structure. The IUPAC system names every compound from a set of rules: find the longest carbon chain, number it to give the functional group the lowest possible locant, name the substituents, and assemble the name in a fixed order. The result is a unique name that encodes the entire structure.

The functional-group hierarchy determines which group "wins" when multiple groups are present. A molecule with both a carboxylic acid and an alcohol is named as a carboxylic acid with a hydroxy substituent, not as an alcohol with a carboxy substituent, because carboxylic acid outranks alcohol in the naming priority.

Visual [Beginner]

Here are the principal functional groups, ranked by the IUPAC seniority (naming priority) from highest to lowest. Higher-priority groups determine the parent name and suffix; lower-priority groups appear as prefixes.

Priority	Functional group	Structure	Suffix
1	Carboxylic acid	$R-COOH$	-oic acid
2	Ester	$R-COOR’$	-oate
3	Amide	$R-CONH_{2}$	-amide
4	Aldehyde	$R-CHO$	-al
5	Ketone	$R-CO-R’$	-one
6	Alcohol	$R-OH$	-ol
7	Amine	$R-NH_{2}$	-amine
8	Alkene	$R-CH=CH-R’$	-ene
9	Alkyne	$R-C\equivC-R’$	-yne
10	Halide	$R-X$ (X = F, Cl, Br, I)	halo- (prefix)

The hierarchy means: if a molecule has both an alkene and an alcohol, it is named as an alken-ol (alcohol suffix wins). If it has a ketone and an alkene, it is an alken-one (ketone suffix wins).

Worked example [Beginner]

Name methyl 4-(2-hydroxyethyl)-2-oxocyclohexanecarboxylate using IUPAC rules, step by step.

This compound has multiple functional groups. Work through the IUPAC rules systematically.

Step 1. Identify functional groups. The molecule contains an ester ( $-COOCH_{3}$ ), a ketone ( $=O$ on the ring), and a hydroxyl ( $-OH$ ). By the hierarchy, the ester is the highest-priority group and determines the parent name.

Step 2. Identify the parent structure. The ester is a cyclohexanecarboxylate -- a carboxylic acid ester where the carboxylate group is attached to a cyclohexane ring. The ester alkyl group is methyl ( $-CH_{3}$ ). So the parent name is "methyl cyclohexanecarboxylate."

Step 3. Number the ring. The carboxylate-bearing carbon is C1 (the highest-priority group gets the lowest locant). Number around the ring to give the other substituents the lowest possible locants. The ketone (oxo) group is at C2 and the hydroxyethyl substituent is at C4.

Step 4. Name the substituents. The ketone group is named "oxo" (prefix form) because it is outranked by the ester. The side chain at C4 is 2-hydroxyethyl: a two-carbon chain with a hydroxyl at the second carbon.

Step 5. Assemble the name. Prefixes (substituents) in alphabetical order, followed by the parent name:

methyl 4-(2-hydroxyethyl)-2-oxocyclohexanecarboxylate

Breaking it down: "methyl" (ester alkyl), "4-(2-hydroxyethyl)" (substituent at C4, with its own internal numbering), "2-oxo" (ketone at C2, named as prefix), "cyclohexane" (ring), "carboxylate" (ester suffix).

The name uniquely specifies the structure: a six-membered ring, carboxylate ester at C1, ketone at C2, 2-hydroxyethyl group at C4, methyl ester. No other structure matches this name.

Check your understanding [Beginner]

Formal definition [Intermediate+]

The IUPAC system of organic nomenclature assigns a unique name to every organic compound through a hierarchical procedure. The 2013 IUPAC Recommendations (the "Blue Book") define both preferred IUPAC names (PINs) and alternative acceptable names.

Step 1: Identify the principal characteristic group. The functional group of highest seniority determines the parent structure and the suffix of the name. The seniority order for common groups is:

carboxylic acid > ester > amide > nitrile > aldehyde > ketone > alcohol > amine > alkene > alkyne > halide

Groups below the principal group are named as prefixes (hydroxy, oxo, amino, etc.).

Step 2: Select the parent hydride. The parent hydride is the longest continuous chain (for acyclic compounds) or the ring system (for cyclic compounds) that contains the principal characteristic group. When multiple chains of equal length exist, choose the one with the greatest number of substituents.

Step 3: Number the parent hydride. Assign locants to each position. The numbering direction gives the principal characteristic group the lowest locant. If the principal group is tied, give the lowest locants to the substituents (first point of difference rule). For aldehydes and carboxylic acids, the carbonyl carbon is always C1.

Step 4: Name and number the substituents. Each substituent receives a name (methyl, ethyl, chloro, hydroxy, oxo, etc.) and a locant. Identical substituents are grouped with a multiplier (di-, tri-, tetra-). Substituents are listed in alphabetical order in the assembled name, ignoring multiplicative prefixes.

Step 5: Assemble the name. The format is:

(locant)-(substituent)-(locant)-... parent (locant)-suffix

Locants for the parent chain and suffix are placed immediately before the suffix. Multiple locants are separated by commas; ranges by an en-dash.

Functional-group hierarchy in detail. The hierarchy is not merely a naming convention -- it reflects the chemical reality that higher-priority groups dominate the molecule's reactivity. Carboxylic acids are both acidic and electrophilic; alcohols are nucleophilic and mildly acidic; alkenes are nucleophilic toward electrophiles but not acidic. The nomenclature hierarchy parallels the reactivity hierarchy.

Seniority of rings vs chains. When a molecule contains both a ring and a chain, the ring is the parent hydride only if it contains the principal characteristic group. Otherwise, the chain bearing the principal group is the parent, and the ring becomes a substituent (e.g., "cyclohexyl" prefix).

Heterocyclic nomenclature. Heterocyclic compounds (rings containing N, O, S) have their own naming conventions (Hantzsch-Widman system for saturated heterocycles; traditional names for aromatic heterocycles like pyridine, furan, thiophene). The priority of heteroatoms for ring numbering is O > S > N.

Counterexamples to common slips

"The longest chain always has 6 carbons if a hexane is present." The longest chain must contain the principal characteristic group. If the carboxylic acid is on a three-carbon chain that branches into a longer four-carbon alkyl substituent, the parent is the three-carbon chain bearing the carboxylic acid, not the longer branch.
"Number from left to right." Numbering is direction-dependent and is always chosen to give the principal group the lowest locant, regardless of how the molecule is drawn.
"Di-, tri- prefixes affect alphabetical order." Alphabetisation ignores di-, tri-, tetra-. "Diethyl" is alphabetised under E, not D. "Dimethyl" under M.
"Ketones use the prefix 'keto-'." Ketones as the principal group use the suffix "-one". When outranked, the prefix is "oxo", not "keto". The term "keto" is reserved for common usage (e.g., keto-enol tautomerism) but is not an IUPAC prefix.

Key theorem with proof [Intermediate+]

Proposition (Uniqueness of the IUPAC name for a given structure). The IUPAC naming procedure, applied to a given molecular structure, produces a unique preferred IUPAC name (PIN) when all tie-breaking rules are applied. Two non-identical molecular structures cannot share the same PIN.

Argument. The procedure is deterministic at each step. Step 1 selects a unique principal characteristic group (the hierarchy is total, not partial, for the standard set of functional groups). Step 2 selects a unique parent hydride: the longest chain containing the principal group is unique, or if multiple chains of equal length exist, the tie-breaking rules (greatest number of substituents, then lowest locants at first point of difference) select a unique chain. Step 3 assigns a unique numbering: the lowest-locant rule and first-point-of-difference rule resolve any ties. Steps 4 and 5 name and order substituents deterministically (alphabetical order is unique; locants are fixed by Step 3).

Since each step produces a unique output from a unique input, the composed procedure maps a molecular structure to a single name. If two structures produced the same name, the name would encode the same parent chain, same locants, same substituents, and same functional groups -- i.e., the same structure. This is a proof by construction: the name encodes the structure, and the encoding is injective. $■$

The converse (that every valid IUPAC name corresponds to exactly one molecular structure) follows by reversing the steps: the name specifies the parent chain, the suffix determines the principal group, the locants place every substituent, and the structure is reconstructed. The IUPAC system is a bijection between the set of organic structures and the set of valid IUPAC names, modulo stereochemical descriptors (which require the CIP rules from unit 15.01.01).

Corollary. Common names are not unique in this sense. "Acetone" names only one structure, but "acetic acid" and "ethanoic acid" name the same structure, and some common names (e.g., "limonene") name a specific stereoisomer without specifying which. IUPAC PINs eliminate all such ambiguities.

Bridge. The injectivity of the naming procedure builds toward the retrosynthetic analysis framework in 15.10.01, where unambiguous structure-to-name mapping becomes indispensable for computer-aided synthesis planning, and appears again in 15.11.01 when NMR spectroscopy must assign peaks to specific structural features by name. The foundational reason the system works is that the functional-group hierarchy is a total order on the standard group set; this is exactly the property that guarantees deterministic principal-group selection at Step 1, and the bridge is that every downstream discipline (synthetic planning, spectroscopy, regulatory databases) relies on this determinism to communicate molecular identity without ambiguity.

Exercises [Intermediate+]

Exercise 5 (medium, numeric).

A compound has the molecular formula $C_{5} H_{10} O$ and contains one ketone group as the only functional group. How many constitutional isomers (structural isomers) exist for this molecular formula with a ketone functional group? Count only acyclic ketones.

Hint

A ketone requires a $C=O$ flanked by two carbon groups. For $C_{5} H_{10} O$ , the carbonyl can be at different positions on the five-carbon chain, and the chain itself can be branched. Enumerate: pentan-2-one, pentan-3-one, 3-methylbutan-2-one, 2-methylbutan-3-one (same as 3-methylbutan-2-one mirrored? -- check carefully).

Answer

3. The acyclic ketone isomers of $C_{5} H_{10} O$ are: (1) pentan-2-one, (2) pentan-3-one, and (3) 3-methylbutan-2-one. "2-Methylbutan-3-one" is the same compound as 3-methylbutan-2-one (number from the end giving the ketone the lowest locant). No other branched arrangement places a ketone on a five-carbon skeleton without creating an aldehyde or moving the carbonyl to C1 (which would be an aldehyde).

Exercise 6 (hard, multiple choice).

The name "2-methyl-3-butanol" is incorrect. Which IUPAC rule does it violate?

A. The substituent di-/tri- prefixes are not alphabetised B. The parent chain is not the longest chain containing the principal group C. Ketone prefix should be "oxo" not "keto" D. The locants are not arranged in ascending order

Hint

A four-carbon chain (butane) with a methyl group on C2 means a longer chain of five carbons exists through the branch point. The parent chain should be longer.

Answer

B. The name "2-methyl-3-butanol" uses a four-carbon parent chain, but a methyl substituent at C2 of a butanol extends the longest continuous chain to five carbons. The correct IUPAC name is 2-methylbutan-2-ol (or 2-methyl-2-butanol in older convention), which uses a four-carbon chain only if that chain genuinely is the longest containing the hydroxyl. The specific error is failing to select the longest chain containing the principal group.

Exercise 8 (hard, numeric).

The molecule $HO-CH_{2} -CH=CH-CHO$ has the PIN "(2E)-2-hydroxybut-2-enal." At what wavenumber (in cm $^{- 1}$ ) would you expect the strongest carbonyl absorption in its IR spectrum? Use the standard aldehyde C=O range of 1720–1740 cm $^{- 1}$ and note that conjugation with a double bond lowers the frequency by approximately 20–30 cm $^{- 1}$ .

Hint

The aldehyde carbonyl in this molecule is conjugated with the C=C double bond (they are adjacent). Apply the conjugation correction to the standard aldehyde range.

Answer

~1700 cm $^{- 1}$ . The standard aldehyde C=O stretch is 1720–1740 cm $^{- 1}$ . Conjugation with the adjacent alkene lowers the frequency by 20–30 cm $^{- 1}$ , giving an expected range of approximately 1690–1710 cm $^{- 1}$ . The midpoint is ~1700 cm $^{- 1}$ . This conjugation lowering is a general feature of unsaturated carbonyl compounds and is diagnostic in IR spectroscopy.

Preferred IUPAC names and the 2013 Blue Book [Master]

The 2013 IUPAC Blue Book (Favre and Powell, Nomenclature of Organic Chemistry: IUPAC Recommendations and Preferred Names 2013, RSC Publishing, 2014) ^{[IUPAC-Blue-Book-2013]} represents the culmination of more than a century of nomenclature development. It introduced Preferred IUPAC Names (PINs) as the single authoritative name for each compound. Before 2013, multiple names could be valid IUPAC names for the same structure. The PIN system selects exactly one.

Retained names. A small set of traditional names are retained as PINs because they are too deeply embedded in the chemical literature to replace. The retained PINs include: aniline (for phenylamine), phenol (for hydroxybenzene), toluene (for methylbenzene), styrene (for ethenylbenzene), naphthalene, anthracene, and a handful of heterocyclic names (pyridine, furan, thiophene, pyrrole). Most other common names are not PINs. "Acetone" is not a PIN; the PIN is propan-2-one. "Formic acid" is not a PIN; the PIN is methanoic acid. "Acetic acid" is not a PIN; the PIN is ethanoic acid. The retained names all satisfy two criteria: they name the unsubstituted parent compound, and their use in the literature is so pervasive that replacement would create more confusion than it resolves.

Locant placement. PINs place locants immediately before the part of the name they qualify. The PIN is "butan-2-ol" rather than "2-butanol." The older convention placed locants at the front of the name; the 2013 Blue Book moved them adjacent to the structural feature they describe. Both forms remain acceptable for general use, but only the PIN form is authoritative for regulatory and database purposes.

Substituent vs subtractive naming. For halogenated compounds, the PIN uses the prefix "bromo-", "chloro-", etc. (substitutive naming) rather than the older subtractive names. $CH_{3} Cl$ is "chloromethane" (PIN), not "methyl chloride." $CH_{2} Cl_{2}$ is "dichloromethane" (PIN), not "methylene chloride." The substitutive system is uniform and composable: every halogen atom adds a prefix, the parent hydride name stays the same.

Multiplicative nomenclature. Symmetric molecules can exploit multiplicative prefixes to produce shorter names. A molecule with two identical substituent groups on different parts of a parent structure can use the prefix "di-" or "bis-" with a shared locant set, rather than naming each substituent separately. For example, 4,4'-sulfonyldibenzoic acid uses the multiplicative "sulfonyldi-" construction rather than the longer substitutive form "4-(4-carboxyphenyl)sulfonylbenzoic acid." The PIN system prefers the shorter name when both are valid, and multiplicative nomenclature frequently produces the shorter form for symmetric structures.

Stereochemical descriptors in PINs. $R / S$ , $E / Z$ , $M / P$ , and other stereochemical descriptors are integral parts of the PIN when applicable. The descriptor is placed at the front of the name, enclosed in parentheses and italicised, separated by a hyphen: "(2 $S$ )-butan-2-ol" is the full PIN for ( $S$ )-sec-butanol. When multiple stereocentres exist, the descriptor lists all relevant locants: " $(2 S, 3 R)$ -2,3-dihydroxybutanoic acid."

The 2013 changes in practice. The most consequential changes from pre-2013 IUPAC practice to PINs were: (a) the elimination of multiple valid names for the same compound -- before 2013, "ethanol" and "ethyl alcohol" were both acceptable IUPAC names for $CH_{3} CH_{2} OH$ ; under PIN rules, only "ethanol" is the preferred name; (b) the systematic replacement of common names by systematic names for all compounds not on the retained list; (c) the introduction of phane nomenclature for complex ring systems, allowing naming of structures that were previously described only by common names or complex substitutive constructions.

The practical consequence is that regulatory databases (CAS, PubChem, ChemSpider) converge on PINs as their primary identifier, and the chemical literature increasingly uses PINs in experimental sections. The name-to-structure mapping is now unambiguous for any compound that has a PIN, which is effectively all well-defined organic compounds.

The functional-group hierarchy and quantitative reactivity [Master]

The naming hierarchy is not arbitrary -- it tracks the thermodynamic and kinetic reactivity of the functional groups. Carboxylic acids (rank 1) are the most reactive toward nucleophilic attack at the carbonyl and are simultaneously the most acidic common organic functional group. Halides (rank 10) are the least reactive toward most reagents and serve primarily as leaving groups.

This correspondence between nomenclature hierarchy and reactivity hierarchy arises from the same electronic-structure features: the polarity of the C=O bond (carbonyl groups), the electronegativity difference between carbon and the heteroatom (alcohols, amines, halides), and the electron density of the pi bond (alkenes, alkynes). The IUPAC hierarchy is a coarse-grained reactivity map.

The Hammett equation. The quantitative connection between functional-group identity and reactivity is captured by the Hammett equation ^{[Hammett-1935]}. For the ionisation of benzoic acids bearing para-substituents X:

lo g (\frac{K _{X}}{K _{H}}) = σ_{X}

where $K_{X}$ is the acid dissociation constant for the substituted benzoic acid and $K_{H}$ is the constant for unsubstituted benzoic acid. The substituent constant $σ_{X}$ measures the electronic effect of X: positive $σ$ indicates electron withdrawal (increasing acidity), negative $σ$ indicates electron donation (decreasing acidity). The general Hammett equation for any reaction rate or equilibrium is:

lo g (\frac{k _{X}}{k _{H}}) = ρ σ_{X}

where $ρ$ is the reaction constant measuring the sensitivity of the reaction to electronic effects. The Hammett $σ$ values for the principal functional groups correlate with their position in the naming hierarchy. Electron-withdrawing groups ( $σ > 0$ ) correspond to the higher-ranked carbonyl-containing groups (carboxylic acids, esters, amides, aldehydes, ketones); electron-donating groups ( $σ < 0$ ) correspond to lower-ranked groups (alkyl, amino, alkoxy).

Electrophilicity of carbonyl groups. Within the carbonyl family, the naming hierarchy (carboxylic acid > ester > amide > aldehyde > ketone) tracks the electrophilicity of the carbonyl carbon, which is the site of nucleophilic attack. Aldehydes and ketones have more electrophilic carbonyl carbons than amides because the nitrogen lone pair donates electron density into the carbonyl $π^{*}$ orbital by resonance, reducing the partial positive charge on the carbonyl carbon. Esters have intermediate electrophilicity: the oxygen lone pair donates, but oxygen is more electronegative than nitrogen, so the donation is weaker. Carboxylic acids are strongly electrophilic despite also having an oxygen lone pair, because the acidic proton makes them reactive in a distinct way (proton transfer precedes nucleophilic attack in many carboxylic-acid reactions).

The Mayr electrophilicity scale ( $E$ parameter) quantifies this ordering. Formaldehyde ( $E \approx + 2.0$ ) is far more electrophilic than acetamide ( $E \approx - 3.0$ ). The nomenclature hierarchy ranks aldehydes above amides, and the electrophilicity hierarchy concurs. The Mayr scale extends to thousands of electrophiles and nucleophiles, parameterised by the linear-free-energy relationship:

lo g k = s_{N} (N + E)

where $N$ is the nucleophilicity parameter, $E$ is the electrophilicity parameter, and $s_{N}$ is a nucleophile-specific slope. The scale allows quantitative prediction of whether a given nucleophile-electrophile pair reacts at measurable rates at room temperature.

Taft steric parameters. The Hammett equation accounts for electronic effects but not steric effects. The Taft equation extends the framework:

lo g (\frac{k _{X}}{k _{H}}) = ρ^{*} σ_{X}^{*} + δ E_{s}

where $σ_{X}^{*}$ is the polar substituent constant (analogous to Hammett $σ$ but for aliphatic systems), $E_{s}$ is the Taft steric parameter, and $δ$ measures the reaction's sensitivity to steric effects. Functional groups that are bulky (tert-butyl, isopropyl) have large negative $E_{s}$ values and slow reactions at sterically congested centres. The interplay of electronic and steric effects determines the functional-group hierarchy's predictive power: a highly electrophilic carbonyl may react slowly if it is sterically shielded.

The hierarchy as a predictive tool. The hierarchy predicts which functional group is modified first when a reagent reacts with a polyfunctional molecule. $LiAlH_{4}$ reduces carboxylic acids, esters, aldehydes, and ketones but leaves isolated alkenes untouched -- reflecting the reactivity gradient encoded in the naming hierarchy. Similarly, nucleophilic attack preferentially occurs at the most electrophilic carbonyl, which corresponds to the highest-ranked carbonyl group. The hierarchy is a qualitative map; the Hammett and Mayr scales are its quantitative counterpart.

Retrosynthetic disconnection at functional groups [Master]

Functional groups are not merely classification labels -- they are the strategic waypoints of synthetic planning. The retrosynthetic approach, formalised by E. J. Corey in the 1960s and codified in The Logic of Chemical Synthesis (Wiley, 1989) ^[Corey-1989], works backward from the target molecule to simpler precursors by mentally breaking bonds at or near functional groups. Each break is a disconnection, and the reverse operation (building the molecule forward) is the corresponding synthetic reaction.

Functional-group interconversion (FGI). The simplest retrosynthetic operation is changing one functional group into another without altering the carbon skeleton. An alcohol can be converted to a halide (via $SOCl_{2}$ or $PBr_{3}$ ), a halide to an alkene (via E2 elimination), an alkene to an alcohol (via hydroboration-oxidation or acid-catalysed hydration), and a ketone to an alcohol (via NaBH $_{4}$ or Grignard addition). The FGI network connects the dozen common functional groups through known, high-yielding reactions. A synthetic planner charts a path through this network to arrive at commercially available starting materials.

The FGI concept explains why knowing the full menu of functional groups matters. A synthetic route to a target ketone might proceed through an alcohol (reduction of a ketone precursor, then re-oxidation), through an alkene (ozonolysis of a terminal alkene gives an aldehyde, which is then oxidised to the acid or reduced to the alcohol), or through a nitrile (Grignard addition to a nitrile gives an imine that hydrolyses to a ketone). The choice of which FGI sequence to use depends on what starting materials are available, what protecting groups are needed for other functional groups, and what the overall step count and yield will be.

Disconnection at the carbonyl. The carbonyl group is the most strategically important functional group for retrosynthetic analysis, because the C-C bond adjacent to a carbonyl can be formed by two powerful reactions: the aldol reaction (forming a $β$ -hydroxy carbonyl from two carbonyl compounds) and the Grignard addition (forming an alcohol from a carbonyl and an organometallic reagent). Retrosynthetically, disconnecting the bond between the carbonyl carbon and the $α$ -carbon of the target reveals two simpler fragments. For a ketone $R-CO-R’$ , disconnection gives R-CO-Cl (an acyl chloride) and R'-MgBr (a Grignard reagent), or R-MgBr and R'-CO-Cl. The carbonyl functional group is the "synthetic handle" that makes the disconnection possible.

The synthon concept. Corey introduced the term synthon for the idealised fragment generated by a disconnection. A synthon is not a real molecule -- it is a conceptual building block with an implied reactivity. A "nucleophilic synthon" at a carbon atom means that carbon needs to be electron-rich in the forward reaction; an "electrophilic synthon" means it needs to be electron-poor. The synthons are then matched to real reagents: a nucleophilic carbon synthon might correspond to a Grignard reagent, an enolate, or a Wittig ylide; an electrophilic carbon synthon might correspond to an alkyl halide, an aldehyde, or an acyl chloride.

The synthon framework maps directly onto the functional-group hierarchy. High-ranking functional groups (carbonyls, esters, acids) are electrophilic at the carbon bearing the group and therefore serve as electrophilic synthons. Low-ranking groups (alcohols after deprotonation to alkoxides, amines) are nucleophilic and serve as nucleophilic synthons. The hierarchy is a first-pass guide to synthon polarity.

Protecting-group strategy. When a molecule contains multiple functional groups, the synthetic plan must ensure that reagents modify only the intended group. Protecting groups temporarily convert one functional group into an inert derivative while another group reacts. An alcohol can be protected as a silyl ether (TBDMS, TMS), a carboxylic acid as an ester, an amine as a carbamate (Boc, Cbz), and an aldehyde or ketone as an acetal. The choice of protecting group depends on the conditions needed for the other reactions in the sequence. The functional-group hierarchy predicts which groups are most reactive and therefore most in need of protection: carboxylic acids and aldehydes are protected before alcohols and amines because the former react with more reagents.

The protecting-group concept is one of the deepest connections between functional-group identity and synthetic practice. A target molecule with five functional groups may require four or five protecting-group operations interleaved with the synthetic steps, and the overall yield is the product of all individual yields. Minimising the number of protecting-group cycles is a central objective of retrosynthetic design, and the functional-group hierarchy is the organising principle that makes minimisation possible.

Functional groups in spectroscopy [Master]

Spectroscopic methods identify functional groups by their characteristic interactions with electromagnetic radiation. The two most diagnostic techniques for functional-group identification are infrared (IR) spectroscopy and nuclear magnetic resonance (NMR) spectroscopy. Each functional group absorbs at a characteristic frequency in IR and produces signals at characteristic chemical shifts in NMR, providing a spectroscopic "fingerprint" that confirms the presence or absence of every group in the hierarchy.

Infrared spectroscopy: the carbonyl diagnostic region. The C=O stretching vibration produces a strong, sharp absorption band in the range 1650–1800 cm $^{- 1}$ , a region of the IR spectrum largely free of other absorptions. The exact position of the carbonyl stretch is diagnostic of the type of carbonyl functional group:

Functional group	C=O stretch (cm $^{- 1}$ )	Character
Acid chloride	1800–1820	strong, broad
Anhydride	1810–1830 and 1750–1770	two bands
Ester	1735–1750	strong
Aldehyde	1720–1740	strong
Ketone	1705–1725	strong
Carboxylic acid	1700–1725	strong, broad
Amide	1640–1690	strong
Conjugated carbonyl	20–30 cm $^{- 1}$ below unconjugated	—

The ordering is informative. Acid chlorides absorb at the highest frequency because the electronegative chlorine withdraws electron density from the C=O bond, stiffening the bond and raising the vibrational frequency. Amides absorb at the lowest frequency because the nitrogen lone pair donates electron density into the carbonyl $π^{*}$ orbital, weakening the C=O bond and lowering the frequency. The IR carbonyl region thus encodes the same electronic-structure information that the nomenclature hierarchy encodes: more electrophilic carbonyls (higher hierarchy rank for the acyl halide subset) have stiffer C=O bonds and higher stretching frequencies.

Conjugation with a C=C double bond or an aromatic ring systematically lowers the carbonyl stretching frequency by 20–30 cm $^{- 1}$ . This lowering reflects the resonance delocalisation of the C=O pi electrons into the conjugated system, which weakens the C=O bond. The IR spectrum of benzaldehyde shows a carbonyl stretch near 1700 cm $^{- 1}$ (compared to 1725 cm $^{- 1}$ for an unconjugated aldehyde), confirming the conjugation. A compound suspected of containing an $α, β$ -unsaturated ketone can be confirmed by observing the carbonyl stretch 20–30 cm $^{- 1}$ below the unconjugated value.

Other diagnostic IR absorptions. Beyond the carbonyl region, several other functional-group absorptions are diagnostically useful. The O-H stretch of alcohols appears as a broad band near 3200–3600 cm $^{- 1}$ (broadened by hydrogen bonding; a free OH in dilute solution is sharp near 3600 cm $^{- 1}$ ). The N-H stretch of amines and amides appears near 3300–3500 cm $^{- 1}$ (primary amines show two bands; secondary amines one). The C-H stretch of alkenes (sp $^{2}$ C-H) appears near 3020–3100 cm $^{- 1}$ , above the aliphatic C-H stretch region (2850–2960 cm $^{- 1}$ ), distinguishing alkene C-H from alkane C-H. The $\equiv$ C-H stretch of terminal alkynes appears near 3300 cm $^{- 1}$ as a sharp band. The C=C stretch of alkenes appears near 1620–1680 cm $^{- 1}$ but is often weak; the C $\equiv$ C stretch of alkynes appears near 2100–2260 cm $^{- 1}$ and is diagnostic for internal alkynes (terminal alkynes show the $\equiv$ C-H stretch instead).

Carboxylic acids have a distinctive combination of a broad O-H stretch (2500–3300 cm $^{- 1}$ , very broad due to strong hydrogen bonding) with a C=O stretch near 1710 cm $^{- 1}$ . This pair of absorptions identifies a carboxylic acid functional group with high reliability.

NMR chemical shifts by functional group. Nuclear magnetic resonance spectroscopy identifies functional groups by the chemical environment of the hydrogen ( $^{1}$ H) and carbon ( $^{13}$ C) atoms near the functional group. The chemical shift ( $δ$ , in ppm) measures the electron density around each nucleus: electron-withdrawing groups (carbonyls, halides, nitro groups) deshield nearby protons and move their signals to higher $δ$ values (downfield); electron-donating groups (alkyl groups, ethers) shield protons and move signals to lower $δ$ (upfield).

Characteristic $^{1}$ H NMR chemical shift ranges:

Proton environment	$δ$ range (ppm)
R-COOH (acid)	10–13
R-CHO (aldehyde)	9–10
Ar-H (aromatic)	6.5–8.0
R-CO-CHR (ketone $α$ -H)	2.0–2.5
R-CH=CH-R (alkene)	4.5–6.5
R-CHR-OH (alcohol $α$ -H)	3.0–4.0
R-CH $_{2}$ -Cl (halide $α$ -H)	3.0–4.0
R-CH $_{3}$ (alkyl)	0.8–1.5

The aldehyde proton at $δ$ 9–10 is one of the most distinctive signals in $^{1}$ H NMR. No other common proton absorbs in this region, so a singlet near $δ$ 9.5 unambiguously identifies an aldehyde functional group. The carboxylic acid proton at $δ$ 10–13 is equally diagnostic, appearing as a broad singlet that exchanges with D $_{2}$ O (disappearing on a D $_{2}$ O shake).

Characteristic $^{13}$ C NMR chemical shifts follow the same electronic-structure logic. Carbonyl carbons appear at $δ$ 160–220 (ketones and aldehydes at $δ$ 190–220; esters and acids at $δ$ 160–185; amides at $δ$ 160–175). Aromatic carbons appear at $δ$ 110–150. Alkene carbons at $δ$ 100–150. Aliphatic carbons at $δ$ 0–50. The $^{13}$ C chemical shift range is wider than $^{1}$ H, providing better spectral resolution for complex molecules.

The spectroscopic fingerprint as a functional-group inventory. Combining IR and NMR data allows a chemist to confirm the presence or absence of every functional group in the IUPAC hierarchy. A molecule named "ethyl 4-hydroxybutanoate" should show: IR -- C=O stretch near 1740 cm $^{- 1}$ (ester), broad O-H stretch near 3400 cm $^{- 1}$ (alcohol). $^{1}$ H NMR -- triplet near $δ$ 1.2 (CH $_{3}$ of ethyl), quartet near $δ$ 4.1 (CH $_{2}$ O of ethyl), triplet near $δ$ 2.3 (CH $_{2}$ CO, $α$ to ester), multiplet near $δ$ 1.7 (CH $_{2}$ , $β$ to both OH and ester), triplet near $δ$ 3.6 (CH $_{2}$ OH), broad singlet near $δ$ 2.5 (OH, exchangeable). The spectroscopic data confirms every functional group implied by the name.

Synthesis. The functional-group hierarchy is the foundational reason that IUPAC nomenclature, synthetic planning, and spectroscopic identification converge on the same organisational framework. Putting these together, the hierarchy identifies the principal group (nomenclature), predicts which bond to disconnect first (retrosynthesis 15.10.01), and determines which spectroscopic signals are diagnostic (IR/NMR 14.12.01 15.11.01). This is exactly the structure that identifies molecular identity with reactivity: the bridge is that a named functional group simultaneously encodes the electrophilicity of a carbonyl (quantified by the Hammett and Mayr scales), the wavenumber of its IR absorption (a direct consequence of bond stiffness), and the chemical shift of its adjacent protons (a consequence of electron withdrawal). The pattern generalises from carbonyls to every functional group in the hierarchy, and the central insight is that electronic structure -- the distribution of electron density in a molecular bond -- is the single variable that underlies naming priority, reaction rate, spectral frequency, and synthetic strategy alike.

Full proof set [Master]

Proposition (Deterministic principal-group selection). The IUPAC seniority order for the standard set of functional groups is a total order. Given any organic molecule containing two or more distinct functional groups, there is a unique highest-ranking group.

Proof. The seniority order is defined by enumeration in the 2013 Blue Book (P-41): carboxylic acid > anhydride > ester > acyl halide > amide > nitrile > aldehyde > ketone > alcohol > amine > alkene > alkyne > halide > ether. For any pair of groups $G_{i}, G_{j}$ drawn from this list, the order specifies $G_{i} > G_{j}$ or $G_{j} > G_{i}$ , with no ties. This is a total order on the standard group set. If a molecule contains groups $G_{i_{1}}, G_{i_{2}}, \dots, G_{i_{n}}$ with $n \geq 2$ , the unique maximum of the set ${G_{i_{1}}, \dots, G_{i_{n}}}$ under this total order is the principal group. A finite totally-ordered set has a unique maximum. $□$

Proposition (Hammett correlation with hierarchy position). For the series of monosubstituted benzoic acids 4-X-C $_{6}$ H $_{4}$ -COOH, where X is a functional group from the IUPAC hierarchy, the Hammett $σ_{p}$ constants correlate with the hierarchy rank in the expected direction: higher-ranked groups (carbonyl derivatives, acids) have $σ_{p} > 0$ and lower-ranked groups (alkyl, amino) have $σ_{p} < 0$ .

Argument. The Hammett $σ_{p}$ constant measures the electronic effect of the para substituent on the acidity of benzoic acid. Electron-withdrawing groups increase acidity ( $σ_{p} > 0$ ); electron-donating groups decrease acidity ( $σ_{p} < 0$ ). The functional groups that rank highest in the IUPAC hierarchy (carboxylic acid, ester, amide, aldehyde, ketone) all contain a C=O bond whose $π$ -electron-withdrawing effect deshields the ring and increases acidity. The groups that rank lowest (alkyl, amino, alkoxy) donate electron density by hyperconjugation or resonance, shielding the ring and decreasing acidity. The sign of $σ_{p}$ therefore tracks the hierarchy position. This is a correlation, not a theorem -- the hierarchy was designed to reflect reactivity, not derived from Hammett constants -- but the alignment confirms that the hierarchy captures real electronic-structure differences. $□$

Connections [Master]

Lewis structures 14.02.01. Supplies the bonding and formal-charge formalism on which the functional-group definitions rest. A carboxylic acid group is identified by its Lewis structure (C double-bonded to O, single-bonded to OH), and the hierarchy rules depend on recognising each group's Lewis pattern within the larger molecular graph.
Stereochemistry 15.01.01. Stereochemical descriptors ( $R / S$ , $E / Z$ ) are integral parts of the full IUPAC name for stereoisomers. The CIP priority rules from the stereochemistry unit are invoked when naming chiral or geometrically isomeric compounds, and the full PIN includes these descriptors at the front of the name.
Acids and bases in organic chemistry 15.03.01. The functional-group hierarchy predicts acid-base behaviour. Carboxylic acids (rank 1) have pKa near 5, alcohols (rank 6) have pKa near 16, and the hierarchy ordering correlates with acidity. The acid-base unit 15.03.01 builds on the functional-group identification established here to explain why different groups have different acidities.
SN1 and SN2 substitution 15.04.02 pending. Nucleophilic substitution at saturated carbon targets specific functional groups (alkyl halides, tosylates). Identifying the leaving group and classifying the substrate by its substitution pattern (primary, secondary, tertiary) requires the functional-group vocabulary from this unit.
Carbonyl chemistry 15.07.01. Aldehydes, ketones, esters, amides, and carboxylic acids (ranks 1--5) are all carbonyl functional groups. The naming hierarchy within the carbonyl family reflects their differing electrophilicities, and the carbonyl unit 15.07.01 assumes the reader can distinguish these groups by name and structure.
Retrosynthetic analysis 15.10.01. Functional-group interconversion (FGI) is the central operation of retrosynthetic planning. The FGI network that connects the dozen common functional groups through known reactions is the synthetic-planning counterpart of the nomenclature hierarchy established here.
NMR spectroscopy of organic molecules 15.11.01. NMR chemical shifts identify functional groups by the electronic environment of nearby nuclei. The diagnostic shift ranges for aldehyde protons ( $δ$ 9--10), carboxylic acid protons ( $δ$ 10--13), and aromatic protons ( $δ$ 6.5--8.0) directly map functional-group identity to spectral features.
Spectroscopy fundamentals 14.12.01. The IR carbonyl stretch region (1650--1800 cm $^{- 1}$ ) and the UV-vis absorption of conjugated systems are treated in the general spectroscopy unit 14.12.01. This unit's functional-group IR and NMR tables are the organic-chemistry specialisation of that general spectroscopic framework.

Historical & philosophical context [Master]

Systematic chemical nomenclature was created by Louis-Bernard Guyton de Morveau in 1782 ^{[Guyton-1782]}, who proposed that chemical names should reflect composition rather than historical origin. Antoine Lavoisier adopted and extended the system the following year. The extension to organic chemistry began with Justus von Liebig and Friedrich Wohler in the 1830s, but early organic nomenclature was entirely common names: formic acid (from the Latin formica, ant, because it was isolated from ant bodies), acetic acid (from acetum, vinegar), and oxalic acid (from Oxalis, wood-sorrel).

The Geneva Congress of 1892 ^{[Geneva-1892]} established the first international rules for organic nomenclature, introducing the concepts of the longest chain, locant numbering, and systematic substitutive naming. The Geneva rules were the work of 34 chemists from nine countries, convened on the initiative of Charles Friedel. The Liege rules of 1930 extended the Geneva system to cover more complex structures, and the first IUPAC "Blue Book" in 1969 (published as Nomenclature of Organic Chemistry, edited by Pinner) consolidated all prior rules into a single authoritative document.

The 1993 IUPAC recommendations introduced the concept of preferred names, and the 2013 Blue Book (Favre and Powell, Nomenclature of Organic Chemistry: IUPAC Recommendations and Preferred Names 2013, RSC Publishing, 2014) ^{[IUPAC-Blue-Book-2013]} codified the full PIN system. The 2013 Blue Book runs to 1568 pages and specifies the preferred name for every class of organic compound.

The Hammett equation, which quantifies the connection between functional-group identity and reactivity, was introduced by Louis P. Hammett in 1935 (Chemical Reviews 17, 125--136) ^{[Hammett-1935]} and developed in his 1940 monograph Physical Organic Chemistry. The Corey retrosynthetic analysis framework was introduced in a series of papers in the late 1960s and codified in The Logic of Chemical Synthesis (Wiley, 1989) ^[Corey-1989], which received the 1990 Nobel Prize in Chemistry.

The philosophical tension in nomenclature -- between descriptive precision and historical usage -- has a direct parallel in biological taxonomy (Linnaean binomial nomenclature) and astronomical nomenclature (star catalogues). The IUPAC resolution (a preferred name plus a set of acceptable alternatives) is a pragmatic middle ground: the PIN is the authoritative identifier for databases and regulatory documents, while common names persist in laboratory discourse and the chemical literature.

Bibliography [Master]

IUPAC nomenclature references.

Favre, H. A. & Powell, W. H., Nomenclature of Organic Chemistry: IUPAC Recommendations and Preferred Names 2013 (RSC Publishing, 2014).
IUPAC, Nomenclature of Organic Chemistry: Sections A, B, C, D, E, F, and H (Pergamon, 1979) -- the first Blue Book.
IUPAC, A Guide to IUPAC Nomenclature of Organic Compounds: Recommendations 1993 (Blackwell Science, 1993).
Panico, R., Powell, W. H. & Richer, J.-C., A Guide to IUPAC Nomenclature of Organic Compounds (Blackwell, 1993).

Textbook references.

Clayden, J., Greeves, N. & Warren, S., Organic Chemistry, 2nd ed. (Oxford UP, 2012), Ch. 2--3.
Vollhardt, K. P. C. & Schore, N. E., Organic Chemistry: Structure and Function, 8th ed. (W. H. Freeman, 2018), Ch. 1--2.
Smith, M. B., March's Advanced Organic Chemistry, 7th ed. (Wiley, 2013), Ch. 1 (structure and nomenclature).

Physical organic chemistry and reactivity.

Hammett, L. P., "The Effect of Structure upon the Reactions of Organic Compounds", Chemical Reviews 17 (1935), 125--136.
Hammett, L. P., Physical Organic Chemistry (McGraw-Hill, 1940).
Mayr, H. & Patz, M., "Scales of Nucleophilicity and Electrophilicity", Angew. Chem. Int. Ed. Engl. 33 (1994), 938--957.
Anslyn, E. V. & Dougherty, D. A., Modern Physical Organic Chemistry (University Science Books, 2006), Ch. 8--9.

Retrosynthetic analysis.

Corey, E. J. & Cheng, X.-M., The Logic of Chemical Synthesis (Wiley, 1989).
Corey, E. J., "General methods for the construction of complex molecules", Pure and Applied Chemistry 14 (1967), 19--37.

Spectroscopy.

Pavia, D. L., Lampman, G. M., Kriz, G. S. & Engel, R. G., A Small Scale Approach to Organic Laboratory Techniques, 4th ed. (Cengage, 2015), Ch. 21--26 (IR, NMR, mass spectrometry).
Silverstein, R. M., Webster, F. X. & Kiemle, D. J., Spectrometric Identification of Organic Compounds, 7th ed. (Wiley, 2005).

Historical.

Guyton de Morveau, L. B., "Memoire sur les denominations chimiques", Observations sur la Physique 19 (1782), 370--382.
The Geneva Congress of 1892 -- proceedings reprinted in Bull. Soc. Chim. Fr. (1892).
Richter, F. P., ed., IUPAC Nomenclature of Organic Chemistry (Butterworths, 1969) -- the first consolidated Blue Book.

Prerequisites

none — this is a leaf unit

Tier anchors

beginner: Clayden, Greeves & Warren — Organic Chemistry 2nd ed. Ch. 2 (Functional groups)
intermediate: Clayden, Greeves & Warren — Organic Chemistry 2nd ed. Ch. 2–3 (Functional groups, nomenclature); Bruice — Organic Chemistry, nomenclature chapters
master: IUPAC — Nomenclature of Organic Chemistry: IUPAC Recommendations and Preferred Names 2013 (Blue Book); Favre & Powell — Nomenclature of Organic Chemistry: IUPAC Recommendations and Preferred Names 2013 (RSC, 2014)

References

TODO_REF pending
Clayden, Greeves & Warren — Organic Chemistry, 2nd ed. (Oxford UP, 2012) · Ch. 2 Functional groups; Ch. 3 Nomenclature and structure · see docs/catalogs/NEED_TO_SOURCE.md#chem-clayden
TODO_REF pending
Favre, H. A. & Powell, W. H. — Nomenclature of Organic Chemistry: IUPAC Recommendations and Preferred Names 2013 (RSC, 2014) · The IUPAC Blue Book, comprehensive reference · see docs/catalogs/NEED_TO_SOURCE.md#chem-iupac-blue-book-2013
TODO_REF pending
Vollhardt & Schore — Organic Chemistry: Structure and Function, 8th ed. (W. H. Freeman, 2018) · Ch. 1–2 (structure and nomenclature basics) · see docs/catalogs/NEED_TO_SOURCE.md#chem-vollhardt-schore
TODO_REF pending
IUPAC — Nomenclature of Inorganic Chemistry: IUPAC Recommendations 2005 (Red Book) · Reference for coordination with inorganic nomenclature · see docs/catalogs/NEED_TO_SOURCE.md#chem-iupac-red-book
TODO_REF pending
Hammett, L. P. — Physical Organic Chemistry (McGraw-Hill, 1940); 'The Effect of Structure upon the Reactions of Organic Compounds', Chem. Rev. 17, 125–136 (1935) · Originator of the Hammett equation and sigma constants · see docs/catalogs/NEED_TO_SOURCE.md#chem-hammett-1935
TODO_REF pending
Corey, E. J. — The Logic of Chemical Synthesis (Wiley, 1989) · Ch. 1–3 (retrosynthetic analysis, synthons, FGI) · see docs/catalogs/NEED_TO_SOURCE.md#chem-corey-1989
TODO_REF pending
Guyton de Morveau, L. B. — Memoire sur les denominations chimiques, Observations sur la Physique 19, 370–382 (1782) · Originator of systematic chemical nomenclature · see docs/catalogs/NEED_TO_SOURCE.md#chem-guyton-1782
TODO_REF pending
IUPAC Commission on Nomenclature — Geneva Congress proceedings, Bull. Soc. Chim. Fr. (1892) · First international organic nomenclature rules · see docs/catalogs/NEED_TO_SOURCE.md#chem-geneva-1892
tong
raw/pdfs/dynamics/four.pdf · background reference for Hamiltonian formalism; chem readers may skip

Reviewer

Tyler (pending external chemistry reviewer per CHEMISTRY_PLAN §6)

Estimated time

beginner: 15m
intermediate: 35m
master: 70m