Quantum Chemistry Third Edition Quantum Chemistry Third Edition John P. Lowe Department of Chemistry The Pennsylvania State University University Park, Pennsylvania Kirk A. Peterson Department of Chemistry Washington State University Pullman, Washington Amsterdam • Boston • Heidelberg • London • NewYork • Oxford Paris • San Diego • San Francisco • Singapore • Sydney • Tokyo Acquisitions Editor: Jeremy Hayhurst Project Manager: A. B. McGee Editorial Assistant: Desiree Marr Marketing Manager: Linda Beattie Cover Designer: Julio Esperas Composition: Integra Software Services Cover Printer: Phoenix Color Interior Printer: Maple-Vail Book Manufacturing Group Elsevier Academic Press 30 Corporate Drive, Suite 400, Burlington, MA 01803, USA 525 B Street, Suite 1900, San Diego, CA 92101-4495, USA 84 Theobald’s Road, London WC1X 8RR, UK This book is printed on acid-free paper. Copyright c 2006, Elsevier Inc. All rights reserved. No part of this publication may be reproduced or transmitted in any form or by any means, electronic or mechanical, including photocopy, recording, or any information storage and retrieval system, without permission in writing from the publisher. Permissions may be sought directly from Elsevier’s Science & Technology Rights Department in Oxford, UK: telephone: (+44) 1865 843830, fax: (+44) 1865 853333, e-mail: permissions@elsevier.co.uk. You may also complete your request on-line via the Elsevier homepage (http://www.elsevier.com), by selecting “Customer Support” and then “Obtaining Permissions.” Library of Congress Cataloging-in-Publication Data Lowe, John P. Quantum chemistry. -- 3rd ed. / John P. Lowe, Kirk A. Peterson. p. cm. Includes bibliographical references and index. ISBN 0-12-457551-X 1. Quantum chemistry. I. Peterson, Kirk A. II. Title. QD462.L69 2005 541'.28--dc22 2005019099 British Library Cataloguing in Publication Data A catalogue record for this book is available from the British Library ISBN-13: 978-0-12-457551-6 ISBN-10: 0-12-457551-X For all information on all Elsevier Academic Press publications visit ourWeb site at www.books.elsevier.com Printed in the United States of America 05 06 07 08 09 10 9 8 7 6 5 4 3 2 1 Working together to grow libraries in developing countries www.elsevier.com | www.bookaid.org | www.sabre.org To Nancy -J. L. THE MOLECULAR CHALLENGE Sir Ethylene, to scientists fair prey, (Who dig and delve and peek and push and pry, And prove their findings with equations sly) Smoothed out his ruffled orbitals, to say: “I stand in symmetry. Mine is a way Of mystery and magic. Ancient, I Am also deemed immortal. Should I die, Pi would be in the sky, and Judgement Day Would be upon us. For all things must fail, That hold our universe together, when Bonds such as bind me fail, and fall asunder. Hence, stand I firm against the endless hail Of scientific blows. I yield not.” Men And their computers stand and stare and wonder. W.G. LOWE Contents Preface to the Third Edition xvii Preface to the Second Edition xix Preface to the First Edition xxi 1 ClassicalWaves and the Time-Independent Schr¨odingerWave Equation 1 1-1 Introduction . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 1 1-2 Waves . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 1 1-3 The ClassicalWave Equation . . . . . . . . . . . . . . . . . . . . . 4 1-4 StandingWaves in a Clamped String . . . . . . . . . . . . . . . . . 7 1-5 Light as an ElectromagneticWave . . . . . . . . . . . . . . . . . . . 9 1-6 The Photoelectric Effect . . . . . . . . . . . . . . . . . . . . . . . . 10 1-7 TheWave Nature of Matter . . . . . . . . . . . . . . . . . . . . . . 14 1-8 A Diffraction Experiment with Electrons . . . . . . . . . . . . . . . 16 1-9 Schr¨odinger’s Time-IndependentWave Equation . . . . . . . . . . . 19 1-10 Conditions on . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 21 1-11 Some Insight into the Schr¨odinger Equation . . . . . . . . . . . . . 22 1-12 Summary . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 23 Problems . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 24 Multiple Choice Questions . . . . . . . . . . . . . . . . . . . . . . 25 Reference . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 26 2 Quantum Mechanics of Some Simple Systems 27 2-1 The Particle in a One-Dimensional “Box” . . . . . . . . . . . . . . . 27 2-2 Detailed Examination of Particle-in-a-Box Solutions . . . . . . . . . 30 2-3 The Particle in a One-Dimensional “Box” with One FiniteWall . . . 38 2-4 The Particle in an Infinite “Box” with a Finite Central Barrier . . . . 44 2-5 The Free Particle in One Dimension . . . . . . . . . . . . . . . . . . 47 2-6 The Particle in a Ring of Constant Potential . . . . . . . . . . . . . . 50 2-7 The Particle in a Three-Dimensional Box: Separation of Variables . . 53 2-8 The Scattering of Particles in One Dimension . . . . . . . . . . . . . 56 2-9 Summary . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 59 Problems . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 60 Multiple Choice Questions . . . . . . . . . . . . . . . . . . . . . . 65 References . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 68 ix x Contents 3 The One-Dimensional Harmonic Oscillator 69 3-1 Introduction . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 69 3-2 Some Characteristics of the Classical One-Dimensional Harmonic Oscillator . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 69 3-3 The Quantum-Mechanical Harmonic Oscillator . . . . . . . . . . . . 72 3-4 Solution of the Harmonic Oscillator Schr¨odinger Equation . . . . . . 74 3-5 Quantum-Mechanical Average Value of the Potential Energy . . . . . 83 3-6 Vibrations of Diatomic Molecules . . . . . . . . . . . . . . . . . . . 84 3-7 Summary . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 85 Problems . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 85 Multiple Choice Questions . . . . . . . . . . . . . . . . . . . . . . . 88 4 The Hydrogenlike Ion, Angular Momentum, and the Rigid Rotor 89 4-1 The Schr¨odinger Equation and the Nature of Its Solutions . . . . . . . 89 4-2 Separation of Variables . . . . . . . . . . . . . . . . . . . . . . . . . 105 4-3 Solution of the R, , and  Equations . . . . . . . . . . . . . . . . . 106 4-4 Atomic Units . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 109 4-5 Angular Momentum and Spherical Harmonics . . . . . . . . . . . . . 110 4-6 Angular Momentum and Magnetic Moment . . . . . . . . . . . . . . 115 4-7 Angular Momentum in Molecular Rotation—The Rigid Rotor . . . . 117 4-8 Summary . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 119 Problems . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 120 Multiple Choice Questions . . . . . . . . . . . . . . . . . . . . . . . 125 References . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 126 5 Many-Electron Atoms 127 5-1 The Independent Electron Approximation . . . . . . . . . . . . . . . 127 5-2 Simple Products and Electron Exchange Symmetry . . . . . . . . . . 129 5-3 Electron Spin and the Exclusion Principle . . . . . . . . . . . . . . . 132 5-4 Slater Determinants and the Pauli Principle . . . . . . . . . . . . . . 137 5-5 Singlet and Triplet States for the 1s2s Configuration of Helium . . . . 138 5-6 The Self-Consistent Field, Slater-Type Orbitals, and the Aufbau Principle . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 144 5-7 Electron Angular Momentum in Atoms . . . . . . . . . . . . . . . . . 149 5-8 Overview . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 159 Problems . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 160 Multiple Choice Questions . . . . . . . . . . . . . . . . . . . . . . . 164 References . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 165 6 Postulates and Theorems of Quantum Mechanics 166 6-1 Introduction . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 166 6-2 TheWavefunction Postulate . . . . . . . . . . . . . . . . . . . . . . . 166 6-3 The Postulate for Constructing Operators . . . . . . . . . . . . . . . . 167 6-4 The Time-Dependent Schr¨odinger Equation Postulate . . . . . . . . . 168 6-5 The Postulate Relating Measured Values to Eigenvalues . . . . . . . . 169 6-6 The Postulate for Average Values . . . . . . . . . . . . . . . . . . . . 171 6-7 Hermitian Operators . . . . . . . . . . . . . . . . . . . . . . . . . . 171 Contents xi 6-8 Proof That Eigenvalues of Hermitian Operators Are Real . . . . . . . 172 6-9 Proof That Nondegenerate Eigenfunctions of a Hermitian Operator Form an Orthogonal Set . . . . . . . . . . . . . . . . . . . . . . . . 173 6-10 Demonstration That All Eigenfunctions of a Hermitian Operator May Be Expressed as an Orthonormal Set . . . . . . . . . . . . . . . . . 174 6-11 Proof That Commuting Operators Have Simultaneous Eigenfunctions 175 6-12 Completeness of Eigenfunctions of a Hermitian Operator . . . . . . 176 6-13 The Variation Principle . . . . . . . . . . . . . . . . . . . . . . . . 178 6-14 The Pauli Exclusion Principle . . . . . . . . . . . . . . . . . . . . . 178 6-15 Measurement, Commutators, and Uncertainty . . . . . . . . . . . . 178 6-16 Time-Dependent States . . . . . . . . . . . . . . . . . . . . . . . . 180 6-17 Summary . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 185 Problems . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 186 Multiple Choice Questions . . . . . . . . . . . . . . . . . . . . . . 189 References . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 189 7 The Variation Method 190 7-1 The Spirit of the Method . . . . . . . . . . . . . . . . . . . . . . . . 190 7-2 Nonlinear Variation: The Hydrogen Atom . . . . . . . . . . . . . . 191 7-3 Nonlinear Variation: The Helium Atom . . . . . . . . . . . . . . . . 194 7-4 Linear Variation: The Polarizability of the Hydrogen Atom . . . . . 197 7-5 Linear Combination of Atomic Orbitals: The H+2 Molecule–Ion . . . 206 7-6 Molecular Orbitals of Homonuclear Diatomic Molecules . . . . . . . 220 7-7 Basis Set Choice and the VariationalWavefunction . . . . . . . . . . 231 7-8 Beyond the Orbital Approximation . . . . . . . . . . . . . . . . . . 233 Problems . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 235 Multiple Choice Questions . . . . . . . . . . . . . . . . . . . . . . 241 References . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 242 8 The Simple H¨uckel Method and Applications 244 8-1 The Importance of Symmetry . . . . . . . . . . . . . . . . . . . . . 244 8-2 The Assumption of s–p Separability . . . . . . . . . . . . . . . . . 244 8-3 The Independent p-Electron Assumption . . . . . . . . . . . . . . . 246 8-4 Setting up the H¨uckel Determinant . . . . . . . . . . . . . . . . . . 247 8-5 Solving the HMO Determinantal Equation for Orbital Energies . . . 250 8-6 Solving for the Molecular Orbitals . . . . . . . . . . . . . . . . . . 251 8-7 The Cyclopropenyl System: Handling Degeneracies . . . . . . . . . 253 8-8 Charge Distributions from HMOs . . . . . . . . . . . . . . . . . . . 256 8-9 Some Simplifying Generalizations . . . . . . . . . . . . . . . . . . 259 8-10 HMO Calculations on Some Simple Molecules . . . . . . . . . . . . 263 8-11 Summary: The Simple HMO Method for Hydrocarbons . . . . . . . 268 8-12 Relation Between Bond Order and Bond Length . . . . . . . . . . . 269 8-13 p-Electron Densities and Electron Spin Resonance Hyperfine Splitting Constants . . . . . . . . . . . . . . . . . . . . . . . . . . . 271 8-14 Orbital Energies and Oxidation-Reduction Potentials . . . . . . . . . 275 8-15 Orbital Energies and Ionization Energies . . . . . . . . . . . . . . . 278 8-16 p-Electron Energy and Aromaticity . . . . . . . . . . . . . . . . . . 279 xii Contents 8-17 Extension to Heteroatomic Molecules . . . . . . . . . . . . . . . . 284 8-18 Self-Consistent Variations of a and ß . . . . . . . . . . . . . . . . 287 8-19 HMO Reaction Indices . . . . . . . . . . . . . . . . . . . . . . . . 289 8-20 Conclusions . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 295 Problems . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 296 Multiple Choice Questions . . . . . . . . . . . . . . . . . . . . . . 305 References . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 306 9 Matrix Formulation of the Linear Variation Method 308 9-1 Introduction . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 308 9-2 Matrices and Vectors . . . . . . . . . . . . . . . . . . . . . . . . . 308 9-3 Matrix Formulation of the Linear Variation Method . . . . . . . . . 315 9-4 Solving the Matrix Equation . . . . . . . . . . . . . . . . . . . . . 317 9-5 Summary . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 320 Problems . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 320 References . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 323 10 The Extended H¨uckel Method 324 10-1 The Extended H¨uckel Method . . . . . . . . . . . . . . . . . . . . 324 10-2 Mulliken Populations . . . . . . . . . . . . . . . . . . . . . . . . . 335 10-3 Extended H¨uckel Energies and Mulliken Populations . . . . . . . . 338 10-4 Extended H¨uckel Energies and Experimental Energies . . . . . . . 340 Problems . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 342 References . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 347 11 The SCF-LCAO-MO Method and Extensions 348 11-1 Ab Initio Calculations . . . . . . . . . . . . . . . . . . . . . . . . 348 11-2 The Molecular Hamiltonian . . . . . . . . . . . . . . . . . . . . . 349 11-3 The Form of theWavefunction . . . . . . . . . . . . . . . . . . . . 349 11-4 The Nature of the Basis Set . . . . . . . . . . . . . . . . . . . . . 350 11-5 The LCAO-MO-SCF Equation . . . . . . . . . . . . . . . . . . . . 350 11-6 Interpretation of the LCAO-MO-SCF Eigenvalues . . . . . . . . . 351 11-7 The SCF Total Electronic Energy . . . . . . . . . . . . . . . . . . 352 11-8 Basis Sets . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 353 11-9 The Hartree–Fock Limit . . . . . . . . . . . . . . . . . . . . . . . 357 11-10 Correlation Energy . . . . . . . . . . . . . . . . . . . . . . . . . . 357 11-11 Koopmans’ Theorem . . . . . . . . . . . . . . . . . . . . . . . . . 358 11-12 Configuration Interaction . . . . . . . . . . . . . . . . . . . . . . . 360 11-13 Size Consistency and the Møller–Plesset and Coupled Cluster Treatments of Correlation . . . . . . . . . . . . . . . . . . . . . . 365 11-14 Multideterminant Methods . . . . . . . . . . . . . . . . . . . . . . 367 11-15 Density Functional Theory Methods . . . . . . . . . . . . . . . . . 368 11-16 Examples of Ab Initio Calculations . . . . . . . . . . . . . . . . . 370 11-17 Approximate SCF-MO Methods . . . . . . . . . . . . . . . . . . . 384 Problems . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 386 References . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 388 Contents xiii 12 Time-Independent Rayleigh–Schr¨odinger Perturbation Theory 391 12-1 An Introductory Example . . . . . . . . . . . . . . . . . . . . . . . 391 12-2 Formal Development of the Theory for Nondegenerate States . . . . 391 12-3 A Uniform Electrostatic Perturbation of an Electron in a “Wire” . . 396 12-4 The Ground-State Energy to First-Order of Heliumlike Systems . . 403 12-5 Perturbation at an Atom in the Simple H¨uckel MO Method . . . . . 406 12-6 Perturbation Theory for a Degenerate State . . . . . . . . . . . . . 409 12-7 Polarizability of the Hydrogen Atom in the n=2 States . . . . . . . 410 12-8 Degenerate-Level Perturbation Theory by Inspection . . . . . . . . 412 12-9 Interaction Between Two Orbitals: An Important Chemical Model . 414 12-10 Connection Between Time-Independent Perturbation Theory and Spectroscopic Selection Rules . . . . . . . . . . . . . . . . . . . . 417 Problems . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 420 Multiple Choice Questions . . . . . . . . . . . . . . . . . . . . . . 427 References . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 428 13 Group Theory 429 13-1 Introduction . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 429 13-2 An Elementary Example . . . . . . . . . . . . . . . . . . . . . . . 429 13-3 Symmetry Point Groups . . . . . . . . . . . . . . . . . . . . . . . 431 13-4 The Concept of Class . . . . . . . . . . . . . . . . . . . . . . . . . 434 13-5 Symmetry Elements and Their Notation . . . . . . . . . . . . . . . 436 13-6 Identifying the Point Group of a Molecule . . . . . . . . . . . . . . 441 13-7 Representations for Groups . . . . . . . . . . . . . . . . . . . . . . 443 13-8 Generating Representations from Basis Functions . . . . . . . . . . 446 13-9 Labels for Representations . . . . . . . . . . . . . . . . . . . . . . 451 13-10 Some Connections Between the Representation Table and Molecular Orbitals . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 452 13-11 Representations for Cyclic and Related Groups . . . . . . . . . . . 453 13-12 Orthogonality in Irreducible Inequivalent Representations . . . . . 456 13-13 Characters and Character Tables . . . . . . . . . . . . . . . . . . . 458 13-14 Using Characters to Resolve Reducible Representations . . . . . . 462 13-15 Identifying Molecular Orbital Symmetries . . . . . . . . . . . . . . 463 13-16 Determining in Which Molecular Orbital an Atomic Orbital Will Appear . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 465 13-17 Generating Symmetry Orbitals . . . . . . . . . . . . . . . . . . . . 467 13-18 Hybrid Orbitals and Localized Orbitals . . . . . . . . . . . . . . . 470 13-19 Symmetry and Integration . . . . . . . . . . . . . . . . . . . . . . 472 Problems . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 476 Multiple Choice Questions . . . . . . . . . . . . . . . . . . . . . . 481 References . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 483 14 Qualitative Molecular Orbital Theory 484 14-1 The Need for a Qualitative Theory . . . . . . . . . . . . . . . . . . 484 14-2 Hierarchy in Molecular Structure and in Molecular Orbitals . . . . 484 14-3 H+2 Revisited . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 485 14-4 H2: Comparisons with H+2 . . . . . . . . . . . . . . . . . . . . . . 488 xiv Contents 14-5 Rules for Qualitative Molecular Orbital Theory . . . . . . . . . . . 490 14-6 Application of QMOT Rules to Homonuclear Diatomic Molecules . 490 14-7 Shapes of Polyatomic Molecules: Walsh Diagrams . . . . . . . . . 495 14-8 Frontier Orbitals . . . . . . . . . . . . . . . . . . . . . . . . . . . 505 14-9 Qualitative Molecular Orbital Theory of Reactions . . . . . . . . . 508 Problems . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 521 References . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 524 15 Molecular Orbital Theory of Periodic Systems 526 15-1 Introduction . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 526 15-2 The Free Particle in One Dimension . . . . . . . . . . . . . . . . . 526 15-3 The Particle in a Ring . . . . . . . . . . . . . . . . . . . . . . . . . 529 15-4 Benzene . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 530 15-5 General Form of One-Electron Orbitals in Periodic Potentials— Bloch’s Theorem . . . . . . . . . . . . . . . . . . . . . . . . . . . 533 15-6 A Retrospective Pause . . . . . . . . . . . . . . . . . . . . . . . . 537 15-7 An Example: Polyacetylene with Uniform Bond Lengths . . . . . . 537 15-8 Electrical Conductivity . . . . . . . . . . . . . . . . . . . . . . . . 546 15-9 Polyacetylene with Alternating Bond Lengths—Peierls’ Distortion . 547 15-10 Electronic Structure of All-Trans Polyacetylene . . . . . . . . . . . 551 15-11 Comparison of EHMO and SCF Results on Polyacetylene . . . . . 552 15-12 Effects of Chemical Substitution on the p Bands . . . . . . . . . . 554 15-13 Poly-Paraphenylene—A Ring Polymer . . . . . . . . . . . . . . . 555 15-14 Energy Calculations . . . . . . . . . . . . . . . . . . . . . . . . . 562 15-15 Two-Dimensional Periodicity and Vectors in Reciprocal Space . . . 562 15-16 Periodicity in Three Dimensions—Graphite . . . . . . . . . . . . . 565 15-17 Summary . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 576 Problems . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 578 References . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 580 Appendix 1 Useful Integrals 582 Appendix 2 Determinants 584 Appendix 3 Evaluation of the Coulomb Repulsion Integral Over 1s AOs 587 Appendix 4 Angular Momentum Rules 591 Appendix 5 The Pairing Theorem 601 Appendix 6 H¨uckel Molecular Orbital Energies, Coefficients, Electron Densities, and Bond Orders for Some Simple Molecules 605 Appendix 7 Derivation of the Hartree–Fock Equation 614 Appendix 8 The Virial Theorem for Atoms and Diatomic Molecules 624 Contents xv Appendix 9 Bra-ket Notation 629 Appendix 10 Values of Some Useful Constants and Conversion Factors 631 Appendix 11 Group Theoretical Charts and Tables 636 Appendix 12 Hints for Solving Selected Problems 651 Appendix 13 Answers to Problems 654 Index 691 Preface to the Third Edition We have attempted to improve and update this text while retaining the features that make it unique, namely, an emphasis on physical understanding, and the ability to estimate, evaluate, and predict results without blind reliance on computers, while still maintaining rigorous connection to the mathematical basis for quantum chemistry. We have inserted into most chapters examples that allowimportant points to be emphasized, clarified, or extended. This has enabled us to keep intact most of the conceptual development familiar to past users. In addition, many of the chapters now include multiple choice questions that students are invited to solve in their heads. This is not because we think that instructors will be using such questions. Rather it is because we find that such questions permit us to highlight some of the definitions or conclusions that students often find most confusing far more quickly and effectively than we can by using traditional problems. Of course, we have also sought to update material on computational methods, since these are changing rapidly as the field of quantum chemistry matures. This book is written for courses taught at the first-year graduate/senior undergraduate levels, which accounts for its implicit assumption that many readers will be relatively unfamiliar with much of the mathematics and physics underlying the subject. Our experience over the years has supported this assumption; many chemistry majors are exposed to the requisite mathematics and physics, yet arrive at our courses with poor understanding or recall of those subjects. That makes this course an opportunity for such students to experience the satisfaction of finally seeing how mathematics, physics, and chemistry are intertwined in quantum chemistry. It is for this reason that treatments of the simple and extended Hückel methods continue to appear, even though these are no longer the methods of choice for serious computations. These topics nevertheless form the basis for the way most non-theoretical chemists understand chemical processes, just as we tend to think about gas behavior as “ideal, with corrections.” xvii Preface to the Second Edition The success of the first edition has warranted a second. The changes I have made reflect my perception that the book has mostly been used as a teaching text in introductory courses. Accordingly, I have removed some of the material in appendixes on mathematical details of solving matrix equations on a computer. Also I have removed computer listings for programs, since these are now commonly available through commercial channels. I have added a new chapter on MO theory of periodic systems—a subject of rapidly growing importance in theoretical chemistry and materials science and one for which chemists still have difficulty finding appropriate textbook treatments. I have augmented discussion in various chapters to give improved coverage of time-dependent phenomena and atomic term symbols and have provided better connection to scattering as well as to spectroscopy of molecular rotation and vibration. The discussion on degenerate-level perturbation theory is clearer, reflecting my own improved understanding since writing the first edition. There is also a new section on operator methods for treating angular momentum. Some teachers are strong adherents of this approach, while others prefer an approach that avoids the formalism of operator techniques. To permit both teaching methods, I have placed this material in an appendix. Because this edition is more overtly a text than a monograph, I have not attempted to replace older literature references with newer ones, except in cases where there was pedagogical benefit. A strength of this book has been its emphasis on physical argument and analogy (as opposed to pure mathematical development). I continue to be a strong proponent of the view that true understanding comes with being able to “see” a situation so clearly that one can solve problems in one’s head. There are significantly more end-of-chapter problems, a number of them of the “by inspection” type. There are also more questions inviting students to explain their answers. I believe that thinking about such questions, and then reading explanations from the answer section, significantly enhances learning. It is the fashion today to focus on state-of-the-art methods for just about everything. The impact of this on education has, I feel, been disastrous. Simpler examples are often needed to develop the insight that enables understanding the complexities of the latest techniques, but too often these are abandoned in the rush to get to the “cutting edge.” For this reason I continue to include a substantial treatment of simple H¨uckel theory. It permits students to recognize the connections between MOs and their energies and bonding properties, and it allows me to present examples and problems that have maximum transparency in later chapters on perturbation theory, group theory, qualitative MO theory, and periodic systems. I find simple H¨uckel theory to be educationally indispensable. xix xx Preface to the Second Edition Much of the new material in this edition results from new insights I have developed in connection with research projects with graduate students. The work of all four of my students since the appearance of the first edition is represented, and I am delighted to thank Sherif Kafafi, John LaFemina, Maribel Soto, and Deb Camper for all I have learned from them. Special thanks are due to Professor Terry Carlton, of Oberlin College, who made many suggestions and corrections that have been adopted in the new edition. Doubtless, there are new errors. I would be grateful to learn of them so that future printings of this edition can be made error-free. Students or teachers with comments, questions, or corrections are more than welcome to contact me, either by mail at the Department of Chemistry, 152 Davey Lab, The Pennsylvania State University, University Park, PA 16802, or by e-mail directed to JL3 at PSUVM.PSU.EDU. Preface to the First Edition My aim in this book is to present a reasonably rigorous treatment of molecular orbital theory, embracing subjects that are of practical interest to organic and inorganic as well as physical chemists. My approach here has been to rely on physical intuition as much as possible, first solving a number of specific problems in order to develop sufficient insight and familiarity to make the formal treatment of Chapter 6 more palatable. My own experience suggests that most chemists find this route the most natural. I have assumed that the reader has at some time learned calculus and elementary physics, but I have not assumed that this material is fresh in his or her mind. Other mathematics is developed as it is needed. The book could be used as a text for undergraduate or graduate students in a half or full year course. The level of rigor of the book is somewhat adjustable. For example, Chapters 3 and 4, on the harmonic oscillator and hydrogen atom, can be truncated if one wishes to know the nature of the solutions, but not the mathematical details of how they are produced. I have made use of appendixes for certain of the more complicated derivations or proofs. This is done in order to avoid having the development of major ideas in the text interrupted or obscured. Certain of the appendixes will interest only the more theoretically inclined student. Also, because I anticipate that some readers may wish to skip certain chapters or parts of chapters, I have occasionally repeated information so that a given chapter will be less dependent on its predecessors. This may seem inelegant at times, but most students will more readily forgive repetition of something they already know than an overly terse presentation. I have avoided early usage of bra-ket notation. I believe that simultaneous introduction of new concepts and unfamiliar notation is poor pedagogy. Bra-ket notation is used only after the ideas have had a change to jell. Problem solving is extremely important in acquiring an understanding of quantum chemistry. I have included a fair number of problems with hints for a few of them in Appendix 14 and answers for almost all of them in Appendix 15.1 It is inevitable that one be selective in choosing topics for a book such as this. This book emphasizes ground stateMOtheory of molecules more than do most introductory texts, with rather less emphasis on spectroscopy than is usual. Angular momentum is treated at a fairly elementary level at various appropriate places in the text, but it is never given a full-blown formal development using operator commutation relations. Time-dependent phenomena are not included. Thus, scattering theory is absent, 1In this Second Edition, these Appendices are numbered Appendix 12 and 13. xxi xxii Preface to the First Edition although selection rules and the transition dipole are discussed in the chapter on timeindependent perturbation theory. Valence-bond theory is completely absent. If I have succeeded in my effort to provide a clear and meaningful treatment of topics relevant to modern molecular orbital theory, it should not be difficult for an instructor to provide for excursions into related topics not covered in the text. Over the years, many colleagues have been kind enough to read sections of the evolving manuscript and provide corrections and advice. I especially thank L. P. Gold and O. H. Crawford, who cheerfully bore the brunt of this task. Finally, I would like to thank my father,Wesley G. Lowe, for allowing me to include his sonnet, “The Molecular Challenge.” Chapter 1 ClassicalWaves and the Time-Independent Schr ¨ odingerWave Equation 1-1 Introduction The application of quantum-mechanical principles to chemical problems has revolutionized the field of chemistry. Today our understanding of chemical bonding, spectral phenomena, molecular reactivities, and various other fundamental chemical problems rests heavily on our knowledge of the detailed behavior of electrons in atoms and molecules. In this book we shall describe in detail some of the basic principles, methods, and results of quantum chemistry that lead to our understanding of electron behavior. In the first fewchapters we shall discuss some simple, but important, particle systems. This will allow us to introduce many basic concepts and definitions in a fairly physical way. Thus, some background will be prepared for the more formal general development of Chapter 6. In this first chapter, we review briefly some of the concepts of classical physics as well as some early indications that classical physics is not sufficient to explain all phenomena. (Those readers who are already familiar with the physics of classical waves and with early atomic physics may prefer to jump ahead to Section 1-7.) 1-2 Waves 1-2.A TravelingWaves A very simple example of a traveling wave is provided by cracking a whip. A pulse of energy is imparted to the whipcord by a single oscillation of the handle. This results in a wave which travels down the cord, transferring the energy to the popper at the end of the whip. In Fig. 1-1, an idealization of the process is sketched. The shape of the disturbance in the whip is called the wave profile and is usually symbolized .(x). The wave profile for the traveling wave in Fig. 1-1 shows where the energy is located at a given instant. It also contains the information needed to tell how much energy is being transmitted, because the height and shape of the wave reflect the vigor with which the handle was oscillated. 1 2 Chapter 1 ClassicalWaves and the Time-Independent Schr ¨ odingerWave Equation Figure 1-1  Cracking the whip. As time passes, the disturbance moves from left to right along the extended whip cord. Each segment of the cord oscillates up and down as the disturbance passes by, ultimately returning to its equilibrium position. The feature common to all traveling waves in classical physics is that energy is transmitted through a medium. The medium itself undergoes no permanent displacement; it merely undergoes local oscillations as the disturbance passes through. One of the most important kinds of wave in physics is the harmonic wave, for which thewave profile is a sinusoidal function. Aharmonicwave, at a particular instant in time, is sketched in Fig. 1-2. The maximum displacement of the wave from the rest position is the amplitude of the wave, and the wavelength . is the distance required to enclose one complete oscillation. Such a wave would result from a harmonic1 oscillation at one end of a taut string. Analogous waves would be produced on the surface of a quiet pool by a vibrating bob, or in air by a vibrating tuning fork. At the instant depicted in Fig. 1-2, the profile is described by the function .(x)=Asin(2px/.) (1-1) (. =0 when x =0, and the argument of the sine function goes from 0 to 2p, encompassing one complete oscillation as x goes from 0 to ..) Let us suppose that the situation in Fig. 1-2 pertains at the time t =0, and let the velocity of the disturbance through the medium be c. Then, after time t , the distance traveled is ct, the profile is shifted to the right by ct and is now given by (x, t)=Asin[(2p/.)(x -ct)] (1-2) Figure 1-2  A harmonic wave at a particular instant in time. A is the amplitude and . is the wavelength. 1A harmonic oscillation is one whose equation of motion has a sine or cosine dependence on time. Section 1-2 Waves 3 A capital  is used to distinguish the time-dependent function (1-2) from the timeindependent function (1-1). The frequency . of a wave is the number of individual repeating wave units passing a point per unit time. For our harmonic wave, this is the distance traveled in unit time c divided by the length of a wave unit .. Hence, . =c/. (1-3) Note that the wave described by the formula (x, t)=Asin[(2p/.)(x -ct)+] (1-4) is similar to  of Eq. (1-2) except for being displaced. If we compare the two waves at the same instant in time, we find  to be shifted to the left of  by ./2p. If  =p, 3p, . . . , then  is shifted by ./2, 3./2, . . . and the two functions are said to be exactly out of phase. If  =2p, 4p, . . . , the shift is by ., 2., . . . , and the two waves are exactly in phase.  is the phase factor for  relative to . Alternatively, we can compare the two waves at the same point in x, in which case the phase factor causes the two waves to be displaced from each other in time. 1-2.B StandingWaves In problems of physical interest, the medium is usually subject to constraints. For example, a string will have ends, and these may be clamped, as in a violin, so that they cannot oscillate when the disturbance reaches them. Under such circumstances, the energy pulse is unable to progress further. It cannot be absorbed by the clamping mechanism if it is perfectly rigid, and it has no choice but to travel back along the string in the opposite direction. The reflected wave is now moving into the face of the primary wave, and the motion of the string is in response to the demands placed on it by the two simultaneous waves: (x, t)=primary(x, t)+reflected(x, t) (1-5) When the primary and reflected waves have the same amplitude and speed, we can write (x, t) = Asin [(2p/.)(x -ct)]+Asin [(2p/.)(x +ct)] = 2Asin(2px/.) cos(2pct/.) (1-6) This formula describes a standing wave—a wave that does not appear to travel through the medium, but appears to vibrate “in place.” The first part of the function depends only on the x variable. Wherever the sine function vanishes,  will vanish, regardless of the value of t . This means that there are places where the medium does not ever vibrate. Such places are called nodes. Between the nodes, sin(2px/.) is finite. As time passes, the cosine function oscillates between plus and minus unity. This means that  oscillates between plus and minus the value of sin(2px/.). We say that the xdependent part of the function gives the maximum displacement of the standing wave, and the t-dependent part governs the motion of the medium back and forth between these extremes of maximum displacement. A standing wave with a central node is shown in Fig. 1-3. 4 Chapter 1 ClassicalWaves and the Time-Independent Schr ¨ odingerWave Equation Figure 1-3  A standing wave in a string clamped at x =0 and x =L. The wavelength . is equal to L. Equation (1-6) is often written as (x, t)=.(x) cos(.t) (1-7) where .=2pc/. (1-8) The profile .(x) is often called the amplitude function and . is the frequency factor. Let us consider how the energy is stored in the vibrating string depicted in Fig. 1-3. The string segments at the central node and at the clamped endpoints of the string do not move. Hence, their kinetic energies are zero at all times. Furthermore, since they are never displaced from their equilibrium positions, their potential energies are likewise always zero. Therefore, the total energy stored at these segments is always zero as long as the string continues to vibrate in the mode shown. The maximum kinetic and potential energies are associated with those segments located at the wave peaks and valleys (called the antinodes) because these segments have the greatest average velocity and displacement from the equilibrium position. Amore detailed mathematical treatment would show that the total energy of any string segment is proportional to .(x)2 (Problem 1-7). 1-3 The ClassicalWave Equation It is one thing to draw a picture of a wave and describe its properties, and quite another to predict what sort of wave will result from disturbing a particular system. To make such predictions, we must consider the physical laws that the medium must obey. One condition is that the medium must obey Newton’s laws of motion. For example, any segment of string of massmsubjected to a force F must undergo an acceleration ofF/m in accord with Newton’s second law. In this regard, wave motion is perfectly consistent with ordinary particle motion. Another condition, however, peculiar to waves, is that each segment of the medium is “attached” to the neighboring segments so that, as it is displaced, it drags along its neighbor, which in turn drags along its neighbor, Section 1-3 The ClassicalWave Equation 5 Figure 1-4  A segment of string under tension T . The forces at each end of the segment are decomposed into forces perpendicular and parallel to x. etc. This provides the mechanism whereby the disturbance is propagated along the medium.2 Let us consider a string under a tensile force T . When the string is displaced from its equilibrium position, this tension is responsible for exerting a restoring force. For example, observe the string segment associated with the region x to x +dx in Fig. 1-4. Note that the tension exerted at either end of this segment can be decomposed into components parallel and perpendicular to the x axis. The parallel component tends to stretch the string (which, however, we assume to be unstretchable), the perpendicular component acts to accelerate the segment toward or away from the rest position. At the right end of the segment, the perpendicular component F divided by the horizontal component gives the slope of T . However, for small deviations of the string from equilibrium (that is, for small angle a) the horizontal component is nearly equal in length to the vector T . This means that it is a good approximation to write slope of vector T =F/T at x +dx (1-9) But the slope is also given by the derivative of , and so we can write Fx+dx =T (./.x)x+dx (1-10) At the other end of the segment the tensile force acts in the opposite direction, and we have Fx =-T (./.x)x (1-11) The net perpendicular force on our string segment is the resultant of these two: F =T (./.x)x+dx -(./.x)x (1-12) The difference in slope at two infinitesimally separated points, divided by dx, is by definition the second derivative of a function. Therefore, F =T .2/.x2 dx (1-13) 2Fluids are of relatively low viscosity, so the tendency of one segment to drag along its neighbor is weak. For this reason fluids are poor transmitters of transverse waves (waves in which the medium oscillates in a direction perpendicular to the direction of propagation). In compression waves, one segment displaces the next by pushing it. Here the requirement is that the medium possess elasticity for compression. Solids and fluids often meet this requirement well enough to transmit compression waves. The ability of rigid solids to transmit both wave types while fluids transmit only one type is the basis for using earthquake-induced waves to determine how deep the solid part of the earth’s mantle extends. 6 Chapter 1 ClassicalWaves and the Time-Independent Schr ¨ odingerWave Equation Equation (1-13) gives the force on our string segment. If the string has mass m per unit length, then the segment has mass mdx, and Newton’s equation F =ma may be written T .2/.x2 =m.2/.t2 (1-14) where we recall that acceleration is the second derivative of position with respect to time. Equation (1-14) is the wave equation for motion in a string of uniform density under tension T . It should be evident that its derivation involves nothing fundamental beyond Newton’s second law and the fact that the two ends of the segment are linked to each other and to a common tensile force. Generalizing this equation to waves in three-dimensional media gives  .2 .x2 + .2 .y2 + .2 .z2  (x, y, z, t) =ß .2 (x, y, z, t) .t2 (1-15) where ß is a composite of physical quantities (analogous to m/T ) for the particular system. Returning to our string example, we have in Eq. (1-14) a time-dependent differential equation. Suppose we wish to limit our consideration to standing waves that can be separated into a space-dependent amplitude function and a harmonic time-dependent function. Then (x, t)=.(x) cos(.t) (1-16) and the differential equation becomes cos(.t) d2. (x) dx2 = m T .(x) d2 cos(.t) dt2 =- m T .(x).2 cos(.t) (1-17) or, dividing by cos(.t), d2.(x)/dx2=-(.2m/T ).(x) (1-18) This is the classical time-independent wave equation for a string. We can see by inspection what kind of function .(x) must be to satisfy Eq. (1-18). . is a function that, when twice differentiated, is reproduced with a coefficient of -.2m/T . One solution is . =Asin .m/T x (1-19) This illustrates that Eq. (1-18) has sinusoidally varying solutions such as those discussed in Section 1-2. Comparing Eq. (1-19) with (1-1) indicates that 2p/. = .vm/T . Substituting this relation into Eq. (1-18) gives d2.(x)/dx2=-(2p/.)2.(x) (1-20) which is a more useful form for our purposes. For three-dimensional systems, the classical time-independent wave equation for an isotropic and uniform medium is (.2/.x2 +.2/.y2 +.2/.z2).(x, y, z)=-(2p/.)2.(x, y, z) (1-21) Section 1-4 StandingWaves in a Clamped String 7 where . depends on the elasticity of the medium. The combination of partial derivatives on the left-hand side of Eq. (1-21) is called the Laplacian, and is often given the shorthand symbol .2 (del squared). This would give for Eq. (1-21) .2.(x, y, z)=-(2p/.)2.(x, y, z) (1-22) 1-4 StandingWaves in a Clamped String We now demonstrate how Eq. (1-20) can be used to predict the nature of standing waves in a string. Suppose that the string is clamped at x = 0 and L. This means that the string cannot oscillate at these points. Mathematically this means that .(0)=.(L)=0 (1-23) Conditions such as these are called boundary conditions. Our question is, “What functions . satisfy Eq. (1-20) and also Eq. (1-23)?” We begin by trying to find the most general equation that can satisfy Eq. (1-20). We have already seen that Asin(2px/.) is a solution, but it is easy to show that Acos(2px/.) is also a solution. More general than either of these is the linear combination3 .(x)=Asin(2px/.)+B cos(2px/.) (1-24) By varying A and B, we can get different functions .. There are two remarks to be made at this point. First, some readers will have noticed that other functions exist that satisfy Eq. (1-20). These are Aexp(2pix/.) and Aexp(-2pix/.), where i = v-1. The reason we have not included these in the general function (1-24) is that these two exponential functions are mathematically equivalent to the trigonometric functions. The relationship is exp(±ikx)=cos(kx)±i sin(kx). (1-25) This means that any trigonometric function may be expressed in terms of such exponentials and vice versa. Hence, the set of trigonometric functions and the set of exponentials is redundant, and no additional flexibility would result by including exponentials in Eq. (1-24) (see Problem 1-1). The two sets of functions are linearly dependent.4 The second remark is that for a given A and B the function described by Eq. (1-24) is a single sinusoidal wave with wavelength .. By altering the ratio of A to B, we cause the wave to shift to the left or right with respect to the origin. If A=1 and B =0, the wave has a node at x =0. If A=0 and B =1, the wave has an antinode at x =0. We nowproceed by letting the boundary conditions determine the constantsAandB. The condition at x =0 gives .(0)=Asin(0)+B cos(0)=0 (1-26) 3Given functions f1,f2,f3 . . . . A linear combination of these functions is c1f1 +c2f2 +c3f3 +···, where c1, c2, c3, . . . are numbers (which need not be real). 4If one member of a set of functions (f1,f2,f3, . . . ) can be expressed as a linear combination of the remaining functions (i.e., if f1 =c2f2 +c3f3 +···), the set of functions is said to be linearly dependent. Otherwise, they are linearly independent. 8 Chapter 1 ClassicalWaves and the Time-Independent Schr ¨ odingerWave Equation However, since sin(0)=0 and cos(0)=1, this gives B =0 (1-27) Therefore, our first boundary condition forces B to be zero and leaves us with .(x)=Asin(2px/.) (1-28) Our second boundary condition, at x =L, gives .(L)=Asin(2pL/.)=0 (1-29) One solution is provided by settingAequal to zero. This gives. =0, which corresponds to no wave at all in the string. This is possible, but not very interesting. The other possibility is for 2pL/. to be equal to 0,±p,±2p, . . . ,±np, . . . since the sine function vanishes then. This gives the relation 2pL/.=np, n=0,±1,±2, . . . (1-30) or .=2L/n, n=0,±1,±2, . . . (1-31) Substituting this expression for . into Eq. (1-28) gives .(x)=Asin(npx/L), n=0,±1,±2, . . . (1-32) Some of these solutions are sketched in Fig. 1-5. The solution for n=0 is again the uninteresting . =0 case. Furthermore, since sin(-x) equals -sin(x), it is clear that the set of functions produced by positive integers n is not physically different from the set produced by negative n, so we may arbitrarily restrict our attention to solutions with positive n. (The two sets are linearly dependent.) The constant A is still undetermined. It affects the amplitude of the wave. To determine A would require knowing how much energy is stored in the wave, that is, how hard the string was plucked. It is evident that there are an infinite number of acceptable solutions, each one corresponding to a different number of half-waves fitting between 0 and L. But an even larger infinity of waves has been excluded by the boundary conditions—namely, all waves having wavelengths not divisible into 2L an integral number of times. The result Figure 1-5  Solutions for the time-independent wave equation in one dimension with boundary conditions .(0)=.(L)=0. Section 1-5 Light as an ElectromagneticWave 9 of applying boundary conditions has been to restrict the allowed wavelengths to certain discrete values. As we shall see, this behavior is closely related to the quantization of energies in quantum mechanics. The example worked out above is an extremely simple one. Nevertheless, it demonstrates how a differential equation and boundary conditions are used to define the allowed states for a system. One could have arrived at solutions for this case by simple physical argument, but this is usually not possible in more complicated cases. The differential equation provides a systematic approach for finding solutions when physical intuition is not enough. 1-5 Light as an ElectromagneticWave Suppose a charged particle is caused to oscillate harmonically on the z axis. If there is another charged particle some distance away and initially at rest in the xy plane, this second particle will commence oscillating harmonically too. Thus, energy is being transferred from the first particle to the second, which indicates that there is an oscillating electric field emanating from the first particle. We can plot the magnitude of this electric field at a given instant as it would be felt by a series of imaginary test charges stationed along a line emanating from the source and perpendicular to the axis of vibration (Fig. 1-6). If there are some magnetic compasses in the neighborhood of the oscillating charge, these will be found to swing back and forth in response to the disturbance. This means that an oscillating magnetic field is produced by the charge too. Varying the placement of the compasses will show that this field oscillates in a plane perpendicular to the axis of vibration of the charged particle. The combined electric and magnetic fields traveling along one ray in the xy plane appear in Fig. 1-7. The changes in electric and magnetic fields propagate outward with a characteristic velocity c, and are describable as a traveling wave, called an electromagnetic wave. Its frequency . is the same as the oscillation frequency of the vibrating charge. Its wavelength is . = c/.. Visible light, infrared radiation, radio waves, microwaves, ultraviolet radiation, X rays, and . rays are all forms of electromagnetic radiation, their only difference being their frequencies .. We shall continue the discussion in the context of light, understanding that it applies to all forms of electromagnetic radiation. Figure 1-6  A harmonic electric-field wave emanating from a vibrating electric charge. The wave magnitude is proportional to the force felt by the test charges. The charges are only imaginary; if they actually existed, they would possess mass and under acceleration would absorb energy from the wave, causing it to attenuate. 10 Chapter 1 ClassicalWaves and the Time-Independent Schr ¨ odingerWave Equation Figure 1-7  A harmonic electromagnetic field produced by an oscillating electric charge. The arrows without attached charges show the direction in which the north pole of a magnet would be attracted. The magnetic field is oriented perpendicular to the electric field. If a beam of light is produced so that the orientation of the electric field wave is always in the same plane, the light is said to be plane (or linearly) polarized. The planepolarized light shown in Fig. 1-7 is said to be z polarized. If the plane of orientation of the electric field wave rotates clockwise or counterclockwise about the axis of travel (i.e., if the electric field wave “corkscrews” through space), the light is said to be right or left circularly polarized. If the light is a composite of waves having random field orientations so that there is no resultant orientation, the light is unpolarized. Experiments with light in the nineteenth century and earlier were consistent with the view that light is a wave phenomenon. One of the more obvious experimental verifications of this is provided by the interference pattern produced when light from a point source is allowed to pass through a pair of slits and then to fall on a screen. The resulting interference patterns are understandable only in terms of the constructive and destructive interference of waves. The differential equations of Maxwell, which provided the connection between electromagnetic radiation and the basic laws of physics, also indicated that light is a wave. But there remained several problems that prevented physicists from closing the book on this subject. Onewas the inability of classical physical theory to explain the intensity andwavelength characteristics of light emitted by a glowing “blackbody.” This problem was studied by Planck, who was forced to conclude that the vibrating charged particles producing the light can exist only in certain discrete (separated) energy states. We shall not discuss this problem. Another problem had to do with the interpretation of a phenomenon discovered in the late 1800s, called the photoelectric effect. 1-6 The Photoelectric Effect This phenomenon occurs when the exposure of some material to light causes it to eject electrons. Many metals do this quite readily. A simple apparatus that could be used to study this behavior is drawn schematically in Fig. 1-8. Incident light strikes the metal dish in the evacuated chamber. If electrons are ejected, some of them will strike the collecting wire, giving rise to a deflection of the galvanometer. In this apparatus, one can vary the potential difference between the metal dish and the collecting wire, and also the intensity and frequency of the incident light. Suppose that the potential difference is set at zero and a current is detected when light of a certain intensity and frequency strikes the dish. This means that electrons Section 1-6 The Photoelectric Effect 11 Figure 1-8  A phototube. are being emitted from the dish with finite kinetic energy, enabling them to travel to the wire. If a retarding potential is now applied, electrons that are emitted with only a small kinetic energy will have insufficient energy to overcome the retarding potential and will not travel to the wire. Hence, the current being detected will decrease. The retarding potential can be increased gradually until finally even the most energetic photoelectrons cannot make it to the collecting wire. This enables one to calculate the maximum kinetic energy for photoelectrons produced by the incident light on the metal in question. The observations from experiments of this sort can be summarized as follows: 1. Below a certain cutoff frequency of incident light, no photoelectrons are ejected, no matter how intense the light. 2. Above the cutoff frequency, the number of photoelectrons is directly proportional to the intensity of the light. 3. As the frequency of the incident light is increased, the maximum kinetic energy of the photoelectrons increases. 4. In cases where the radiation intensity is extremely low (but frequency is above the cutoff value) photoelectrons are emitted from the metal without any time lag. Some of these results are summarized graphically in Fig. 1-9. Apparently, the kinetic energy of the photoelectron is given by kinetic energy=h(. -.0) (1-33) where h is a constant. The cutoff frequency .0 depends on the metal being studied (and also its temperature), but the slope h is the same for all substances. We can also write the kinetic energy as kinetic energy=energy of light-energy needed to escape surface (1-34) 12 Chapter 1 ClassicalWaves and the Time-Independent Schr ¨ odingerWave Equation Figure 1-9  Maximum kinetic energy of photoelectrons as a function of incident light frequency, where .0 is the minimum frequency for which photoelectrons are ejected from the metal in the absence of any retarding or accelerating potential. The last quantity in Eq. (1-34) is often referred to as the work function W of the metal. Equating Eq. (1-33) with (1-34) gives energy of light-W =h. -h.0 (1-35) The material-dependent term W is identified with the material-dependent term h.0, yielding energy of light=E =h. (1-36) where the value of h has been determined to be 6.626176×10-34 J sec. (See Appendix 10 for units and conversion factors.) Physicists found it difficult to reconcile these observations with the classical electromagnetic field theory of light. For example, if light of a certain frequency and intensity causes emission of electrons having a certain maximum kinetic energy, one would expect increased light intensity (corresponding classically to a greater electromagnetic field amplitude and hence greater energy density) to produce photoelectrons of higher kinetic energy. However, it only produces more photoelectrons and does not affect their energies. Again, if light is a wave, the energy is distributed over the entire wavefront and this means that a low light intensity would impart energy at a very low rate to an area of surface occupied by one atom. One can calculate that it would take years for an individual atom to collect sufficient energy to eject an electron under such conditions. No such induction period is observed. An explanation for these results was suggested in 1905 by Einstein, who proposed that the incident light be viewed as being comprised of discrete units of energy. Each such unit, or photon, would have an associated energy of h.,where . is the frequency of the oscillating emitter. Increasing the intensity of the light would correspond to increasing the number of photons, whereas increasing the frequency of the light would increase the energy of the photons. If we envision each emitted photoelectron as resulting from a photon striking the surface of the metal, it is quite easy to see that Einstein’s proposal accords with observation. But it creates a new problem: If we are to visualize light as a stream of photons, how can we explain the wave properties of light, such as the double-slit diffraction pattern? What is the physical meaning of the electromagnetic wave? Section 1-6 The Photoelectric Effect 13 Essentially, the problem is that, in the classical view, the square of the electromagnetic wave at any point in space is a measure of the energy density at that point. Now the square of the electromagnetic wave is a continuous and smoothly varying function, and if energy is continuous and infinitely divisible, there is no problem with this theory. But if the energy cannot be divided into amounts smaller than a photon—if it has a particulate rather than a continuous nature—then the classical interpretation cannot apply, for it is not possible to produce a smoothly varying energy distribution from energy particles any more than it is possible to produce, at the microscopic level, a smooth density distribution in gas made from atoms of matter. Einstein suggested that the square of the electromagnetic wave at some point (that is, the sum of the squares of the electric and magnetic field magnitudes) be taken as the probability density for finding a photon in the volume element around that point. The greater the square of the wave in some region, the greater is the probability for finding the photon in that region. Thus, the classical notion of energy having a definite and smoothly varying distribution is replaced by the idea of a smoothly varying probability density for finding an atomistic packet of energy. Let us explore this probabilistic interpretation within the context of the two-slit interference experiment. We know that the pattern of light and darkness observed on the screen agrees with the classical picture of interference of waves. Suppose we carry out the experiment in the usual way, except we use a light source (of frequency .) so weak that only h. units of energy per second pass through the apparatus and strike the screen. According to the classical picture, this tiny amount of energy should strike the screen in a delocalized manner, producing an extremely faint image of the entire diffraction pattern. Over a period of many seconds, this pattern could be accumulated (on a photographic plate, say) andwould become more intense. According to Einstein’s view, our experiment corresponds to transmission of one photon per second and each photon strikes the screen at a localized point. Each photon strikes a new spot (not to imply the same spot cannot be struck more than once) and, over a long period of time, they build up the observed diffraction pattern. If we wish to state in advance where the next photon will appear, we are unable to do so. The best we can do is to say that the next photon is more likely to strike in one area than in another, the relative probabilities being quantitatively described by the square of the electromagnetic wave. The interpretation of electromagnetic waves as probability waves often leaves one with some feelings of unreality. If the wave only tells us relative probabilities for finding a photon at one point or another, one is entitled to ask whether the wave has “physical reality,” or if it is merely a mathematical device which allows us to analyze photon distribution, the photons being the “physical reality.” We will defer discussion of this question until a later section on electron diffraction. EXAMPLE 1-1 Aretarding potential of 2.38 volts just suffices to stop photoelectrons emitted from potassium by light of frequency 1.13×1015 s-1. What is the work function, W, of potassium? SOLUTION  Elight =h. =W +KEelectron,W =h. -KEelectron =(4.136×10-15eV s) (1.13×1015 s-1)-2.38eV=4.67eV-2.38eV=2.29eV [Note convenience of using h in units of eV s for this problem. See Appendix 10 for data.]  14 Chapter 1 ClassicalWaves and the Time-Independent Schr ¨ odingerWave Equation EXAMPLE 1-2 Spectroscopists often express E for a transition between states in wavenumbers , e.g., m-1, or cm-1, rather than in energy units like J or eV. (Usually cm-1 is favored, so we will proceed with that choice.) a) What is the physical meaning of the term wavenumber? b) What is the connection between wavenumber and energy? c) What wavenumber applies to an energy of 1.000 J? of 1.000 eV? SOLUTION  a)Wavenumber is the number of waves that fit into a unit of distance (usually of one centimeter). It is sometimes symbolized ˜.. ˜. =1/., where . is the wavelength in centimeters. b)Wavenumber characterizes the light that has photons of the designated energy. E=h. =hc/.= hc˜.. (where c is given in cm/s). c) E = 1.000 J = hc˜.; ˜. = 1.000 J/hc = 1.000 J /[(6.626×10-34 J s)(2.998×1010 cm/s)] = 5.034×1022 cm-1. Clearly, this is light of an extremely short wavelength since more than 1022 wavelengths fit into 1 cm. For 1.000 eV, the above equation is repeated using h in eV s. This gives ˜. =8065cm-1.  1-7 TheWave Nature of Matter Evidently light haswave and particle aspects, and we can describe it in terms of photons, which are associated withwaves of frequency .=E/h. Nowphotons are rather peculiar particles in that they have zero rest mass. In fact, they can exist only when traveling at the speed of light. The more normal particles in our experience have nonzero rest masses and can exist at any velocity up to the speed-of-light limit. Are there also waves associated with such normal particles? Imagine a particle having a finite rest mass that somehow can be made lighter and lighter, approaching zero in a continuous way. It seems reasonable that the existence of a wave associated with the motion of the particle should become more and more apparent, rather than the wave coming into existence abruptly when m=0. De Broglie proposed that all material particles are associated with waves, which he called “matter waves,” but that the existence of these waves is likely to be observable only in the behaviors of extremely light particles. De Broglie’s relation can be reached as follows. Einstein’s relation for photons is E =h. (1-37) But a photon carrying energy E has a relativistic mass given by E =mc2 (1-38) Equating these two equations gives E =mc2 =h. =hc/. (1-39) or mc=h/. (1-40) Section 1-7 TheWave Nature of Matter 15 Anormal particle, with nonzero rest mass, travels at a velocity v. If we regard Eq. (1-40) as merely the high-velocity limit of a more general expression, we arrive at an equation relating particle momentum p and associated wavelength .: mv =p =h/. (1-41) or .=h/p (1-42) Here, m refers to the rest mass of the particle plus the relativistic correction, but the latter is usually negligible in comparison to the former. This relation, proposed by de Broglie in 1922, was demonstrated to be correct shortly thereafter when Davisson and Germer showed that a beam of electrons impinging on a nickel target produced the scattering patterns one expects from interferingwaves. These “electron waves” were observed to have wavelengths related to electron momentum in just the manner proposed by de Broglie. Equation (1-42) relates the de Broglie wavelength . of a matter wave to the momentum p of the particle. A higher momentum corresponds to a shorter wavelength. Since kinetic energy T =mv2 =(1/2m)(m2v2)=p2/2m (1-43) it follows that p =v2mT (1-44) Furthermore, Since E = T + V , where E is the total energy and V is the potential energy, we can rewrite the de Broglie wavelength as .= h v2m(E -V ) (1-45) Equation (1-45) is useful for understanding the way in which . will change for a particle moving with constant total energy in a varying potential. For example, if the particle enters a region where its potential energy increases (e.g., an electron approaches a negatively charged plate), E -V decreases and . increases (i.e., the particle slows down, so its momentum decreases and its associated wavelength increases). We shall see examples of this behavior in future chapters. Observe that if E = V,. as given by Eq. (1-45) is real. However, if E U. The first of these corresponds to the classical situation where the particle has insufficient energy to escape from the box and get into region II. Let us see what quantum mechanics says about this case in region II. For this case, .II is imaginary since .II =h/2m(E -U) (2-31) and E - U is negative. Because .II is imaginary, it is more convenient to use the general form (2-30) because then the i in the exponential argument can combine with the i of .II to produce a real argument. Let us assume that .II is equal to i times a positive number. (This will not affect our results.) Let us now examine the properties of the two exponential functions in Eq. (2-30). The first exponential has an argument that is real (because the i’s cancel) and positive (because of our above assumption). As x increases, this exponential increases rapidly, approaching infinity. Since acceptable functions do not blow up like this, we set CII equal to zero to prevent it. The second exponential has a negative, real argument, so it decays exponentially toward zero as x approaches infinity. This is acceptable behavior, and we are left with .II =DII exp(-2pix/.II) (2-32) We nowhave formulas describing fragments of thewavefunction for the two regions. All that remains is to join these together at x = L in such a way that the resulting wavefunction is continuous at x =L and has a continuous first derivative there. (Recall 40 Chapter 2 Quantum Mechanics of Some Simple Systems from Section 2-1 that this second requirement results from the fact that the potential is finite at x =L. Hence, . must be smooth at x =L.) The continuity requirement gives AI sin(2pL/.I)=DII exp(-2piL/.II) (2-33) Taking the derivatives of .I and .II and setting these equal at x =L (to force smoothness) gives (2p/.I)AI cos(2pL/.I)=(-2pi/.II)DII exp(-2piL/.II) (2-34) The exponential term is common to both Eqs. (2-33) and (2-34), providing the basis for another equality: AI sin(2pL/.I)=(-AI.II/i.I) cos(2pL/.I) (2-35) or tan(2pL/.I)=i.II/.I (2-36) Substituting for .I and .II as indicated by Eqs. (2-28) and (2-31) gives tan(2pLv2mE/h)=-vE/vU -E (2-37) The only unknown in Eq. (2-37) is the total energy E. For given values of L, m, and U, only certain values of EU, let us discuss in detail the results just obtained. In the first place, the energies are quantized, much as they were in the infinitely deep square well. There is some difference, however. In the infinitely deep well or box, the energy levels increased with the square of the quantum number n. Here they increase less rapidly (the dashed lines in Fig. 2-10 show the allowed energy levels which result when U=8) because the barrier becomes effectively less restrictive for particles with higher energies (see the following). For the lowest solution, for example, slightly less than one-half a sine wave is needed in one box width of distance. Thus, the wavelength here is slightly longer than in an infinitely deep well of equal width, and so, by de Broglie’s relation, the energy is slightly lower. Notice that the effect of lowering the height of one wall is least for the levels lying deepest in the well. The solutions sketched in Fig. 2-10 indicate that there is a finite probability for finding the particle in the region x >L even though it must have a negative kinetic energy there. Thus, quantum mechanics allows the particle to penetrate into regions where classical mechanics claims it cannot go. Notice that the penetration becomes more appreciable as the energy of the particle approaches that of the barrier. This results Section 2-3 The Particle in a One-Dimensional “Box” with One FiniteWall 41 Figure 2-9  Graphical solution of the equation -tan(2pLv2mE/h) = vE/vU -E. Here L = 2.50 nm, m = 9.11 × 10-31 kg, U = 1 eV = 16.02 × 10-20 J. Intersections occur at E = 0.828×10-20 J, 3.30×10-20 J, 7.36×10-20 J and 12.8×10-20 J. from the fact that E - U determines the rate at which the exponential in .II decays [see Eqs. (2-31) and (2-32)]. In the limit that U.8, the wavefunction vanishes at the barrier, in agreement with the results of the infinite square well of Section 2-1. If the barrier in Fig. 2-8 has finite thickness (V becomes zero again at, say, x =2L), then there is a finite probability that a particle in the well will penetrate through the barrier and appear on the other side. This phenomenon is called quantum-mechanical tunneling, and this is the way, for example, an a particle escapes from a nucleus even though it classically lacks sufficient energy to overcome the attractive nuclear forces. We emphasize that the tunneling referred to in this example is really not a stationary state phenomenon. We have an initial condition (particle in the well) and ask what the half-life is for the escape of the particle—a time-dependent problem. We saw earlier that the energy quantization for the particle in the infinitely deep well could be thought of as resulting from fitting integral numbers of half sine waves into a fixed width. Most sine waves just will not fit perfectly, and so most energies are not allowed. In this problem the waves are allowed to leak past one of the well walls, but we can still see why only certain energies are allowed. Suppose that we pick some arbitrary energy E for the particle. We know that . must be zero at the left wall of the well where V =8. Starting there, we can draw a sine wave of wavelength determined by E across the well to the right wall, as shown in Fig. 2-11. When the wave hits the right wall, it must join on smoothly to a decaying exponential, which also depends on E. Most of the time, it will be impossible to effect a smooth junction, and that particular value of E will be disallowed. Let us now consider the case where E>U. In region I, the considerations are the same as before. Then, .I is a sine wave that can be drawn from the left wall and has a 42 Chapter 2 Quantum Mechanics of Some Simple Systems Figure 2-10  Solutions for particle in well with one finite wall (see Fig. 2-9 for details). Dashed lines correspond to energy levels which would exist if U =8. wavelength determined by E(=T ) from de Broglie’s relation. This sine wave arrives at x =L with a certain magnitude and a certain derivative (assuming that the multiplier A1 has been fixed at some arbitrary value). In region II, we also have a solution of the usual form .II(x)=AII sin(2px/.II)+BII cos(2px/.II) (2-38) where .II is real and determined by E -U, which is now positive. The question is, can we always adjust .II (by changing AII and BII) so that it has the same value and slope at x = L that .I has? A little thought shows that such adjustment is indeed always possible. The two adjustments allowed in .II correspond to a change of phase for .II (a shift in the horizontal direction) and a change in amplitude for .II. The only thing about .II we cannot change is the wavelength, since this is determined by E -U. This is just a physical description of the mathematical circumstance in which we have two adjustable parameters and two requirements to fit—a soluble problem. The essential difference between this case and that of the trapped particle is that here we have fewer Section 2-3 The Particle in a One-Dimensional “Box” with One FiniteWall 43 Figure 2-11  An example of partial wavefunctions for an arbitrary energy E. These functions cannot be joined smoothly at x =L and so this value of E is not allowed. boundary conditions. Before, our square-integrability requirement was used to remove a positive exponential term. That requirement is, in effect, a boundary condition—. must vanish at x=8—and it led to energy quantization. Then we used the normality requirement to achieve unique values for AI and DII. In this case we cannot get a square-integrable solution. .II goes on oscillating as x.8, and so we have no boundary condition there. As a result, E is not quantized and . is not normalizable, so that only ratios of AI, AII, and BII are obtainable. The energy scheme for the particle in the potential well with one finite wall, then, is discrete when EU. Notice the way in which the wavelengths vary in Fig. 2-10. We have already seen that the time-independent Schr¨odinger equation states that the total energy for a particle in a stationary state is the same at all particle positions (i.e., a constant of motion). The kinetic and potential energies must vary together, then, in such a way that their sum is constant. This is reflected by the fact that the wavelength of an unbound solution is shorter in region I than it is in region II. In region I, V = 0, so that all energy of the particle is kinetic (T =E). In region II,V >0, so that the kinetic energy (T =E -V ) is less than it was in region I. Therefore, the de Broglie wavelength, which is related to kinetic energy, must be greater in region II. EXAMPLE 2-4 For the system described in the caption for Fig. 2-9, calculate the percentage drop of the lowest-energy state that results from barrier penetration. SOLUTION  For this, we need to solve the problem for the simple particle-in-a-box system (for which U =8). E1 = n2h2 8mL2 = (1)2(6.626×10-34 J s)2 8(9.11×10-31 kg)(2.500×10-10 m)2 =9.64×10-21 J compared to 8.28×10-21 J. Barrier penetration lowers E1 by 14%.  44 Chapter 2 Quantum Mechanics of Some Simple Systems 2-4 The Particle in an Infinite “Box” with a Finite Central Barrier Another example of barrier penetration in a stationary state of a system is provided by inserting a barrier of finite height and thickness at the midpoint of the infinite square well of Section 2-1 (see Fig. 2-12). The boundary conditions for this problem are easily obtained by obvious extensions of the considerations already discussed. Rather than solve this case directly, we shall make use of our insights from previous systems to deduce the main characteristics of the solutions. Let us begin by considering the case where the barrier is infinitely high. Figure 2-12  (a) Solutions for identical infinite square wells. (b) Effect of finite partition on half waves. (c) Symmetric combination of half waves. (d) Antisymmetric combination of half waves. Section 2-4 The Particle in an Infinite “Box” with a Finite Central Barrier 45 Then the problem becomes merely that of two isolated infinite square wells, each well having solutions as described in Sections 2-1 and 2-2. Now, as the height of the barrier is lowered from infinity, what happens? The levels lying deepest in the two sections should be least affected by the change. They must still vanish at the outer walls but now they can penetrate slightly into the finite barrier. Thus, the lowest state in, say, the left-hand section of the well will begin to look as given in Fig. 2-12b. The solution on the right side will do likewise, of course. As this happens, their energies will decrease slightly since their wavelengths increase. However, since the two wells are no longer separated by an infinite barrier, they are no longer independent. We can no longer talk about separate solutions for the two halves. Each solution for the Schr¨odinger equation is now a solution for the whole system from x=-L to +L. Furthermore, symmetry arguments state that, since the hamiltonian for this problem is symmetric for reflection through x =0, the solutions, if nondegenerate, must be either symmetric or antisymmetric through x =0. This requirement must be reconciled with the barrier-penetration behavior indicated by Fig. 2-12b, which is also occurring. One way to accomplish this is by summing the two halfwaves as shown in Fig. 2-12c, giving a symmetricwavefunction. Alternatively, subtraction gives the antisymmetric form shown in Fig. 2-12d. Both of these solutions will be lower in energy than their infinite-well counterparts, because the wavelengths in Fig. 2-12c and d continue to be longer than in 2-12a. Will their energies be equal to each other? Not quite. By close inspection, we can figure out which solution will have the lower energy. In Figs. 2-12b to 2-12d, the slopes of the half wave, the symmetric, and the antisymmetric combinations at the finite barrier are labeled respectively m, m, and m. What can we say about their relative values? The slope m should be less negative than m because the decaying exponential producing m has an increasing exponential added to it when producing m. Slope m should be more negative than m since the decaying exponential has an increasing exponential subtracted from it in case d, causing it to decay faster. This means that the sine curve on the left-hand side of Fig. 2-12c cannot be identical with that on the left side of Fig. 2-12d since they must arrive at the barrier with different slopes. (The same is true for the right-hand sides, of course.) How can we make the sine wave arrive with a less negative slope m?—by increasing the wavelength slightly so that not quite so much of the sine wave fits into the left well (see Fig. 2-13a). Increasing the wavelength slightly means, by de Broglie’s relation, that the energy of the particle is decreased. Similarly, the sine curve in Fig. 2-12d must be shortened so that it will arrive at the barrier with slope m, which corresponds to an energy increase. Of course, now that the energy has changed outside the barrier, it must change inside the barrier too. This would require going back and modifying the exponentials inside the barrier. But the first step is sufficient to indicate the qualitative results: The symmetric solution has lower energy. In Fig. 2-13a is a detailed sketch of the final solution for the two lowest states. There is a simpler way to decide that the symmetric solution has lower energy. As the barrier height becomes lower and lower, the two solutions become more and more separated in energy, but they always remain symmetric or antisymmetric with respect to reflection since the hamiltonian always has reflection symmetry. In the limit when the barrier completely disappears we have a simple square well again (but larger), the lowest solution of which is symmetric. (See Fig. 2-13b.) This lowest symmetric solution must 46 Chapter 2 Quantum Mechanics of Some Simple Systems Figure 2-13  (a) Detailed sketch of the two lowest solutions for the infinite square-well divided by a finite barrier at the midpoint. The waves are sketched from a common energy value for ease of comparison. Actually, the symmetric wave has a lower energy. (b) A correlation diagram relating energies when the barrier is infinite (left side) with those when the barrier vanishes. Letters A and S refer to antisymmetric and symmetric solutions, respectively. “come from” the symmetric combination of smaller-well wavefunctions sketched at the left of Fig. 2-13b; similarly, the second lowest, antisymmetric solution of the large well correlates with the antisymmetric small-well combination (also at left in Fig. 2-13b). A figure of the kind shown in Fig. 2-13b is called a correlation diagram. It shows how the energy eigenvalues change throughout a continuous, symmetry-conserving process. Section 2-5 The Free Particle in One Dimension 47 We shall see that the correlation of wavefunction symmetries in such a manner as this is a powerful technique in understanding and predicting chemical behavior. The splitting of energy levels resulting from barrier penetration is an extremely pervasive phenomenon in quantum chemistry. It occurs regardless of whether the barrier separates identical or nonidentical potential regions, i.e., regardless of whether the final system is symmetric or unsymmetric. When two atoms (N and N, or C and O) interact to form a molecule, the original atomic wavefunctions combine to form molecular wavefunctions in much the same way as was just described. One of these molecular wavefunctions may have an energy markedly lower than those in the corresponding atoms. Electrons having such a wavefunction will stabilize the molecule relative to the separated atoms. Another case in which energy level splitting occurs is in the vibrational spectrum of ammonia. Ammonia is most stable in a pyramidal configuration, but is capable of inverting through a higher-energy planar configuration into an equivalent “mirror image” pyramid. Thus, vibrations tending to flatten out the ammonia molecule occur in a potential similar to the double well, except that in ammonia the potential is not discontinuous. The lowest vibrational energy levels are not sufficiently high to allow classical inversion of ammonia. However, these vibrational levels are split by interaction through barrier penetration just as quantum mechanics predicts. The energy required to excite ammonia from the lowest of these sublevels to its associated sublevel can be accurately measured through microwave spectroscopy. Knowledge of the level splittings in turn allows a precise determination of the height of the barrier to inversion in ammonia (see Fig. 2-14). It is easy to anticipate the appearance of the solutions for the square well with central barrier for energies greater than the partition height. They will be sinusoidal waves, symmetric or antisymmetric in the well, and vanishing at the walls. Their wavelengths will be somewhat longer in the region of the partition than elsewhere because some of the kinetic energy of the particle is transformed to potential energy there. A sketch of the final results is given in Fig. 2-15. EXAMPLE 2-5 Fig. 2-15 shows energy levels for states when the barrier has finite height. When the barrier is made infinitely high, the levels at E1 and E2 merge into one level. Where does the energy of that one level lie—below E1, between E1 and E2, or above E2—and why? SOLUTION  It lies above E2. When the barrier is finite, there is always some penetration, so . is always at least a little larger than is the case for the infinite barrier. If . is larger, E is lower.  2-5 The Free Particle in One Dimension Suppose a particle of mass m moves in one dimension in a potential that is everywhere zero. The Schr¨odinger equation becomes -h2 8p2m d2. dx2 =E. (2-39) 48 Chapter 2 Quantum Mechanics of Some Simple Systems Figure 2-14  Sketch of potential for inversion vibrational mode in ammonia. The lowest levels are split by tunneling. The low energy transition E1 is visible in the microwave region whereas the second transition E2 is visible in the infrared. E1 =0.16×10-22 J; E2 =7.15×10-22 J. which has as solutions . =Aexp(±2piv2mEx/h) (2-40) or alternatively, trigonometric solutions . =A sin(2pv2mEx/h), . =A cos(2pv2mEx/h) (2-41) As is most easily seen from the exponential forms (2-40), if E is negative, . will blow up at either +8 or -8, and so we reject negative energies. Since there are no boundary conditions, it follows that E can take on any positive value; the energies of the free particle are not quantized. This result would be expected from our earlier results on constrained particles. There we saw that quantization resulted from spatial constraints, and here we have none. The constants A and A of Eqs. (2-40) and (2-41) cannot be evaluated in the usual way, since the solutions do not vanish at x =±8. Sometimes it is convenient to evaluate them to correspond to some experimental situation. For instance, suppose that one was working with a monoenergetic beam of electrons having an intensity of one electron every 10-6 m. Then we could normalize . of Eq. (2-40) so that  10-6 m 0 |.|2 dx =1 There is a surprising difference in the particle distributions predicted from expressions (2-40) and (2-41). The absolute square .*. of the exponentials is a constant (A*A), whereas the squares of the trigonometric functions are fluctuating functions Section 2-5 The Free Particle in One Dimension 49 Figure 2-15  Wavefunctions for the infinite square well with finite partition. of x. It seems sensible for a particle moving without restriction to have a constant probability distribution, and it seems absurd for it to have a varying probability distribution. What causes this peculiar behavior? It results from there being two independent solutions for each value of E (except E =0). This degeneracy with respect to energy means that from a degenerate pair, . and ., one can produce any number of new eigenfunctions, . = a. + b. (Problem 2-11). In such a situation, the symmetry proof of Section 2-2 does not hold. However, there will always be an independent pair of degenerate wavefunctions that will satisfy certain symmetry requirements. Thus, in the problem at hand, we have one pair of solutions, the exponentials, which do have the proper symmetry since their absolute squares are constant. From this pair we can produce any number of linear combinations [one set being given by Eq. (2-41)], but these need not display the symmetry properties anymore. The exponential solutions have another special attribute: A particle whose state is described by one of the exponentials has a definite linear momentum, whereas, when described by a trigonometric function, it does not. In Section 1-9, it was shown that the connection between classical and wave mechanics could be made if one related the classical momentum, px, with a quantum mechanical operator (h/2pi)d/dx. Now, for a particle to have a definite (sharp) value p for its momentum really means that, if we measure the momentum at some instant, there is no possibility of getting any value other than p. This means that the particle in the state described by . always has momentum p, no matter where it is in x; i.e., its momentum is a constant of motion, 50 Chapter 2 Quantum Mechanics of Some Simple Systems just as its energy is. This corresponds to saying that there is an eigenvalue equation for momentum, just as for energy. Thus h 2pi d. dx =p. (2-42) The statement made earlier, that the exponential solutions correspond to the particle having sharp momentum, means that the exponentials (2-40) must be solutions to Eq. (2-42). This is easily verified: h 2pi d dx Aexp±2piv2mEx h  =±v2mE Aexp±2piv2mEx h  Thus, the positive and negative exponential solutions correspond to momentum values of +v2mE and -v2mE, respectively, and are interpreted as referring to particle motion toward +8 and -8 respectively. Since energy is related to the square of the momentum, these two solutions have identical energies. (The solution for E =0 corresponds to no momentum at all, and the directional degeneracy is removed.) A mixture of these states contains contributions from two different momenta but only one energy, so linear combinations of the exponentials fail to maintain a sharp value for momentum but do maintain a sharp value for energy. EXAMPLE 2-6 An electron is accelerated along the x axis towards x=8from rest through a potential drop of 1.000 kV. a) What is its final momentum? b) What is its final de Broglie wavelength? c) What is its final wavefunction? SOLUTION  a) px = v2mE = [2(9.105 × 10-31 kg)(1.602 × 10-16 J)]1/2 = 1.708 × 10-23 kgms-1 b) .= h p = 6.626×10-34 J s 1.708×10-23 kgms-1 =3.879×10-11 m c) . =Aexp(+2piv2mEx/h)(choose + because moving towards x=+8)=Aexp(2pip/h)x =Aexp(2pix/.)=Aexp[2pi(2.578×1010 m-1)x].  2-6 The Particle in a Ring of Constant Potential Suppose that a particle of mass m is free to move around a ring of radius r and zero potential, but that it requires infinite energy to get off the ring. This system has only one variable coordinate—the angle f. In classical mechanics, the useful quantities and relationships for describing such circular motion are those given in Table 2-1. Comparing formulas for linear momentum and angular momentum reveals that the variables mass and linear velocity are analogous to moment of inertia and angular velocity in circular motion, where the coordinate f replaces x. The Schr¨odinger equation for circular motion, then, is -h2 8p2I d2. (f) df2 =E.(f) (2-43) Section 2-6 The Particle in a Ring of Constant Potential 51 TABLE 2-1  Quantity Formula Units Moment of inertia I =mr2 g cm2 or kgm2 Angular velocity .=f/t =v/r s-1 Angular momentum (linear momentum times orbit radius) mvr =I. g cm2/s or erg s or J s which has, as solutions Aexp(±ikf) (2-44) or alternatively A sin(kf) (2-45) and A cos(kf) (2-46) where [substituting Eq. (2-45) or (2-46) into (2-43) and operating] k =2pv2IE/h (2-47) Let us solve the problem first with the trigonometric functions. Starting at some arbitrary point on the ring and moving around the circumference with a sinusoidal function, we shall eventually reencounter the initial point. In order that our wavefunction be single valued, it is necessary that . repeat itself every time f changes by 2p radians. Thus, for f given by Eq. (2-45), sin(kf)=sin[k(f +2p)] (2-48) Similarly, for . given by Eq. (2-49) cos(kf)=cos(kf +2kp) (2-49) Either of these relations is satisfied only if k is an integer. The case in which k =0 is not allowed for the sine function since it then vanishes everywhere and is unsuitable. However, k =0 is allowed for the cosine form. The normalized solutions are, then, . = (1/vp)sin(kf), k =1, 2, 3, . . . . = (1/vp)cos(kf), k =1, 2, 3, . . . . = (1/v2p) (from the k =0 case for the cosine) (2-50) Now let us examine the exponential form of . (Eq. 2-44). The requirement that . repeat itself for f.f +2p gives Aexp(±ikf)=Aexp[±ik(f +2p)]=Aexp(±ikf) exp(±2pik) 52 Chapter 2 Quantum Mechanics of Some Simple Systems or exp(±2pik)=1 Taking the positive case and utilizing Eq. (2-3), we obtain cos(2pk)+i sin(2pk)=1 or cos(2pk)=1 and sin(2pk)=0 Again, k must be an integer. (The same result arises by requiring that d./df repeat for f.f +2p.) Thus, an alternative set of normalized solutions is . =1/v2p exp(ikf) k =0,±1,±2,±3, . . . (2-51) The energies for the particle in the ring are easily obtained from Eq. (2-47): E =k2h2/8p2I, k =0,±1,±2,±3, . . . (2-52) The energies increase with the square of k, just as in the case of the infinite square well potential. Here we have a single state with E =0, and doubly degenerate states above, whereas, in the square well, we had no solution at E =0, and all solutions were nondegenerate. The solution at E =0 means that there is no finite zero point energy to be associated with free rotation, and this is in accord with uncertainty principle arguments since there is no constraint in the coordinate f. The similarity between the particle in a ring and the free particle problems is striking. Aside from the fact that in the ring the energies are quantized and the solutions are normalizable, there are few differences. The exponential solutions (2-51) are eigenfunctions for the angular momentum operator (h/2pi)d/df. The two angular momenta for a pair of degenerate solutions correspond to particle motion clockwise or counterclockwise in the ring. (The nondegenerate solution for E =0 has no angular momentum, hence no ability to achieve degeneracy through directional behavior.) The particle density predicted by the exponentials is uniform in the ring, while that for the trigonometric solutions is not. Since the trigonometric functions tend to localize the particle into part of the ring, thereby causing f =8, it is consistent that they are impure momentum states ( ang. mom. =0). (Infinite uncertainty in the coordinate f means that all values of f in the range 0–2p are equally likely.) EXAMPLE 2-7 Demonstrate that any two degenerate exponential eigenfunctions for a particle in a ring are orthogonal. SOLUTION  Such a pair of degenerate wavefunctions can be written as .+ = 1 v2p exp(ikf), .- = 1 v2p exp(-ikf). These are orthogonal if  2p 0 .*+ .- df =0 where we must use the complex conjugate of either one of the wavefunctions since . is complex. But .*+ =.-, so  2p 0 .-.- df = 1 2p  2p 0 exp(-2if)df = 1 2p  2p 0 [cos(2f)-i sin(2f)]df = 1 2p sin(2f)|2p 0 -i(-cos(2f)|2p 0  = 1 2p [sin(4p)-sin(0)+i cos(4p)-i cos(0)] = 1 2p [0-0+i -i]=0.  Section 2-7 The Particle in a Three-Dimensional Box: Separation of Variables 53 2-7 The Particle in a Three-Dimensional Box: Separation of Variables Let us now consider the three-dimensional analog of the square well of Section 2-1. This would be a three-dimensional box with zero potential inside and infinite potential outside. As before, the particle has no probability for penetrating beyond the box. Therefore, the Schr¨odinger equation is just -h2 8p2m .2 .x2 + .2 .y2 + .2 .z2 . =E. (2-53) and . vanishes at the box edges. The hamiltonian operator on the left side of Eq. (2-53) can be written as a sum of operators, one in each variable (e.g., Hx = (-h2/8p2m).2/.x2). Let us assume for the moment that . can be written as a product of three functions, each one being a function of a different variable, x,y, or z (i.e., . =X(x)Y(y)Z(z)). If we can show that such a . satisfies Eq. (2-53), we will have a much simpler problem to solve. Using this assumption, Eq. (2-53) becomes (Hx +Hy +Hz)XYZ =EXYZ (2-54) This can be expanded and then divided through by XYZ to obtain HxXYZ XYZ + HyXYZ XYZ + HzXYZ XYZ =E (a constant) (2-55) Now, since Hx, for example, operates only on functions of x, but not y or z, we can carry out some limited cancellation. Those functions that are not operated on in a numerator can be canceled against the denominator. Those that are operated on cannot be canceled since these are differential operators [e.g., in (1/x)dx2/dx it is not permissible to cancel 1/x against x2 before differentiating: (1/x)dx2/dx =dx/dx]. Such cancellation gives HxX X + HyY Y + HzZ Z =E (a constant) (2-56) Now, suppose the particle is moving in the box parallel to the x axis so that the variables y and z are not changing. Then, of course, the functions Y and Z are also not changing, so HyY/Y and HzZ/Z are both constant. Only HxX/X can vary—but does it vary? Not according to Eq. (2-56), which reduces under these conditions to HxX X +constant+constant=E (a constant) (2-57) Therefore, even though the particle is moving in the x direction, HxX/X must also be a constant, which we shall callEx. Similar reasoning leads to analogous constantsEy and Ez . Furthermore, the behavior of HxX/X must really be independent of whether the particle is moving parallel to the y and z axes. Even if y and z do change, they do not appear in the quantity HxX/X anyway. Thus we may write, without restriction, HxX X =Ex, HyY Y =Ey, HzZ Z =Ez (2-58) 54 Chapter 2 Quantum Mechanics of Some Simple Systems and, from Eq. (2-56), Ex +Ey +Ez =E (2-59) Our original equation in three variables has been separated into three equations, one in each variable. The first of these equations may be rewritten HxX =ExX (2-60) which is just the Schr¨odinger equation for the particle in the one-dimensional square well, which we have already solved. For a rectangular box with Lx =Ly =Lz we have the general solution . =XYZ =2/Lx sin (nxpx/Lx)2/Ly sin nypy/Ly2/Lz sin (nzpz/Lz) (2-61) and E =Ex +Ey +Ez =h2/8m n2x /L2x +n2y /L2y +n2z /L2z  (2-62) For a cubical box, Lx =Ly =Lz =L, and the energy expression simplifies to E =h2/8mL2 n2x +n2y +n2z  (2-63) The lowest energy occurs when nx =ny =nz =1, and so E1(1)=3h2/8mL2 (2-64) Thus, the cubical box has three times the zero point energy of the corresponding onedimensional well, one-third coming from each independent coordinate for motion (i.e., “degree of freedom”). The one in parentheses indicates that this level is nondegenerate. The next level is produced when one of the quantum numbers n has a value of two while the others have values of one. There are three independent ways of doing this; therefore, the second level is triply degenerate, and E2(3)=6h2/8mL2. Proceeding, E3(3)=9h2/8mL2, E4(3)=11h2/8mL2, E5(1)=12h2/8mL2, E6(6)=14h2/8mL2, etc. Apparently, the energy level scheme and degeneracies of these levels do not proceed in the regular manner which is found in the one-dimensional cases we have studied. EXAMPLE 2-8 Verify that E6 is six-fold degenerate. SOLUTION  E6 = 14h2/8mL2, so n2x + n2y + n2z = 14. There is only one combination of integers that satisfies this relation, namely 1, 2, and 3. So we simply need to deduce how many unique ways we can assign these integers to nx,ny , and nz . There are three ways to assign 1. For each of these three choices, there remain but two ways to assign 2, and then there is only one way to assign 3. So the number of unique ways is 3×2×1=6. (Or one can simply write down all of the possibilities and observe that there are six of them.)  We shall now briefly consider what probability distributions for the particle are predicted by these solutions. The lowest-energy solution has its largest value at the box center where all three sine functions are simultaneously largest. The particle Section 2-7 The Particle in a Three-Dimensional Box: Separation of Variables 55 Figure 2-16  Sketches of particle probability distributions for a particle in a cubical box. (a) nx =ny =nz =1. (b) nx =2, ny =nz =1. (c) nx =ny =nz =2. distribution is sketched in Fig. 2-16a. The second level may be produced by setting nx = 2, and ny = nz = 1. Then there will be a nodal plane running through the box perpendicular to the x axis, producing the split distribution shown in Fig. 2-16b. Since there are three ways this node can be oriented to produce distinct but energetically equal distributions, this energy level is triply degenerate. The particle distribution for the state where nx =ny =nz =2 is sketched in Fig. 2-16c. It is apparent that, in the high energy limit, the particle distribution becomes spread out uniformly throughout the box in accord with the classical prediction. The separation of variables technique which we have used to convert our threedimensional problem into three independent one-dimensional problems will recur in other quantum-chemical applications. Reviewing the procedure makes it apparent that this technique will work whenever the hamiltonian operator can be cleanly broken into parts dependent on completely different coordinates. This is always possible for the kinetic energy operator in cartesian coordinates. However, the potential energy operator often prevents separation of variables in physical systems of interest. It is useful to state the general results of separation of variables. Suppose we have a hamiltonian operator, with associated eigenfunctions and eigenvalues: H.i =Ei.i (2-65) Suppose this hamiltonian can be separated, for example, H (a,ß)=Ha (a)+Hß (ß) (2-66) 56 Chapter 2 Quantum Mechanics of Some Simple Systems where a and ß stand for two different coordinates or groups of coordinates. Then it follows that .j,k =fj (a) gk(ß) (2-67) where Hafj =ajfj (2-68) and Hßgk =bkgk (2-69) Furthermore, Ej,k =aj +bk (2-70) In other words, if a hamiltonian is separable, then the eigenfunctions will be products of eigenfunctions of the subhamiltonians, and the eigenvalues will be sums of the subeigenvalues. 2-8 The Scattering of Particles in One Dimension Consider the potential shown in Fig. 2-17a. We imagine that a beam of particles, each having energy E, originates from the left and travels toward x=8, experiencing a constant potential everywhere except at the potential step at x =0. We are interested in what becomes of these particles—what fraction makes it all the way to the “end” (some kind of particle trap to the right of the step) and what fraction is reflected back toward x=-8. Problems of this type are related to scattering experiments where electrons, for example, travel through potential jumps produced by electronic devices or through a dilute gas where potential changes occur in the neighborhood of atoms. This kind of problem differs from most of those discussed earlier because the particle is not trapped (classically), so all nonnegative energy values are possible. We already know what the form of the eigenfunctions is for the constant potentials to the left and right of the step for any choice of E. On the left they are linear combinations of exp(±iv2mEx/¯h), where ¯h = h/2p, and to the right they are linear combinations of exp(±iv2m(E -U)x/¯h). The only thing we do not yet know is which linear combinations to take. That is, we need to find A,B,C, and D in .left = Aexp(ikx/¯h)+B exp(-ikx/¯h), x <0, k=v2mE (2-71) .right = C exp(ikx/¯h)+Dexp(-ikx/¯h), x >0, k =2m(E -U) (2-72) We have seen earlier that the exponentials having positive arguments correspond to particles traveling from left to right, etc. We signify this with arrows in Fig. 2-17a. The nature of .right is qualitatively different depending on whether E is larger or smaller than the step height U. If E 0 due to barrier penetration does not affect this conclusion. The evaluation of 58 Chapter 2 Quantum Mechanics of Some Simple Systems extent of reflection assumes that a time-independent (steady-state) situation has been achieved, so the penetration population remains constant and none of the new particles entering from the left are “lost” due to barrier penetration.) The situation changes when we consider cases for E>U. For any such E, we now have two acceptable exponential functions on both the right and the left. We proceed by realizing that the function with coefficient D in .right should be rejected since it corresponds to particles traveling from right to left, i.e., to particles that have been reflected from the trap. But we assume the trap to be 100% effective, so once again we have only one term in .right As before, we set about forcing the values and slopes of Eqs. (2-71) and (2-72) (with D =0) to be equal at x =0. This time, however, there is no requirement that these values be real. We arrive at the relations (Problem 2-25) B A = k -k k +k and C A = 2k k +k (2-73) The extent of reflection is equal to |B|2 |A|2 = k -k 2 (k +k)2 (2-74) This can be seen to range from zero, when k=k, to one, when k=0. k=k when E= E-U, i.e., when U is negligible compared toE. So zero reflection (total transmission) is approached in the high-energy limit. Only whenE=U does k=0, so total reflection occurs only when E equals the barrier height (or, as we saw previously, is lower). A plot of the fraction of particles reflected versus E/U appears in Fig. 2-17c. The transmission, equal to 1.0–reflection, is plotted in Fig. 2-17d. The approach represented by this scattering problem is to identify the two terms that can contribute to the wavefunction in each region; then to recognize that one of the terms in one region is lost, either because it explodes or because it corresponds to reflection from the particle trap; then to force a smooth junction at the position of the discontinuity in the potential; and finally to draw conclusions about reflection and transmission from the values of the absolute squares of the coefficients. Notice that, for E >U, we could just as well have postulated the beam to be coming from the right, with the trap at the left. This would lead us to set A=0 in Eq. (2-71). Even in cases like this, where the particles are passing over the edge of a potential cliff, there is backscatter (Problem 2-26). An additional feature appears when we consider potentials that change at two points in x, as in Fig. 2-18a. The solution now involves three regions and two places (x=±a) where . and d./dx must be made equal. As before, we decide where the particle source and trap are and set one coefficient equal to zero (say G). Detailed solution of this problem is tedious.3 For our purposes, it is the nature of the result that is important. The extent of transmission as a function of E/U is plotted in Fig. 2-18b. There are two obvious ways this differs from the step-potential transmission function of Fig. 2-17d: First, some of the particles are transmitted even when their energy is less than U. This is the result of barrier penetration leading to finite particle density at the right-hand side of the barrier transmission due to tunneling. (The extent of tunneling transmission depends on the thickness of the barrier.) Second, there are oscillations 3See Merzbacher [2, Chapter 6]. Section 2-9 Summary 59 Figure 2-18  (a) A rectangular-hill potential of height U and width 2a. (b) Fraction of beam transmitted as a function of E/U, where U =h2/2p2ma2. in the transmission function for E>U, with 100% transmission occurring at intervals in E instead of only in the infinite limit. These come about because of interference between waves reflecting off the front and back edges of the barrier. This is most easily understood by recognizing that 100% transmission corresponds to no reflection, so then B =0. This occurs when the wave reflecting back from x =-a is of opposite phase to that reflecting back from x =+a, and this happens whenever there is an integral number of de Broglie half-wavelengths between x=-a and a. At energies where this happens, the beam behaves as though the potential barrier is not there. The variation of reflection from thin films (e.g., soap bubbles) of light of different wavelengths results in the perception of colors and is a familiar example of scattering interference. Less familiar is the variation in reflection of a particle beam, outlined above. However, once we recognize thewave nature of matter, we must expect particles to manifest the same sort of wave properties we associate with light. 2-9 Summary In this chapter we have discussed the following points: 1. A particle constrained in the classical sense (i.e., lacking the energy to overcome barriers preventing its motion over the entire coordinate range) will have quantized energy levels and a finite zero-point energy. In the mathematical analysis, this arises from requirements on . at boundaries. 60 Chapter 2 Quantum Mechanics of Some Simple Systems 2. . can be nonsmooth, or cusped, where V is infinite at a point. If V is infinite over a finite range, . must be zero there. 3. Nondegenerate eigenfunctions of H must be symmetric or antisymmetric for any operation that leaves H unchanged. 4. |.|2 may be regarded as a statistical measure—a summary of many measurements of position on independent, but identically prepared, systems. 5. Quantum-mechanical predictions approach classical predictions in the limits of large E, or large mass, or very high quantum number values. 6. Integrals with antisymmetric integrands must vanish. 7. |.|2 does not vanish in regions whereV >E if V is finite. This is called “barrier penetration.” 8. One-dimensional motion of a free particle has a continuum of energy levels. Except for E = 0, the states are doubly degenerate. Therefore, any mixture of such a pair of states is still an eigenfunction of H. But only two eigenfunctions (for a given E = 0) are also eigenfunctions for the momentum operator. These are the exponential functions. Since they correspond to different momenta, mixing them produces functions that are not eigenfunctions for the momentum operator. 9. Motion of a particle on a ring has quantum-mechanical solutions very similar to those for free-particle motion in one dimension. In both cases, there is no zero-point energy. Both are doubly degenerate forE>0 because two directional possibilities are present. Both have a set of exponential solutions that are eigenfunctions for momentum. The main difference is that the particle-in-a-ring energies are quantized, due to head-to-tail “joining conditions” on .. 10. Increasing the dimensionality of a particle’s range of motion increases the number of quantum numbers needed to define the wavefunctions. In cases where the hamiltonian operator can be written as a sum of operators for different coordinates (i.e., is “separable”), the problem greatly simplifies; the wavefunctions become products, and the energies become sums. 11. Scattering problems are treated by selecting an energy of interest from the continuum of possibilities, removing functions that describe nonphysical processes such as backscatter from the trap, and matching wave values and slopes at region boundaries. Resulting wavefunctions show wave interference effects similar to those observed for light. 2-9.A Problems 2-1. Ascertain that the expression (2-12) for energy has the proper dimensions. 2-2. Solve Eq. (2-9) for A. 2-3. There is a simple way to show that A in Eq. (2-9) must equal v2/L. It involves sketching .2, recognizing that sin2 x +cos2 x =1, and asking what A must equal Section 2-9 Summary 61 in order to make the area under .2 equal 1. Show this for n=1, and argue why it must give the same result for all n. 2-4. Evaluate the probability for finding a particle in the middle third of a onedimensional box in states with n=1, 2, 3, 104. Compare your answers with the sketches in Fig. 2-5 to see if they are reasonable. 2-5. a) Estimate the probability for finding a particle in the n = 1 state in the line elementx centered at the midpoint of a one-dimensional box ifx=0.01L. How does this compare to the classical probability? b) Repeat the problem, but with x centered one third of the way from a box edge. 2-6. a) Use common sense to evaluate the following integral for the particle in a one-dimensional box, assuming that . is normalized.  L/5 0 .2 5dx b) How does this value compare to that for the integral over the same range, but using .1 instead of .5? (Larger, smaller, or equal?) Use a sketch to defend your answer. 2-7. Let S and A be respectively symmetric and antisymmetric functions for the operator R. Evaluate the following, where R operates on every function to its right: (a) RS (b)RA(c) RSS (d)RAA(e)RAS(f)RAASASSA(g) RAASASAA. Can you think of a simple general rule for telling when a product of symmetric and antisymmetric functions will be antisymmetric? 2-8. Using the concept of odd and even functions, ascertain by inspection of sketches whether the following need be identically zero: a)  p 0 sin . cos. d. b)  p -p sin . cos. d. c)  1 -1 x cos x dx d)  a -a cos y sin2 y dy e)  p 0 sin3 . cos2 . d. f)  p 0 sin2 . cos3 . d. g)  1 -1  1 -1 x2y dx dy h)  p -p x sin x cosx dx i)  p 0 sin x d dx sin2 x dx j)  p -p sin2 x d dx sinx dx 2-9. Verify Eq. (2-23) for the general case n =m by explicit integration. 2-10. For the potential of Fig. 2-8, when EU, they are continuous. Is there a solution with E = U? What special requirements are there, if any, for such a solution to exist? 62 Chapter 2 Quantum Mechanics of Some Simple Systems Figure P2-12  2-11. Prove the following statement: any linear combination of degenerate eigenfunctions of H is also an eigenfunction of H. 2-12. In a few words, indicate what is wrong with the wavefunctions sketched in the potentials shown in Fig. P2-12. If the solution appears to be acceptable, indicate this fact. 2-13. A double-well potential ranges from x = 0 to x = 2L and has a thin (width = 0.01L) rectangular barrier of finite height centered at L. a) Sketch the wavefunction that goes with the fourth energy level in this system, assuming that its energy is less than the height of the barrier. b) Estimate the energy of this level for a particle of mass m. 2-14. Use the simple approach presented in Problem 2-3 to demonstrate that A = 1/vp for the trigonometric particle-in-a-ring eigenfunctions and 1/v2p for the exponential eigenfunctions. 2-15. Explain why (2p)-1/2 exp(iv2f) is unacceptable as a wavefunction for the particle in a ring. 2-16. For a particle in a ring, an eigenfunction is . =(1/vp)cos(3f). a) Write down H. b) Evaluate H. and identify the energy. c) Is this a state for which angular momentum is a constant of motion? Demonstrate that your answer is correct. Section 2-9 Summary 63 2-17. Consider two related systems—a particle in a ring of constant potential and another just like it except for a very thin, infinitely high barrier inserted at f =0. When this barrier is inserted, a) are any energies added or lost? b) do any degeneracies change? c) are exponential and sine–cosine forms both still acceptable for eigenfunctions? d) is angular momentum still a constant of motion? 2-18. Consider a particle of mass m in a two-dimensional box having side lengths Lx and Ly with Lx =2Ly and V =0 in the box,8outside. a) Write an expression for the allowed energy levels of this system. b) What is the zero-point energy? c) Calculate the energies and degeneracies for the lowest eight energy levels. d) Sketch the wavefunction for the fourth level. e) Suppose V = 10 J in the box. What effect has this on (i) the eigenvalues? (ii) the eigenfunctions? 2-19. Consider the particle in a three-dimensional rectangular box with Lx = Ly = Lz/2. What would be the energy when nx = 1,ny = 2,nz = 2? For nx = 1, ny =1, nz =4? Can you guess the meaning of the term “accidental degeneracy?” 2-20. Consider a particle of mass m in a cubical box with V =0 at 00). Thus, the spectrum of eigenvalues sketched in Fig. 4-2 is in qualitative accord with understandings developed earlier. The lowest allowed energy for the system is far above the low-energy limit (-8) of the potential well. This corresponds to the finite zero point energy which we have seen Figure 4-2  The potential function V = -e2/4pe0r with eigenvalues superimposed (dashed lines). Degeneracies for the first few levels are noted on the right. Section 4-1 The Schr ¨ odinger Equation and the Nature of Its Solutions 93 in other systems where particle motion is constrained. Here it means that, at absolute zero, the electron does not come to rest on the nucleus (which would give T = 0, V =-8, E=-8), but rather continues to move about with a finite total energy. All of the energy levels of Fig. 4-2 are degenerate except for the lowest one. The order of the degeneracy is listed next to each of the lowest few levels in Fig. 4-2. This degeneracy is not surprising since we are dealing here with a three- dimensional system, and we have earlier seen that, in such cases (e.g., the cubic box), the physical equivalence of different directions in space can produce degeneracies (called “spatial degeneracies”). We shall see later that some of the degeneracies in this system do indeed result from directional equivalence (here, spherical symmetry), whereas others do not. The discrete, negative eigenvalues are given by the formula En = -µZ2e4 8e2 0h2n2 =(-13.6058eV) Z2 n2 , n=1, 2, 3, . . . (4-8) EXAMPLE 4-2 Calculate the ionization energy (IE) of C5+ in its ground state, in electron volts. SOLUTION  The ionization energy equals the negative of the ground state energy. Z =6 and n=1, so IE =(13.6058eV) 36 1 =489.808eV.  4-1.C The Lowest-EnergyWavefunction We will now discuss the lowest-energy eigenfunction of Eq. (4-6) in some detail, since an understanding of atomic wavefunctions is crucial in quantum chemistry. The derivation of formulas for this and other wavefunctions will be discussed in later sections, but it is not necessary to labor through the mathematical details of the exact solution of Eq. (4-6) to be able to understand most of the essential features of the eigenfunctions. The formula for the normalized, lowest-energy solution of Eq. (4-6) is .(r)=(1/vp)(Z/a0)3/2 exp(-Zr/a0) (4-9) where a0=5.2917706×10-11 m(called the Bohr radius) and Ze is the nuclear charge. A sketch of . versus r for Z=1 is superimposed on the potential function in Fig. 4-3a. It is apparent that the electron penetrates the potential barrier (Problem 4-3). The square of the wavefunction (4-9) tells us how the electron is distributed about the nucleus. In Fig. 4-3b is plotted .2(r) as a function of r. We refer to .2 as the electron probability density function. In this case, the probability density is greatest at the nucleus (r =0) and decays to zero as r becomes infinite. It is important for the chemist to be able to visualize the electron distributions, or charge “clouds,” in atoms and molecules, and various methods of depicting electron distributions have been devised. In Fig. 4-3 a few of these are presented for the lowestenergy wavefunction. The dot picture (Fig. 4-3c) represents what one would expect if one took a multiflash photograph of a magnified, slowed-down hydrogenlike ion (assuming no disturbance of the ion by the photographing process). Each dot represents an instantaneous electron position, and the density of these dots is greatest at the nucleus. 94 Chapter 4 The Hydrogenlike Ion, Angular Momentum, and the Rigid Rotor 0 0 0 Cusp Penetrates barrier Classical turning point 0 0 Z = 1 r Z = 2 r 0 r . .H v (a) (b) (c) (d) (e) (f) 2 .He+ 2 Figure 4-3  (a) H-atom wavefunction superposed on -e2/r potential curve. (b) Wavefunction squared for H and He+. (c) Dot picture of electron distribution. (d) Contour diagram of electron distribution. (e) Computer-generated graphic version of (d). (f) Single-contour representation of electron distribution. An alternative way of picturing the charge is to draw a contour diagram, each contour indicating that the density has increased or decreased by a certain amount (Fig. 4-3d). A more striking version of the contour plot for .2 1s is shown in Fig. 4-3e. Perhaps the simplest representation is a sketch of the single contour that encloses a certain amount (say 90%) of the electronic charge (Fig. 4-3f). (We have been describing the electron as a point charge moving rapidly about the nucleus. However, for most purposes it is just as convenient to picture the electron as being smeared out into a charge cloud like those sketched in Fig. 4-3. Thus, the statements the electron spends 90% of its time inside this surface, and 90% of the electronic charge is contained within this surface, are equivalent.) The multiflash “photograph” sketched in Fig. 4-3c shows the electron probability density to be greatest at the nucleus. Suppose that we were just to take a single flash photograph. Then the electron would appear as a single dot. At what distance from the nucleus would this dot be most likely to occur? Surprisingly, the answer is not zero. Despite the fact that the probability density is a maximum at r =0, the probability for finding the electron in a volume element at the nucleus approaches zero. This is because the probability density, .2(r), is the measure of the probability per unit volume for the electron being at various distances from the nucleus. When we compare a tiny volume Section 4-1 The Schr ¨ odinger Equation and the Nature of Its Solutions 95 Figure 4-4  The volume-weighted probability density for the lowest-energy eigenfunction of the hydrogenlike ion. The most probable value of r occurs at rmp. element near the nucleus with an identical one farther out, we see from Fig. 4-3 that there is indeed more likelihood for the electron being in the volume element nearer the nucleus. But there are more volume elements associated with the larger distance. (The number of identical volume elements varies as the area of the surface of the sphere, 4pr2.) Hence, the probability for the electron being in a radial element dr at a given distance r from the nucleus is given by the number of volume elements at r times the probability density per unit volume element. The reason for the near-zero probability for finding the electron in a volume element at the nucleus is that the number of volume elements associated with r =0 is vanishingly small compared to the number associated with larger r values. Figure 4-4 is a graph of 4pr2.2, the volume-weighted probability density. It is apparent that the most probable value of r, rmp, occurs at a nonzero distance from the nucleus. [The reader is familiar with analogous distinctions. Rhode Island has a higher population density than Texas, but the population of Texas (density times area) is greater. Again, matter in the universe has a much higher mass density in stars and planets than in intergalactic gas or dust, but the total mass of the latter far exceeds that of the former due to the much greater volume of “empty” space.] EXAMPLE 4-3 Estimate, for the hydrogen atom in the 1s state, the amount of electronic charge located in a spherical shell that is 1.00 pm thick and which has a radius of 60.0 pm. SOLUTION  Recall from Example 4-1 that, for relatively thin shells like this, we can estimate the volume of the shell by taking its area times its thickness. Also, we expect the charge density, .2, to change very little over the 1.00 pm thickness of the shell, so we can take its value at 60.0 pm as constant. Then charge density = .2(r = 60.0pm) = (pa3 0)-1 exp(-2(60.0pm)/a0). Recalling that a0 = 52.9 pm, this gives 0.0329a-3 0 . This gives density per cubic bohr. To proceed, we can either convert this to density per cubic picometer by dividing by (52.9pm/bohr)3 or by converting r to bohr, by dividing by 52.9pm/bohr.We choose the 96 Chapter 4 The Hydrogenlike Ion, Angular Momentum, and the Rigid Rotor latter. Then r =60.0pm/52.9pm/bohr =1.13a0, and r = 1.00pm/52.9pm/bohr = 0.0189a0, so volume of shell = 4pr2r = 4p(1.13a0)2(0.0189a0) = 0.3028a3 0. Volume times density =(0.3028a3 0)(0.0329a-3 0 )=0.010. So 1% of the electronic charge resides in this shell.  We can calculate the value of rmp by finding which r value gives the maximum value of 4pr2.2. Recall that we can do this by finding the value of r that causes the first derivative of 4pr2.2 to vanish, that is, we require (d/dr)[4pr2p-1(Z/a0)3 exp(-2Zr/a0)]=0 (4-10) This gives constants · [2r -(2Zr2/a0)]exp(-2Zr/a0)=0 (4-11) The term in brackets vanishes when r = a0/Z, so this is the value of rmp. For Z =1, rmp =a0; a0 is the most probable distance of the electron from the nucleus in the hydrogen atom. For the He+ ion (Z =2), the most probable distance is only half as great, consistent with a more contracted charge cloud. Of more interest, often, is the average value of the distance of the electron from the nucleus. If we could sample the instantaneous distance of the electron from the nucleus a large number of times and calculate the average value, what sort of result would we obtain? The probability for finding the electron at any given distance r is given by the volume-weighted probability density of Fig. 4-4. Inspection of that figure suggests that the average value of the position of the electron ¯r is greater than rmp, the most probable value. But exactly how much bigger is ¯r than rmp? How should we compute the average value? The reader is familiar with the way an average test score is calculated from a collection of scores. For example, suppose the series of scores to be averaged is 0, 2, 6, 6, 7, 7, 7, 10, and that 0 and 10 are the minimum and maximum possible scores. The average is given by average = sum of scores number of scores = 0+2+6+6+7+7+7+10 8 = 45 8 (4-12) Another way to write this is average = frequency of score×score sum of frequencies = 1 · 0+0 · 1+1 · 2+0 · 3+0 · 4+0 · 5+2 · 6+3 · 7+0 · 8+0 · 9+1 · 10 1+0+1+0+0+0+2+3+0+0+1 or average= 10 i=0 (frequency of i) · i 10 i=0 (frequency of i) (4-13) The same idea is used to compute a quantum-mechanical average. For the average value of r we take each possible value of r times its frequency (given by .2 dv) and sum Section 4-1 The Schr ¨ odinger Equation and the Nature of Its Solutions 97 over all these values.3 For a continuous variable like r, we must resort to integration to accomplish this. We divide by the “sum” of frequencies by dividing by  .2 dv. Thus ¯r =  all space r.2dv  all space .2dv =  2p 0 df  p 0 sin. d. 8 0 r.2r2dr  all space .2dv (4-14) The denominator is unity since . is normalized. The integrals over . and f involve parts of the volume element dv, and not .2, because this wavefunction (4-9) does not depend on . or f. Continuing, ¯r = f|2p 0 ·-cos.|p0 · p-1 (Z/a0)3 8 0 r3 exp (-2Zr/a0) dr (4-15) Utilizing the information in Appendix 1 for the integral over r, this becomes ¯r = 2p[-(-1)+1](1/p)(Z/a0)33! (2Z/a0)4 (4-16) = 4p · (1/p)(Z/a0)3 · 6a4 0/16Z4 = 3a0 2Z (4-17) (It is useful to remember that integration over the f and . parts of dv gives 4p as the result if no other angle-dependent functions occur in the integral.) Comparing (4-17) with our expression for rmp indicates that ¯r is 1.5 times greater than rmp. Notice that the lowest-energy eigenfunction is finite at r =0 even though V is infinite there. This is allowed by our arguments in Chapter 2 because the infinity in V occurs at only one point, so it can be cancelled by a discontinuity in the derivative of .. This is possible only if . has a corner or cusp at r =0 (see Fig. 4-3a and e). 4-1.D Quantum Numbers and Nomenclature There are three quantum numbers, n, l, andm(all integers), characterizing each solution of the Schr¨odinger equation (4-6). Of these, only n enters the energy formula (4-8), so all solutions having the same values of n but different values of l and m will be energetically degenerate. As is shown in following sections, these quantum numbers are related in their allowed values. The l quantum number must be nonnegative and smaller than n. The m number may be positive, negative or zero, but its absolute value cannot exceed l. Thus, l =0, 1, 2, . . . ,n-1 (4-18) |m|=l (4-19) For the lowest-energy wavefunction we have already described, n=1, so l =m=0. No other choices are possible, so this level is nondegenerate. The convention (from atomic spectroscopy) is to refer to an l =0 solution as an “s function,” or “s orbital.” (For l =0, 1, 2, 3, 4, 5, the spectroscopic designation goes s, p, d, f, g, h.) Because n equals unity, the wavefunction is labeled 1s. 3The 4pr2 part of 4pr2.2 in Fig. 4-4 is included in dv, as will be seen shortly. 98 Chapter 4 The Hydrogenlike Ion, Angular Momentum, and the Rigid Rotor When n=2, there are four sets of l and m quantum numbers satisfying rules (4-18) and (4-19). They are listed below with their spectroscopic labels: l =0,m=0 2s l =1,m=0 2p0 l =1,m=-1 2p- 1 l =1,m=+1 2p+ 1 (4-20) Extending these rules to the n=3 energy level produces nine functions designated 3s, 3p-1, 3p0, 3p+1, 3d-2, 3d-1, 3d0, 3d+1, 3d+2. In general, the degeneracy of the energy level characterized by n is n2. EXAMPLE 4-4 Explain how it comes about from the quantum number rules that the degeneracy equals n2. SOLUTION  The rules indicate that, for each value of n, there are n values of l (e.g., n = 1, l = 0;n = 2, l = 0 and 1, . . . ), and each value of l is associated with 2l +1 ml values (e.g., l = 2, ml =-2,-1, 0, 1, 2, which is five values). Note that 2l + 1 must be an odd number of member states, and the odd number within each set keeps increasing as l increases. Thus, for n=4, l =0, 1, 2, 3, leading to degenerate sets of states, respectively having 1, 3, 5, and 7 members. But a sequence of n odd numbers, starting from 1, is always equal to n2. QED  4-1.E Nature of the Higher-Energy Solutions The second energy level is associated with the 2s, 2p-1, 2p0 and 2p+1 orbitals. The wavefunction for the 2s state is .2s = 1 4v2p  Z a0 3/2 2- Zr a0 exp-Zr 2a0 (4-21) Since this is a function of r only, it is a spherically symmetric function. (In fact, all s orbitals are spherically symmetric.) The 2s orbital is more “spread out” than the 1s orbital because the exponential in .2s decays more slowly and because the exponential is multiplied by Zr/a0 (the 2 becomes negligible compared to Zr/a0 at large r). As a result, the charge cloud associated with the 2s orbital is more diffuse. (For this reason, when we approximate a polyelectronic atom like beryllium by putting electrons in 1s and 2s orbitals the 2s electrons are referred to as “outer” and the 1s electrons are called “inner.”) At small values of r, (2-Zr/a0) is positive, and at large distances it is negative, so .2s has a spherical radial node (a zero in the r coordinate). In Fig. 4-5 the first three s orbitals are plotted. We see that the nth s orbital has (n-1) spherical nodal surfaces dividing regions where the wavefunctions have different sign. The appearance of more and more nodes in the radial coordinate as the energy increases is certainly familiar from previous examples. Notice how the wavefunctions oscillate most rapidly and nodes are most closely spaced in the regions near the nucleus where the electron classically would have its greatest kinetic energy. The 2s orbital is orthogonal to the 1s orbital, and also to all higher s orbitals. This would not be possible if there were no radial nodes. The product .1s.2s will vanish upon integration only if it either vanishes everywhere or else has positive and negative regions that cancel on integration. Since .1s and .2s are almost everywhere finite, the Section 4-1 The Schr ¨ odinger Equation and the Nature of Its Solutions 99 former condition does not occur. Since .1s has the same sign everywhere, their product can have positive and negative regions only if .2s has positive and negative regions, and hence, a node. Let us now consider the 2p functions. They are4 .2p0 = 1 4v2p  Z a0 3/2 Zr a0 exp-Zr 2a0 cos . (4-22) .2p±1 = ± 1 8vp  Z a0 3/2 Zr a0 exp-Zr 2a0 sin . exp (±if) (4-23) All of these functions have the same radial exponential decay as the 2s orbital, so we can say that the 2s and 2p orbitals are about equal in size. However, since the 2p orbitals contain the factor Zr/a0 where the 2s contains (2-Zr/a0), the 2p orbitals vanish at the nucleus and not at any intermediate r value; they have no radial nodes. The 2p orbitals are endowed with directional properties by their angular dependences. The 2p0 orbital is particularly easy to understand because the factors r cos . behave exactly like the z Cartesian coordinate. Hence, we can rewrite 2p0 (also called 2pz ) in mixed coordinates as .2pz = 1 4v2p  Z a0 5/2 z exp-Zr 2a0 (4-24) 0 0 0 0 0 r r 0 r 0 r r r 0 0 . . 4pr 2 1s 1s .2s .3s 2 . 4pr 2 2s 2 . 4pr 2 3s 2 Figure 4-5  s wavefunctions versus r and volume-weighted electron densities versus r for the hydrogenlike ion. 4The ± factor in .2p±1 results from a phase factor that is omitted from many textbooks. It has no effect on our discussion here, but is consistent with an implicit choice of sign for certain integrals appearing in Appendix 4. See, e.g., Zare [4, Chapter 1]. 100 Chapter 4 The Hydrogenlike Ion, Angular Momentum, and the Rigid Rotor (Be careful not to confuse the atomic number Z with the coordinate z.) The exponential term in Eq. (4-24) is spherically symmetric, resembling a diffuse 1s orbital. The function z vanishes in the xy plane and becomes increasingly positive or negative as we move away from the plane in either direction. As a result, .2pz looks as sketched in Fig. 4-6. It has nearly spherical lobes, one with positive phase and one with negative phase. When .2pz is squared, the contour lines remain unchanged in relative position, but the function becomes everywhere positive in sign. The 2p±1 orbitals are more difficult to visualize since they are complex functions. The charge distributions associated with these orbitals must be real, however. These are given by .*., where .* is the complex conjugate of .. (Recall that, for complex wavefunctions, .*. must be used for probability distributions, rather than .2.) One obtains the complex conjugate of a function by merely reversing the signs of all the i’s in the function. It is evident from Eq. (4-23) that .*2p+1 =-.2p-1 and .*2p-1 =-.2p+1 . Hence .*2p+1 .2p+1 =.*2p-1 .2p-1 =-.2p+1.2p-1 : both the 2p+1 and the 2p-1 orbitals give the same charge distribution. This distribution is the same as that for the 2pz orbital except that the angle dependence is 1 2 sin2 . instead of cos2 .. However, since sin2 . +cos2 . =1, it follows that the sum of 2p+1 and 2p-1 charge clouds must be such Figure 4-6  (a) Drawing of the 2pz orbital. (b) Drawing of the square of the 2pz orbital. (c) Drawing of .2 2pz +.*2p-1 .2p-1 +.*2p+1 .2p+1 = spherically symmetric distribution. The curved lines in (c) are a visualization aid and are not mathematically significant. Section 4-1 The Schr ¨ odinger Equation and the Nature of Its Solutions 101 that, when added to the charge cloud for 2p0 a spherical charge cloud results (since the angular dependence is removed). We already know that the 2pz distribution is dumbbell shaped, so it follows that 2p+1 and 2p-1 produce doughnut-shaped distributions. (A sphere minus a dumbbell equals a doughnut. See Fig. 4-6.) The shape can also be inferred from the sin2 .-dependence, which is a maximum in the xy plane. When solving the particle-in-a-ring problem, we saw that we could arrive at either a set of real trigonometric solutions or a set of complex exponential solutions. Since the f dependence of the 2p+1 orbitals is identical to that for m=±1 solutions of the particle in the ring, the same situation holds here. Because .2p+1 and .2p-1 are energetically degenerate eigenfunctions, any linear combination of them is also an eigenfunction of the hamiltonian (Problem 2-11). Therefore, let us find linear combinations that are entirely real. The complex part of .2p±1 , exp(±if), satisfies the relation exp(±if)=cosf ±i sin f (4-25) so that exp(+if)+exp(-if)=2 cosf (4-26) and i-1[exp(+if)-exp(-if)]=2 sin f (4-27) Thus, we have two linear combinations of exp(±if) that are real. It follows immediately that .2px = 1 v2 .2p-1 -.2p+1= 1 4v2p  Z a0 3/2 Zr a0 exp-Zr 2a0 sin . cosf (4-28) .2py = iv 2 .2p-1 +.2p+1= 1 4v2p  Z a0 3/2 Zr a0 exp-Zr 2a0 sin . sin f (4-29) where the factor 2-1/2 is used to maintain normality. Since r sin . cos f and r sin . sin f are equivalent to the Cartesian coordinates x and y, respectively, Eqs. (4-28) and (4-29) are commonly referred to as the 2px and 2py orbitals. They are exactly like the 2pz orbital except that they are oriented along the x and y axes (merely replace the z in Eq. (4-24) with x or y). The 2s, 2px, 2py, and 2pz orbitals are all orthogonal to one another. This is easily shown from symmetry considerations. Each 2p orbital is antisymmetric for reflection in its nodal plane, whereas 2s is symmetric for all reflections. Hence, the product .2s.2p is always antisymmetric with respect to some reflection so its integral vanishes. The 2p functions are mutually orthogonal because, if one 2p orbital is antisymmetric for some reflection, the others are always symmetric for that reflection. Hence, the product is antisymmetric for that reflection. Another way to show that the 2p orbitals are mutually orthogonal is to note that they behave like x, y, and z vectors and that these 102 Chapter 4 The Hydrogenlike Ion, Angular Momentum, and the Rigid Rotor vectors are orthogonal (i.e., perpendicular; orthogonality in functions is equivalent to perpendicularity in vectors). Sometimes the orthogonality of functions is most clearly seen if we sketch out the product and note whether the positive and negative regions cancel by symmetry (Fig. 4-7). The n = 3 level has nine solutions associated with it. The 3s orbital, plotted in Fig. 4-5, has one more node than the 2s orbital and is more diffuse. The 3p orbitals have the same angular terms as did the 2p orbitals so they can be written as real functions having the same directional properties as x, y and z vectors. The 3p orbitals differ from the 2p orbitals in that they possess a radial node and are more diffuse (see Fig. 4-8). The remaining five levels, 3d levels, may also be written in either complex or real form. The real orbitals are given by the formulas 3dz2 = 3dx2-y2 = 3dxy = 3dxz = 3dyz = ....... ...... 2 v2592p  Z a0 3/2 2Zr 3a0 2 exp-Zr 3a0 ........ ....... 1/v3 3 cos2 . -1 sin2 . cos 2f sin2 . sin 2f sin 2. cosf sin 2. sinf (4-30) These angular factors, times r2, have directional properties identical to the Cartesian subscripts on the left, except that 3dz2 is a shorthand for 3d3z2-r2 . These orbitals are sketched in Fig. 4-8. It is obvious from these figures that 3dx2-y2 has the same symmetry and orientation as the sum of the two vectors x2 and -y2, and that the other 3d orbitals have a similar connection with the notation (except for 3dz2 ). The 3d functions are Figure 4-7  Drawings of orbitals and their products to demonstrate orthogonality. Section 4-1 The Schr ¨ odinger Equation and the Nature of Its Solutions 103 Figure 4-8  Some hydrogenlike orbitals at the n=3 level. about the same size as the 3p and 3s functions, but have no radial nodes at intermediate r values. A general pattern emerges when we examine the nodal properties of the orbitals at various energies. At the lowest energy we have no nodes and the level is nondegenerate. At the next level, we find that each function possesses a single node. There is oneway to put in a radial node and so we get one 2s orbital. Or we can put in a planar node. But we 104 Chapter 4 The Hydrogenlike Ion, Angular Momentum, and the Rigid Rotor have three choices for orthogonal orientations of this plane leading to three independent p orbitals. At the n=3 level we find orbitals containing two nodes. The possibilities are: two radial nodes (3s), a radial node and a planar node (3px, 3py, 3pz ), two planar nodes (3dxy, 3dxz, 3dyz, 3dx2-y2 , 3dz2-x2 , 3dz2-y2 ). (But, since z2-y2-(z2-x2)= x2 - y2, the last three 3d orbitals are not linearly independent. Hence the last two are combined to form 3dz2 : z2 - x2 + z2 - y2 = 3z2 - (x2 + y2 + z2) = 3z2 - r2. This function can be seen to correspond to a positive dumbbell encircled by a small negative doughnut, or “belly band.” The two nodes in this orbital arising from nodes in . are conical surfaces rather than planes.) It is apparent that the degeneracies between various 2p orbitals, or 3d orbitals, are spatial degeneracies, due only to the physical equivalence of various directions in space. The degeneracy between 2s and 2p, or 3s, 3p, and 3d is not due to spatial symmetry. The fact that an angular node is energetically equivalent to a radial node is a peculiarity of the particular potential (-Ze2/4pe0r) for this problem. This degeneracy is removed for noncoulombic central-field potentials. The eigenfunctions corresponding to states in the energy continuum, like the bound states, can be separated into radial and angular parts. The radial parts of the spherically symmetric eigenfunctions at two nonnegative energies are given in Fig. 4-9. Note that the rate of oscillation of these functions is greatest at the nucleus, where the local kinetic energy is largest, in accord with the ideas presented in Chapters 1 and 2. Unbound-state Figure 4-9  Radial part of unbound H-atom states (times r) versus r at two energies: (a) E = 13.6eV; (b) E =0. Section 4-2 Separation of Variables 105 wavefunctions are not used in most quantum chemical applications, so we will not discuss them further in this book. EXAMPLE 4-5 Anodal plane is one through which awavefunction is antisymmetric for reflection. Consider the 3dxy and 3dyz orbitals of Fig. 4-8. Through which planes do these two orbitals have different reflection symmetries? SOLUTION  3dxy is antisymmetric for reflection only through the x, z and y, z planes. 3dyz is antisymmetric for reflection only through the x, z and x,y planes. Hence, these orbitals differ in their symmetries for reflection through the y, z and x,y planes.  4-2 Separation of Variables We shall indicate in some detail the way in which the Schr¨odinger equation (4-6) is solved. Recall the strategy of separating variables which we used in Section 2-7: 1. Express . as a product of functions, each depending on only one variable. 2. Substitute this product into the Schr¨odinger equation and try to manipulate it so that the equation becomes a sum of terms, each depending on a single variable. These terms must sum to a constant. 3. Since terms for different variables are independent of each other, the terms for each variable must equal a constant. This enables one to set up an equation in each variable. If this can be done, the initial assumption (1) is justified. In this case we begin by assuming that .(r, .,f)=R(r) (.) (f) (4-31) Substituting into Eq. (4-6) gives -h2 8p2µr2  d dr r2dR dr +R 1 sin . d d. sin . d d. +R 1 sin2 . d2 df2  - Ze2 4pe0r R =ER (4-32) Since each derivative operator now acts on a function of a single coordinate, we use total, rather than partial, derivative notation. Let us first see if we can isolate the f dependence. Multiplying Eq. (4-32) by (-8pµr2 sin2 ./h2R ) and rearranging gives sin2 . R d dr r2dR dr + 8p2µr2 sin2 . h2 E + Ze2 4pe0r + sin . d d. sin . d d. + 1 d2 df2 =0 (4-33) The r and . dependence is still mixed in the first two terms, but we now have a rather simple term in the coordinate f. Now we can argue, as in Section 2-7, that, as f alone 106 Chapter 4 The Hydrogenlike Ion, Angular Momentum, and the Rigid Rotor changes, the first three terms in Eq. (4-33) do not change. That is, if only f changes, Eq. (4-33) may be written constant+constant+constant+(1/ )(d2 /df2)=0 (4-34) so that (1/ )(d2 /df2)=-m2 (a constant) (4-35) We call the constant -m2 for future mathematical convenience. We can rearrange Eq. (4-35) into the more familiar form for an eigenvalue equation: d2 /df2=-m2 (4-36) We arrived at Eq. (4-36) by assuming that only f changes while r and . are constant. However, it should be obvious that the behavior of the term in f is uninfluenced by changes in r and . since it has no dependence on these coordinates. Thus, by establishing that this term is constant under certain circumstances, we have actually shown that it must be constant under all circumstances, and we have produced an eigenvalue equation for . We can now proceed with further separation of variables. Since we know that the last term in Eq. (4-33) is a constant, we can write 1 R d dr r2dR dr + 8p2µr2 h2 E + Ze2 4pe0r + 1 sin . d d. sin . d d. - m2 sin2 . =0 (4-37) Note that we have separated the . and r dependences by dividing through by sin2 .. We now have two terms wholly dependent on r and two wholly dependent on ., their sum being zero. Hence, as before, the sum of the two r-dependent terms must equal a constant, ß, and the sum of the .-dependent terms must equal -ß. Thus d dr r2dR dr + 8p2µr2 h2 E + Ze2 4pe0r R = ßR (4-38) 1 sin . d d. sin . d d. - m2 sin2 . = -ß (4-39) where we have multiplied through by R in the first equation and by in the second. The assumption that . =R has led to separate equations for R, , and . This indicates that the assumption of separability was valid. However, there is some linkage between R and via ß, and between and via m. 4-3 Solution of the R, , and  Equations 4-3.A The  Equation The solution of Eq. (4-36) is similar to that of the particle in a ring problem of Section 2-6. The normalized solutions are =(1/v2p)exp(imf), m=0,±1,±2, . . . (4-40) As shown in Section 2-6, the constant m must be an integer if is to be a single-valued function. Section 4-3 Solution of the R, , and  Equations 107 4-3.B The  Equation There is great similarity between the mathematical techniques used in solving the R and equations and those used to solve the one-dimensional harmonic oscillator problem of Chapter 3. Hence, we will only summarize the steps involved in these solutions and make a few remarks about the results. More detailed treatments are presented in many texts.5 The equation can be solved as follows: 1. Change the variable to obtain a more convenient form for the differential equation. 2. Express the solution as a power series and obtain a recursion relation. 3. Observe that the series diverges for certain values of the variables, producing nonsquare-integrable wavefunctions. Correct this by requiring that the series terminate. This requires that the truncated series be either symmetric or antisymmetric in the variable and also that ß of Eq. (4-38) and (4-39) be equal to l(l +1) with l an integer. 4. Recognize these truncated series as being the associated Legendre functions. 5. Return to the original variable to obtain an expression for in terms of the starting coordinate. Reference to the end of Section 3-4 will illustrate the similarity between this and the harmonic oscillator case. The final result for m0 is l,m(.)=(-1)m (2l +1) 2 (l -|m|)! (l +|m|)!  1/2 P|m| l (cos .) (4-41) For m<0 the phase factor (-1)m should be omitted.6 The term in square brackets is a normalizing function, and P|m| l (cos .) represents some member of the series of associated Legendre functions. When m = 0, these become the ordinary Legendre polynomials. The first few ordinary Legendre polynomials are P0(x) = 1, P1(x)=x, P2(x)= 1 2 (3x2 -1) P3(x) = 1 2 (5x3 -3x) (4-42) The first few associated Legendre functions are P1 1 (x)=(1-x2)1/2, P1 2 (x)=3(1-x2)1/2x, P2 2 (x)=3(1-x2), P1 3 (x)= 3 2 (1-x2)1/2(5x2 -1), P2 3 (x)=15(1-x2)x, P3 3 (x)=15(1-x2)3/2 (4-43) It is also true that P|m| l (x)=0 if |m|>l (4-44) 5See, e.g., Pauling andWilson [3, Chapter 5]. 6This is the same phase factor that we saw earlier for .2p±1 . 108 Chapter 4 The Hydrogenlike Ion, Angular Momentum, and the Rigid Rotor Thus, (.), and hence .(r, .,f), vanishes unless |m|=l, giving us one of our quantum number rules [Eq. (4-19)]. The associated Legendre functions satisfy an orthogonality relation: +1 -1 P|m| l (x)P |m| l (x) dx = 2 (2l +1) (l +|m|)! (l -|m|)! dll (4-45) For a further discussion of these functions, the reader should consult a more advanced text on quantum mechanics. 4-3.C The R Equation The R equation can be solved as follows: 1. Assume thatE is negative (this restricts us to bound states), and note that ß =l(l +1) from the previous solving of the equation. 2. Change variables for mathematical convenience. 3. Find the asymptotic solution pertaining to the large r limit, where the R equation becomes simplified. 4. Express the wavefunction as a product of the asymptotic solution and an unknown function. Express this unknown function as a power series and (after dealing with some singularities) obtain a recursion relation. 5. Note that the power series overpowers the asymptotic part of the solution unless the series is truncated. This leads to the requirement that n be an integer and hence that E be quantized. It also requires that n>l. 6. Recognize the truncated series to be associated Laguerre polynomials times .l , where . is defined below. The resulting solution is, if µ=me, Rnl =-  2Z na0 3 (n-l -1)! 2n [(n+l)!]3  1/2 exp (-./2) .lL 2l+1 n+l (.) (4-46) where .=2Zr/na0 and a0=0h2/pmee2=5.2917706×10-11 m. The term in brackets is a normalizing function. The exponential term is the asymptotic solution and it guarantees that R(r) will approach zero as r approaches infinity. The third term, .l, is produced in the course of removing singularities (i.e., places where parts of a differential equation become infinite). The last term, L(.), symbolizes the various members Section 4-4 Atomic Units 109 of the set of associated Laguerre polynomials. Like the Legendre functions, these are mathematically well characterized. A few of the low-index associated Laguerre polynomials are L11 (.)=1, L12 (.)=2. -4, L13 (.)=-3.2 +18. -18, L33 (.)=-6 (4-47) 4-4 Atomic Units It is convenient to define a system of units that is more natural for working with atoms and molecules. The commonly accepted system of atomic units for some important quantities is summarized in Table 4-1. [Note: the symbol ¯h (“h-cross or h-bar”) is often used in place of h/2p.] Additional data on values of physical quantities, units, and conversion factors can be found in Appendix 10. In terms of these units, Schr¨odinger’s equation and its resulting eigenfunctions and eigenvalues for the hydrogenlike ion become much simpler to write down. Thus, the TABLE 4-1  Atomic Units Quantity Atomic unit in cgs or other units Values of some atomic properties in atomic units (a.u.) Mass me=9.109534×10-28 g Mass of electron = 1 a.u. Length a0=4pe0 ¯h2/mee2 Most probable distance of 1s =0.52917706×10-10m electron from nucleus ofH (=1 bohr) atom = 1 a.u Time t0=a0 ¯h/e2 Time for 1s electron in H =2.4189×10-17 s atom to travel one bohr =1 a.u. Charge e=4.803242×10-10 esu Charge of electron=-1 a.u. =1.6021892 ×10-19 coulomb Energy e2/4pe0a0=4.359814×10-18 J Total energy of 1s electron in (=27.21161eV=1 hartree) H atom =-1/2 a.u. Angular ¯h=h/2p Angular momentum for momentum =1.0545887×10-34 J s particle in ring = 0, 1, 2, . . . a.u. Electric field strength e/a2 0=5.1423×109 V/cm Electric field strength at distance of 1 bohr from proton = 1 a.u. 110 Chapter 4 The Hydrogenlike Ion, Angular Momentum, and the Rigid Rotor TABLE 4-2  Eigenfunctions for the Hydrogenlike Ion in Atomic Units Spectroscopic symbol Formula 1s (1/vp)Z3/2 exp(-Zr) 2s (1/4v2p)Z3/2(2-Zr) exp(-Zr/2) 2px (1/4v2p)Z5/2r exp(-Zr/2) sin . cosf 2py (1/4v2p)Z5/2r exp(-Zr/2) sin . sin f 2pz (1/4v2p)Z5/2r exp(-Zr/2) cos . 3s (1/81v3p)Z3/2(27-18Zr +2Z2r2) exp(-Zr/3) 3px (v2/81vp)Z5/2r(6-Zr) exp(-Zr/3) sin . cosf 3py (v2/81vp)Z5/2r(6-Zr) exp(-Zr/3) sin . sin f 3pz (v2/81vp)Z5/2r(6-Zr) exp(-Zr/3) cos . 3dz2 (=3d3z2-r2) (1/81v6p)Z7/2r2 exp(-Zr/3)(3 cos2 . -1) 3dx2-y2 (1/81v2p)Z7/2r2 exp(-Zr/3) sin2 . cos 2f 3dxy (1/81v2p)Z7/2r2 exp(-Zr/3) sin2 . sin 2f 3dxz (1/81v2p)Z7/2r2 exp(-Zr/3) sin 2. cosf 3dyz (1/81v2p)Z7/2r2 exp(-Zr/3) sin 2. sin f Schr¨odinger equation in atomic units is (assuming infinite nuclear mass, so that µ=me) - 1 2.2 - Z r . =E. (4-48) The energies are En=- Z2 2n2 (4-49) The lowest-energy solution is .1s =Z3/p exp(-Zr) (4-50) The formulas for the hydrogenlike ion solutions (in atomic units) of most interest in quantum chemistry are listed in Table 4-2. The tabulated functions are all in real, rather than complex, form. Problems involving atomic orbitals are generally far easier to solve in atomic units. 4-5 Angular Momentum and Spherical Harmonics We have now discussed three problems in which a particle is free to move over the entire range of one or more coordinates with no change in potential. The first case was the free particle in one dimension. Here we found the eigenfunctions to be simple trigonometric or exponential functions of x. The trigonometric form is identical to the harmonic amplitude function of a standing wave in an infinitely long string. We might refer to such functions as “linear harmonics.” The second case was the particle-in-aring problem, which again has solutions that may be expressed either as sine-cosine or Section 4-5 Angular Momentum and Spherical Harmonics 111 exponential functions of the angle f. By analogy with linear motion, we could refer to these as “circular harmonics.” Finally, we have described the hydrogenlike ion, where the particle can move over the full ranges of . and f (i.e., over the surface of a sphere) with no change in potential. The solutions we have just described—the products l,m(.) m(f)—are called spherical harmonics and are commonly symbolized Yl,m(.,f). Thus for m0 Yl,m(.,f)=(-1)m (2l +1) 4p (l -|m|)! (l +|m|)!  1/2 P|m| l (cos .) exp (imf) (4-51) and for m<0 the factor (-1)m is omitted. Because so many physical systems have spherical symmetry, spherical harmonics are very important in classical and quantum mechanics. Closely linked with spherical harmonics is angular momentum. Angular momentum is an important physical property because it is conserved in an isolated dynamical system; it is a constant of motion for the system. Angular momentum is described by magnitude and direction, so it is a vector quantity.7 The classical system, in the absence of external forces, is constrained to move in such a way as to preserve both the direction and the magnitude of this vector. For a mass of m kg moving in a circular orbit of radius r m with an angular velocity of . radians per second, the angular momentum has magnitude mr2. kg m2/s (or, alternatively, joule seconds). The direction of the vector is given by the right-hand rule: the index finger of the right hand points along the particle trajectory and the extended thumb points along the angular momentum vector (see Fig. 4-10). (Alternatively, in a right-handed coordinate system, motion of a mass in the xy plane from +x toward +y produces angular momentum in the +z direction.) In vector notation, L=r×p, where L is angular momentum, r is the position vector, and p is the linear momentum. Some of the more interesting properties of angular momentum relate to the situation where circular motion occurs in the presence of an external field. A familiar example is a gyroscope mounted on a pivot and experiencing the gravitational field of the earth. The gyroscope flywheel is usually started with the gyroscope in an almost vertical position. After release, the gyroscope precesses about the axis of field direction. As time passes, the tilt of the gyroscope away from the field direction (which we take to be the z direction) increases (see Fig. 4-11). If there were no friction in the bearings, the angle of tilt would not change, and the gyroscope would precess about z indefinitely, maintaining whatever angle of tilt it found itself with initially. Notice that, in such Figure 4-10  The angular momentum vector L for a particle of mass m moving with angular velocity . about a circular orbit of radius r in the direction indicated. 7Strictly speaking, angular momentum is a pseudovector—it is dual to a second order antisymmetric tensor. However, for the remainder of this book, we can and shall ignore this distinction. 112 Chapter 4 The Hydrogenlike Ion, Angular Momentum, and the Rigid Rotor a case, the angular momentum due to the flywheel, Lf, is conserved in magnitude only. Its direction is constantly changing. Thus, Lf is not a constant of motion in the presence of a z-directed field. Neither are the components Lfx and Lfy , which change in magnitude as the gyroscope precesses. However, Lfz is a constant of motion if the angle of tilt does not change. If we add on to Lf the angular momentum Lg due to the precession of the gyroscope as a whole (including the center of mass of the flywheel but ignoring its rotation), we find that the total angular momentum for the gyroscope (including flywheel motion), L and its components Lx, Ly, and Lz behave similarly to Lf and its components (see Fig. 4-12). We may summarize these observations from classical physics as follows: a rotating rigid body conserves L (hence, Lx, Ly, Lz ) in the absence of external forces. In the presence of a z-directed, time-independent external force, Lz and |L|, the magnitude of L (but not its direction) are conserved. Furthermore, in a system comprising several moving parts, the total angular momentum is the sum of the individual angular momenta, and the z component is the sum of the individual z components: L = i Li (4-52) Lz = i Lzi (4-53) Many characteristics of the classical situation are maintained in quantum mechanics. In particular, it can be shown that a hydrogenlike ion eigenfunction can always be associated with “sharp” values for Lz , but not for Lx or Ly, and that the magnitude of L is sharp, but not its direction. We have indicated several times in this book that a sharp value (i.e., a constant of motion) exists when a state function is an eigenfunction for an operator associated with the property. For example, all of our hydrogenlike ion wavefunctions are eigenfunctions for the hamiltonian operator, so all are associated with sharp energies. We introduced the operator for the z-component of angular momentum, in Section 2-6, as (h/2pi)d/df. Generalizing to situations with several variables requires switching to partial derivative notation. Then our operator, Figure 4-11  A gyroscope with the angular momentum of the flywheel, Lf, together with x, y, and z components of Lf, at a given instant. Section 4-5 Angular Momentum and Spherical Harmonics 113 Figure 4-12  The total angular momentum of the gyroscope L is shown as the sum of Lf, the angular momentum of the flywheel, and Lg, the angular momentum of the gyroscope. L precesses, so only Lz and the magnitude of L are constants of motion. symbolized Lˆz , is (h¯/i)./.f. In atomic units, Lˆz is (1/i)./.f. This operator was introduced in Section 2-6, where it was given the symbol pf. (A general discussion on operators will be given in Chapter 6. The carat symbol is frequently used to denote an operator.) Our statement that hydrogenlike eigenfunctions have sharp Lz means that we expect Lˆz.n,l,m(r, .,f)= constant · .n,l,m(r, .,f). Since all these eigenfunctions have exp(imf) as their only f-dependent term, it follows immediately that Lˆz.n,l,m =mh¯.n,l,m (4-54) or, equivalently, LˆzYl,m (.,f)=mh¯Yl,m (.,f) (4-55) Hence, the quantum number m is equal to the z component of angular momentum in units of ¯h for the state in question. This means that the angular momentum associated with an s state (l = 0, so m = 0) has a zero z component, while a p state (l = 1, so m=-1, 0,+1) can have a z component of -¯h, 0, or ¯h. The other quantity that we have stated is conserved in these systems is the magnitude of L. In quantum mechanics, it is convenient to deal with the square of this magnitude L2. The quantum-mechanical operator associated with this quantity is Lˆ2 = -¯h2 (.2/..2)+cot .(./..)+(1/ sin2 .)(.2/.f2) = -¯h2 (1/ sin .)(./..) sin .(./..)+(1/ sin2 .)(.2/.f2) (4-56) The result of operating on Yl,m(.,f) with this operator is Lˆ2Yl,m (.,f)=l(l +1) ¯h2Yl,m (.,f) (4-57) This means that the square of the magnitude of the total angular momentum equals l(l+1) ¯h2. Hence, for an s state it is zero, for a p state it is 2¯h2, for a d state it is 6¯h2, etc. One can construct vector diagrams to parallel these relationships. A few of these are sketched in Fig. 4-13. Operators for Lˆx and Lˆy can also be constructed. They are Lˆx = ih¯[sin f(./..)+cot . cosf(./.f)] (4-58) Lˆy = -ih¯[cosf(./..)-cot . sin f(./.f)] (4-59) 114 Chapter 4 The Hydrogenlike Ion, Angular Momentum, and the Rigid Rotor Figure 4-13  Vector relationships that satisfy the following rules: L2 = l(l + 1),Lz = m, m=-l,-l +1, . . . , 0, . . . , l -1, l. The quantum rules correspond to a classical analog where the gyroscope can have only certain discrete angles of tilt. (a) s; l = 0; l(l + 1) = 0; m = 0. (b) p; l =1; l(l +1)=2; m=-1, 0,+1. (c) d; l =2;l(l +1)=6; m=-2,-1, 0,+1,+2. (all in atomic units) The hydrogenlike eigenfunctions are not necessarily eigenfunctions for either of these operators (Problem 4-23). It is interesting to consider the physical meaning of these results. If a quantity has a sharp value, it means that we will always get that value no matter when we measure that property for systems in the state being considered. Thus, the z component of angular momentum for hydrogen atoms in the 2p+1 state will always be measured to be +1¯h, or one atomic unit. For the x or y component, however, repeated measurements (on an ensemble of 2p+1 atoms) will yield a spread of values. We can measure (or compute) an average value of Lx or Ly but not a sharp value. In terms of our mental model (a gyroscope) this seems sensible enough except for one thing. Our hydrogenlike eigenfunctions are solutions for a central field potential with no external field. Under such conditions, L, Lx, Ly, and Lz are classically all constants of motion. Why, then, are they not all sharp quantum mechanically? The answer is that quantum-mechanical state functions never contain more information than is, in principle, extractable by measurement. To measure a component of angular momentum in a system always means, in practice, subjecting the system to some sort of external force. Furthermore, this system must obey the limitations decreed by the uncertainty principle. The hydrogenlike ion wavefunctions cannot simultaneously be eigenfunctions for Lˆx, Lˆy, and Lˆz because that would give simultaneous sharp values (i.e., no uncertainty) for the conjugate variables angular momentum vector length and angular momentum vector orientation. This would violate the uncertainty principle, which is in turn a reflection of limitations on our ability to measure one variable without affecting another (see Section 1-8). It is possible, working only with the quantum-mechanical operators, to generate the eigenvalues of Lˆz and Lˆ2. This approach is a deviation from our main line of Section 4-6 Angular Momentum and Magnetic Moment 115 development and is contained in Appendix 4. (It is recommended that Chapter 6 be completed before reading Appendix 4.) We give only the results. They are Lˆzfl,m = mh¯fl,m, m=-l,-l +1, . . . , l -1, l (4-60) Lˆ2fl,m = l (l +1) ¯h2fl,m (4-61) These look like the results already given in Eqs. (4-55) and (4-57). There is a difference, however. Here there is no indication that m is an integer, whereas in Eq. (4-55) m must be an integer, as indicated by the presence of zero in its value list. There are two ways in which we can have a sequence of the form -l,-l +1, . . . , l -1, l. One way is to have an integer series, for example, -2,-1, 0,+1,+2, which must contain zero. The other way is to have a half-integer series, for example, -3 2 ,-1 2 ,+1 2 ,+3 2 , which skips zero. If we work only with the properties of the operators, we find that either possibility is allowed. But if we assume that the as-yet-unspecified eigenfunctions fl,m are separable into .- and f-dependent parts, we find ourselves restricted to the integer series. For orbital angular momentum (due to motion of the electron in the atomic orbital), the z component must be (in atomic units) an integer, for we have seen that the state functions . contain the spherical harmonics Yl,m, which are indeed separable. Spin angular momentum for an electron (to be discussed in more detail in the next chapter), has half-integer z components of angular momentum, and the eigenfunctions corresponding to spin cannot be expressed with spherical harmonics. 4-6 Angular Momentum and Magnetic Moment If a charged particle is accelerated, a magnetic field is produced. Since circular motion of constant velocity requires acceleration (classically) it follows that a charged particle having angular momentum will also have a magnetic moment. The magnetic moment is proportional to the angular momentum, colinear with it, and oriented in the same direction if the charge is positive. For an electron, the magnetic moment is given by µ=-ßeL (4-62) where ße, the Bohr magneton, has a value of 9.274078×10-24 J T-1 (equal to 1 2 a.u.), where T is magnetic field strength in Tesla. (ße is defined to contain the ¯h that belongs to L, so it is only the vl(l +1) part of L that is used in the calculation.) EXAMPLE 4-6 What is the magnitude of the orbital magnetic moment for an electron in a 3d state of a hydrogen atom? In a 4d state of He+? SOLUTION  For any d state, l = 2, so, in a.u., |µ| = ßeL = ßevl(l +1) = v6ße = 2.27×10-23JT-1. (Since we want magnitude, we can ignore the minus sign.) The value does not depend on the quantum number n nor on atomic number Z, so it is the same for He+.  Applying a magnetic field of strength B defines a z-direction about which the magnetic moment vector precesses. The z-component, µz , of the precessing vector interacts with the applied field B. The interaction energy is E=-µzB =ßeLzB =ßemB (4-63) 116 Chapter 4 The Hydrogenlike Ion, Angular Momentum, and the Rigid Rotor This means that some of the degeneracies among the energy levels of the hydrogenlike ion will be removed by imposing an external magnetic field. For instance, the 2p+1 and 2p-1 energy levels will be raised and lowered in energy, respectively, while 2s and 2p0 will be unaffected (see Fig. 4-14). This, in turn, will affect the atomic spectrum for absorption or emission. The splitting of spectral lines due to the imposition of an external magnetic field is known as Zeeman splitting. Because the splitting of levels depicted in Fig. 4-14 is proportional to the z component of orbital angular momentum, given by m¯h, it is conventional to refer to m as the magnetic quantum number. In the absence of external fields, eigenfunctions having the same n but different l and m are degenerate. We have seen that this allows us to take linear combinations of eigenfunctions, thereby arriving at completely real eigenfunctions like 2px and 2py, instead of 2p+1 and 2p-1. When a magnetic field is imposed, the degeneracy no longer exists, and we are unable to perform such mixing. Under these conditions, 2px, 2py, 3dxy, etc. are not eigenfunctions, and we are restricted to the pure m=0,±1,±2 . . . type solutions. Thus far we have indicated that the stationary state functions for the hydrogenlike ions are eigenfunctions for Lˆ2 and Lˆz , and we have compared this to the fact that |L| and Lz are constants of motion for a frictionless gyroscope precessing about an external field axis. But howabout atoms with several electrons? And howabout molecules? Are their stationary state functions also eigenfunctions for Lˆ2 and Lˆz? A general approach to this kind of question is discussed in Chapter 6. For now we simply note that the spherical harmonics are eigenfunctions of Lˆ2 and Lˆz [Eqs. (4-55) and (4-57)] and that any state function of the form .(r, .,f)=R(r)Yl,m(.,f) will necessarily be an eigenfunction of these operators. But the spherical harmonics are solutions associated with spherically symmetric potentials. Therefore, it turns out that eigenfunctions of the time-independent hamiltonian operator are also eigenfunctions for Lˆ2 and Lˆz only if the potential is spherically symmetric. In the more restricted case in which . has Figure 4-14  Energy levels of a hydrogenlike ion in the absence and presence of a z-directed magnetic field. Section 4-7 Angular Momentum in Molecular Rotation—The Rigid Rotor 117 the form .(r, .,f)=f (r, .) exp(imf), . will still be an eigenfunction of Lˆz but not of Lˆ2. This situation applies in systems having cylindrically symmetric potentials, dependent on r and . but not f (e.g., H+2 , N2). We discuss such cases in more detail in Chapter 7. 4-7 Angular Momentum in Molecular Rotation—The Rigid Rotor We have seen that the two-particle system of an electron and a nucleus rotating about a center of mass (COM) can be transformed to the one-particle system of a reduced mass rotating about a fixed point. However, this transformation can be made for any two-mass system, and so it applies also to the case of the nuclei of a rotating diatomic molecule. As we now show, the mathematical outcome for the rotating diatomic molecule is strikingly similar to that for the hydrogenlike ion. The simplest treatment of molecular rotation ignores vibrational motion by assuming that the distance between the nuclei is fixed. The resulting model is therefore called the rigid-rotor model. Let there be two nuclear masses, m1 and m2, separated from the COM by distances r1 and r2, respectively. Then, because of the way that the COM is defined, we have that m1r1 =m2r2 (4-64) The moment of inertia, I, is I =m1r2 1 +m2r2 2 (4-65) It is not difficult to show (Problem 4-35) that the same moment of inertia results from a reduced mass µ rotating about a fixed point at a distance r =r1 +r2. That is, if µ= m1m2 m1 +m2 (4-66) then I =µr2 (4-67) Therefore, solving the problem of a reduced mass µ rotating about a fixed point at the fixed distance r =r1 +r2 is equivalent to solving the two-mass rigid-rotor problem. In effect, the rotating-diatomic problem is transformed to a particle-on-the-surface-of-asphere problem. As usual, we write the Schr¨odinger equation by starting with the general prescription [(-¯h2/2µ).2 +V ]. =E.. Then we recognize that V is constant over the spherical surface (corresponding to the diatomic molecule having no preferred orientation), so we can set V =0. Since r is a constant, the first term in .2 [Eq. (4-7)] vanishes due to the ./.r operators. The resulting Schr¨odinger equation is [using Eq. (4-67)] [(1/ sin .)(./..) sin .(./..)+(1/ sin2 .)(.2/.f2)].(.,f)=(-2IE/¯h2).(.,f) (4-68) 118 Chapter 4 The Hydrogenlike Ion, Angular Momentum, and the Rigid Rotor Equation (4-68) is the same as the , equations seen earlier [(4-36) and (4-39)] with ß = 2IE/¯h, and so we already know the eigenvalues and eigenfunctions. We noted earlier that ß =l(l +1). For molecular rotation it is conventional to symbolize the quantum number as J , rather than l. This leads to J(J +1)=2IE/¯h2 (4-69) or E =J(J +1) ¯h2/2I J=0, 1, 2, . . . (4-70) Since V =0, E is entirely kinetic energy. Because r is not a variable, there is no analog to the principal quantum number n, and J is not limited in its highest value. (Note also that atomic units, which are designed to simplify electronic problems, are not normally used for molecular rotation or vibration.) The eigenfunctions are, as before, the spherical harmonics YJ,mJ (.,f), with mJ = 0,±1,±2, . . . ,±J . Thus we have an s-type solution (J =0,mJ =0) that has constant value over the spherical surface, three p-type solutions (J =1,mJ =+1, 0,-1), five d-type solutions, etc. For each value of J , there are 2J +1 eigenfunctions. The s-type solution has zero energy. One can imagine that the reduced mass is motionless on the surface of the sphere and has equal probability for being found anywhere. This transforms back to a picture where the diatomic molecule is not rotating and where there is no preferred orientation. Since E = 0 when J = 0, we conclude that there is no zero-point energy for free rotation. (However, if rotation is restricted so that some orientations become preferred, the zero-point energy becomes finite.) The three p-type solutions are degenerate, hence can be mixed to give real functions analogous to px, py, pz . We could represent the pz function by taking a globe and marking a circular region of positive phase around the northern polar region, with a matching region of negative phase around the southern pole. Clearly, these rigid-rotor wavefunctions have the same symmetries as their hydrogenlike counterparts. A pz rotational state corresponds to a molecule (or ensemble of molecules) rotating with kinetic energy 2¯h2/2I (since J =1) with little likelihood of finding the reduced mass near the equator (i.e., with little probability of finding the diatomic molecule oriented nearly perpendicular to the z axis.) Angular momentum for the rigid rotor also follows the hydrogenlike system rules. The square of the total angular momentum equals J(J +1) ¯h2, and the z component equals mJ ¯h. Transitions between rotational energy levels can be detected spectroscopically. Once such energy differences are identified with specific changes in J , it becomes a simple matter to solve for r, the internuclear distance in the rigid rotor. For example, suppose an absorption peak for H81Br seen at 101.58 cm-1 is assigned to the J = 6 . 5 transition. Since the energies of these levels are 42¯h2/2I and 30¯h2/2I , respectively, their difference is 12¯h2/2I . This energy is equal to that of the photons of light at 101.58 cm-1. Solving this relation gives I = 3.3069×10-47 kg m2. We know that this equals µr2, and we know how to get µ from mH and mBr [Eq. (4-46)]. So we can solve for r, finding r = 141.44 pm (Problem 4-37). Section 4-8 Summary 119 4-8 Summary 1. The motion of two masses moving about a center of mass can be transformed to motion of a single reduced mass moving about a fixed point. The radius of rotation for the reduced mass is identical to the distance of separation of the original two masses. For hydrogenlike ions, the nuclear mass is so much greater than the electron mass that the reduced mass is almost identical to the electron mass. 2. The bound-state energies for time-independent states of the hydrogenlike ion depend on only the atomic number Z and the quantum number n (a positive integer) and vary as -Z2/2n2. This means that the energies get closer together as n increases and that there is an infinite number of such negative energy levels. Each such energy level has degeneracy n2. A continuum of energies exists for unbound (E>0) states. 3. Each stationary state wavefunction is characterized by three quantum numbers, n, l, and m, all integers, with l ranging from 0 to n - 1 and m ranging from -l to +l. If l =0, we have an s state and . is spherically symmetric with a cusp at the nucleus. If l =1, 2, . . . we have a p, d, . . . state, and . vanishes at the nucleus and is not spherically symmetric. In all states there is a finite probability for finding the electron beyond the classical turning point. 4. Eigenfunctions Rn,l(r) l,m(.) m(f) with m =0 are complex but can be mixed to form real eigenfunctions. However, if an external field causes states of different m to be nondegenerate, such mixing will produce noneigenfunctions. 5. All the stationary state eigenfunctions are orthogonal to each other, and radial and/or angular (usually planar) nodes are instrumental in this. The effect of a radial node on energy is the same as that of an angular node, so that all eigenfunctions with, say, three nodes (all radial, all angular, or a combination) are degenerate. This is peculiar to the -r-1 potential. 6. Separation of variables is not “perfectly clean” since the differential equations for R and (Eqs. (4-38) and (4-39)), are linked through ß and those for and (Eq. 4-40) are linked through m. This leads to interdependencies in the values of n, l, and m. 7. Spherical harmonics are the angular parts of solutions to Schr¨odinger equations for systems having spherically symmetric potentials. These functions are eigenfunctions of Lˆz and Lˆ2 as well as Hˆ , so such states have sharp values of Lz , L2, and E. The value of Lz is m¯h, and for L2 it is l(l +1) ¯h2, where l and m must be integers. In atomic units the quantity ¯h does not appear in these formulas. 8. The z component of the magnetic moment due to orbital motion of a charged particle is proportional to m¯h, and so m is called the magnetic quantum number. This magnetic moment contributes to the Zeeman splitting seen in spectra of hydrogenlike ions in magnetic fields. 9. Eigenfunctions other than spherical harmonics exist for Lˆ2 and Lˆz , but these are not separable into .- and f-dependent functions. In these cases, l and m can be halfintegers. These cases do not arise in orbital motion, but do arise in spin problems. 120 Chapter 4 The Hydrogenlike Ion, Angular Momentum, and the Rigid Rotor 10. If V is cylindrically symmetric [i.e., V =V (r,.)], the eigenfunctions of the hamiltonian are still eigenfunctions for Lˆz but not for Lˆ2. Hence, mh¯ still equals the z component of angular momentum for such a system. Here, z is the direction of the axis of rotational symmetry, i.e., the internuclear axis of the molecule. 11. The rules for allowed angular momentum magnitudes and orientations are the same for the rigid rotor as for the hydrogenlike ion. This leads to the following relationships in terms of the rotational quantum numbers J = 0, 1, 2, . . . , mJ = 0,±1, . . . ,±J : Length of angular momentum vector =vJ(J +1) ¯h. Component of angular momentum perpendicular to internuclear axis = mJ ¯h. Kinetic energy of rotation: TJ =J(J +1) ¯h2/2I. Degeneracy of level: gJ =2J +1. 4-8.A Problems 4-1. An observed spectroscopic transition in the hydrogen atom involves the 2p.1s transition. Using Eq. (4-8), evaluate this energy difference in units of hertz (Hz). (1 Hz=1 s-1.) Do the calculation using both me and µ. (See Appendix 10 for constants and conversion factors.) How much error in this calculation, in parts per million, is introduced by ignoring the finite mass of the nucleus (i.e., using me instead of µ)? 4-2. Sketch qualitatively, on the same r axis, .2(r) for a 1s state and 4pr2 for the variation of dv with r. Sketch the radial distribution function, which is the product of these two functions, and explain why it vanishes at r =0,8. 4-3. For a hydrogen atom in the 1s state, . =(1/vp)exp(-r), in atomic units. a) Calculate the value of r (in a.u.) at the classical turning point. b) Calculate the percentage of the electronic charge that is predicted to be beyond the classical turning point. (See Appendix 1 for useful integrals.) 4-4. Using atomic units, compute for a 1s electron of the hydrogenlike ion [. = (Z3/p) exp(-Zr)]. (See Appendix 1 for useful integrals.) a) the most probable distance of the electron from the nucleus, b) the average distance of the electron from the nucleus, c) the distance from the nucleus of maximum probability density, d) the average value of the potential energy (V =-Z/r). Note how these quantities depend on atomic number Z. Also, why do you think that, when Z=1, (d) is not minus the reciprocal of (b)? Why is (d) lower than minus the reciprocal of (b)? 4-5. Demonstrate by integration that the 1s and 2s orbitals of the hydrogen atom are orthogonal. (See Appendix 1 for useful integrals.) Section 4-8 Summary 121 4-6. Normalize the function r exp(-r) cos .. 4-7. Obtain the average value of position ¯x, for a particle moving in a one-dimensional harmonic oscillator potential in a state with the normalized wavefunction . =(ß/482p)1/4 (2ßx)3 -12ßx exp(-ßx2/2) [There is an easy way to do this problem.] 4-8. For a particle in a one-dimensional box with boundaries at x =0 and L and for any quantum number n: a) Show how you would set up the calculation for the mean square deviation of the particle from its average position. b) Explain qualitatively how you would expect the value of (a) to vary with quantum number n. c) Evaluate the expression from part (a) in terms of n and L. Calculate the value for n=1, 2. Discuss the relative values of these numbers for reasonableness. 4-9. Sketch the 2pz and the 3dxy wavefunctions. Demonstrate, without explicitly integrating, that these are orthogonal. 4-10. Repeat Problem 4-4, but for the 2p0 wavefunction. 4-11. Find an expression for the classical turning radius for a hydrogenlike ion in terms of n, l,m, and Z. 4-12. Show that the sum of the charge distributions of all five 3d orbitals is spherically symmetric. 4-13. Try to answer the following questions without looking up formulas or using pencil and paper. Use atomic units. a) What is the energy of the hydrogen atom in the 1s state? b) What is the energy of He+ when n=1? n=2? c) What is the degeneracy of the n=5 energy level of hydrogen? d) How many planar nodes does the 4dxz orbital have? How many radial nodes? e) What is the potential energy in a hydrogen atom when the electron is 0.50 a.u. from the nucleus? 4-14. You should be able to answer the following questions (use a.u.) in your head or with trivial calculations. An unnormalized eigenfunction for the hydrogen atom is . =(27-18r +2r2) exp(-r/3) a) What are the l and m quantum numbers for this state? b) How many radial nodes does this function possess? c) What is the energy of this state? d) What is the classical turning radius for this state? 122 Chapter 4 The Hydrogenlike Ion, Angular Momentum, and the Rigid Rotor 4-15. . =N(6r -r2) exp(-r/3) sin . sin f is an eigenfunction, in a.u., for the hydrogen atom hamiltonian. N is the normalizing constant. Without looking at formulas in the text, answer the following questions by inspection: a) Is there a node in the r coordinate? If so, where? b) Which state is this? (Give orbital symbol—e.g., 1s, 2pz , etc.) 4-16. Without comparing to tabulated formulas, state whether each of the following could reasonably be expected to be an eigenfunction (unnormalized) of the hamiltonian for a hydrogenlike ion. Explain why. a) (27-18Zr +2Z2r2) exp(-2Zr/3) b) r exp(Zr/2) sin . cosf c) r sin . exp(-if). 4-17. Calculate the average value of x for the 1s state of the hydrogen atom. Explain why your result is physically reasonable. 4-18. a) Calculate the most probable value of . in the 2pz state of the hydrogen atom. b) Calculate the values of . corresponding to nodal cones in the 3dz2 orbital. 4-19. Demonstrate that Eq. (4-30) for 3dxy has the same angular dependence as the function xy. 4-20. Verify Eq. (4-45) using P1(x) with P1(x) and with P3(x) [from Eq. (4-42)]. What does Eq. (4-45) imply for the integral over all space of .3p0.3d-1? 4-21. Using Eqs. (4-51) and (4-43), write the normalized spherical harmonic function Y3,-2(.,f). For which type of hydrogenlike AO does this function give the angular dependence? 4-22. Verify Eq. (4-55). 4-23. Test the 2p0 eigenfunction to see if it is an eigenfunction for Lˆx or Lˆy, [Eqs. (4-58) and (4-59)]. Show that the 1s function is an eigenfunction of Lˆx, Lˆy, Lˆz , and Lˆ2. Explain, in terms of the vector model, this seeming violation of the discussion in the text. 4-24. Work out the value of Lˆ2.2p0 by brute force and show that the result agrees with Eq. (4-57). 4-25. Sketch the vector diagram (as in Fig. 4-13) for the 4f orbitals of hydrogen. How does this compare to the diagram for the 6f orbitals of He+? 4-26. Sometimes eigenfunctions for an operator can be mixed together to produce new functions that are still eigenfunctions. Listed below are some operators with pairs of their eigenfunctions. Indicate in each case whether mixtures of these pairs will or will not continue to be eigenfunctions for the operator shown. (You should be able to do this by inspection, using your knowledge of these systems.) Section 4-8 Summary 123 Operator Eigenfunctions a) Hˆ (1 dim. box) .1 .3 b) Hˆ (ring) sin 3f cos 3f c) Lˆz (ang. mom.) exp(3if) exp(-3if) d) Hˆ (cubical box) .1,2,3 .2,2,2 e) Hˆ (H atom) .2s .2p0 f) Hˆ (H atom) .3s .4s 4-27. A hydrogenlike ion is in a state having a z-component of angular momentum equal to -2 a.u. a) What is the smallest possible value of the length of the angular momentum vector for this state? b) What symbol describes the state corresponding to your answer to part (a)? 4-28. Calculate the average angular momentum, L¯z , for a particle in a ring of constant potential having wavefunction a) (1/vp)sin 3f b) (1/v2p)exp(-3if) 4-29. Evaluate each of the following integrals. Look for labor-saving approaches. Integrals are over all space unless otherwise indicated. a)  .2px Lˆz.2pxdv b)  .2px Lˆ2.2pxdv c)  .2px Lˆx.2pxdv d)  .3dx2-y2.2pxdv e)  2p 0 exp (2if) exp (-3if) df 4-30. Evaluate each of the following in a.u. (You should be able to answer these by inspection.) Note: .n,l,m stands for a hydrogen atom eigenfunction of Hˆ . a) Lˆ2.3,2,1 b) Lˆ2.2px c) Hˆ .3px d) (1/i) (./.f).2p-1 4-31. For each of the following operators, indicate by “yes” or “no” whether .3px (with Z =1) is an eigenfunction. If it is, then also give the eigenvalue in a.u. a) -1 2.2 -1/r b) -1 2.2 -3/r c) Lˆz d) -1 2.2 e) Lˆx f) Lˆ2 124 Chapter 4 The Hydrogenlike Ion, Angular Momentum, and the Rigid Rotor g) r h) 1/r 4-32. For the 3s state of the hydrogen atom, estimate the amount of electronic charge between 211 pm and 213 pm. 4-33. Calculate in hertz the splitting between 2p0 and 2p+1 of a hydrogen atom by a magnetic field of 2 tesla. Compare this with the 2p.-1s transition energy (in parts per million). 4-34. Try to answer (by inspection) the following questions about the n=4 states of the hydrogen atom. a) What is the energy of the level, in a.u.? b) What is the degeneracy of the level? c) What values for the length of the orbital angular momentum vector (in a.u.) are possible? d) Into how many sublevels (of energy) is the n=4 level split by imposition of a magnetic field? e) What is the degeneracy of the unshifted portion of the sublevels referred to in (d)? 4-35. Show that the moment of inertia for two masses, m1 and m2, moving on a rigid massless bar about the center of mass at distances r1 and r2, respectively, is identical to that of a reduced mass µ=m1m2/(m1 +m2) moving about a point at a distance of r =r1 +r2. 4-36. Show that Eq. (4-68) can be written, using Lˆ2, in a form that is analogous to the classical relation for a freely rotating mass, L2/2I =T . 4-37. Using thatmH=1.0078 a.m.u. andmBr = 80.9163 a.m.u., calculate µ and verify the value of r given at the end of Section 4-7. 4-38. Assuming an internuclear distance of 127.5 pm in D35Cl, compute the expected positions in cm-1 for the absorption peaks corresponding to J =1.0, 2.1, 3.2 (D = 2.0141 a.m.u., 35Cl = 34.9688 a.m.u.). 4-39. The J =1.0 transition in 12C16O occurs at 3.86 cm-1. Calculate the internuclear distance. (12C = 12 a.m.u. by definition, 16O = 15.9949 a.m.u.) 4-40. HCl has a permanent electric dipole moment, which means that the reduced mass has a partial electric charge in the transformed version of the system. This in turn means that there is a magnetic moment vector parallel to the total angular momentum vector. Describe qualitatively what happens to the energies of the J =3 rotational states when HCl is subjected to a uniform magnetic field. 4-41. Uncertainty in position in one dimension, x, is defined as x2 - ¯x2 1/2 That is, it is the square root of the difference between the average squared position and the square of the average position. Calculate r for the 1s state of the hydrogen atom. Section 4-8 Summary 125 Multiple Choice Questions (Try to answer these without referring to the text or using pencil and paper.) 1. A particle on the surface of a sphere has quantum number J =7. The energy level to which this state belongs has a degeneracy of a) 56 b) 49 c) 42 d) 14 e) None of these. 2. A particle on the surface of a sphere in the state having J =4, mJ =4 a) has E =16¯h2/2I. b) has a z-component of angular momentum of 4¯h. c) doesn’t exist because this state violates quantum number rules. d) has a degeneracy of 20. e) None of the above is a true statement. 3. HI and DI are made to undergo the same transition (say J =11.-J =10, but the particular J values are not important). The light frequency inducing the transition for HI is equal to .. Approximately which frequency would you expect to induce the same transition for DI? a) 2. b) ./2 c) v2. d) ./v2 e) None of these. 4. The electronic energy of Li2+ in the 2s state is a) the same as that of H in the 1s state. b) nine times that of H in the 1s state. c) one-fourth that of H in the 1s state. d) four-ninths that of H in the 1s state. e) nine-fourths that of H in the 1s state. 5. Consider the following expressions, where . is a hydrogen-atom wavefunction. 1)  allspace .*r. dv 2) d dr.*.r2 sin . =0 3) d dr.*. =0 4)d dr. =0 Which one of the following is a true statement? a) Expression 1 is equal to unity if . is normalized. b) Expression 4 is true when r is the position of a radial node. c) Expression 3 is true everywhere because .*. is a constant. d) Expression 2 is true when r is at its most probable value. e) Expression 3 is true when r is at its most probable value. 126 Chapter 4 The Hydrogenlike Ion, Angular Momentum, and the Rigid Rotor 6. A 5dxy atomic orbital has a) 2 planar and 1 radial nodes. b) 2 planar and 3 radial nodes. c) 3 planar and 1 radial nodes. d) 5 nodes all together. e) None of the above is correct. 7. For a hydrogen atom in an n = 4 state, the maximum possible z-component of orbital angular momentum is a) 2¯h b) 3¯h c) v12¯h d) v6¯h e) None of the above is correct. 8. The following is an eigenfunction for the hydrogen atom: . =1/32pa3 0  (r/a0) exp(-r/2a0) cos . Which one of the following statements about . is true? a) The term on the left, up to the exponential, is the normalizing constant. b) This . has a nonzero value at the nucleus. c) For this state, l =1 and ml =0. d) This state has a spherical electron cloud distribution. e) None of the above statements is true. 9. The radial distribution function for a 1s state, 4pr2.2 1s , indicates that a) the most probable value of the distance from the nucleus is zero. b) the average value of r is zero. c) the average value of r is greater than the most probable value. d) the average value of r is equal to the most probable value. e) the electron cloud density per cubic picometer is greatest at a radius other than zero. References [1] H. Eyring, J. Walter, and E. D. Kimball, Quantum Chemistry, Chapter VI. Wiley, NewYork, 1944. [2] I. N. Levine, Quantum Chemistry, 5th ed. Prentice Hall, Upper Saddle River, New Jersey, 2000. [3] L. Pauling and E. B. Wilson, Jr., Introduction to Quantum Mechanics. McGraw- Hill, NewYork, 1935. [4] R. N. Zare, Angular Momentum.Wiley, NewYork, 1988. Chapter 5 Many-Electron Atoms 5-1 The Independent Electron Approximation In previous chapters we have dealt with the motion of a single particle in various potential fields. When we deal with more than one particle, new problems arise and new techniques are needed. Some of these are discussed in this chapter. In constructing the hamiltonian operator for a many electron atom, we shall assume a fixed nucleus and ignore the minor error introduced by using electron mass rather than reduced mass. There will be a kinetic energy operator for each electron and potential terms for the various electrostatic attractions and repulsions in the system. Assuming n electrons and an atomic number ofZ, the hamiltonian operator is (in atomic units) H(1, 2, 3, . . . ,n)=- 1 2 n  i=1 .2 i - n  i=1 (Z/ri)+ n-1  i=1 n  j=i+1 1 rij (5-1) The numbers in parentheses on the left-hand side of Eq. (5-1) symbolize the spatial coordinates of each of the n electrons. Thus, 1 stands for x1, y1, z1, or r1, .1, f1, etc. We shall use this notation frequently throughout this book. Since we are not here concerned with the quantum-mechanical description of the translational motion of the atom, there is no kinetic energy operator for the nucleus in Eq. (5-1). The index i refers to the electrons, so we see that Eq. (5-1) provides us with the desired kinetic energy operator for each electron, a nuclear electronic attraction term for each electron, and an interelectronic repulsion term for each distinct electron pair. (The summation indices guarantee that 1/r12 and 1/r21 will not both appear in H. This prevents counting the same physical interaction twice. The indices also prevent nonphysical self-repulsion terms, such as 1/r22, from occurring.) Frequently used alternative notations for the double summation in Eq. (5-1) are 1 2n i=j 1/rij , which counts each interaction twice and divides by two, and n i= 1 2 (-1 2 a.u.)+ 1 2 (-18 a.u.)=- 5 16 a.u., which has no time-dependence.  6-5 The Postulate Relating Measured Values to Eigenvalues The second postulate indicated that every observable variable of a system (such as position, momentum, velocity, energy, dipole moment) was associated with a hermitian operator. The connection between the observed value of a variable and the operator is given by Postulate IV Any result of a measurement of a dynamical variable is one of the eigenvalues of the corresponding operator. Any measurement always gives a real number, and so this postulate requires that eigenvalues of the appropriate operators be real. We will prove later that hermitian operators satisfy this requirement. If we measured the electronic energy of a hydrogen atom (the negative of its ionization energy), we could get any of the allowed eigenvalues (-1/2n2 a.u.) but no intermediate value. What if, instead, we measured the distance of the electron from the nucleus. By postulate II, the operator for this property is just the variable r itself. That is, ˆr =r. Hence, we need to consider the eigenvalues of r in the equation r d(r, .,f)=.d(r, .,f) (6-6) 170 Chapter 6 Postulates and Theorems of Quantum Mechanics where d is an eigenfunction and . is a real number (corresponding to the distance of the electron from the nucleus). We can rewrite this equation as (r -.)d(r, .,f)=0 (6-7) This form makes it more apparent that the function d must vanish at all points in space except those where r = .. But . is an eigenvalue of r and hence is a possible result of a measurement. Thus, we see that postulate IV implies some connection between a measurement of, say, r = 2 a.u. and an eigenfunction of r that is finite only at r =2 a.u. We symbolize this eigenfunction d(r -2 a.u.), this “delta function” being zero whenever the argument is not zero. If we measured the electron’s position to be at r =5.3 a.u., the corresponding eigenfunction would be d(r -5.3 a.u.)—a function that is zero everywhere except in a shell of infinitesimal thickness at r = 5.3 a.u. If instead we measured the point in space of the electron, rather than just the distance from the nucleus, and found it to be r0, .0, f0, then the corresponding eigenfunction of the position operator would be d(r -r0)d(. -.0)d(f -f0). This function vanishes everywhere except at r0, .0, f0. It is evident that any value of . from zero to infinity in Eq. (6-7) may be chosen without spoiling the ability of d to serve as an eigenfunction of r. This means that, unlike the energy measurement, the measurement of the distance of the electron from the nucleus can have any value. The eigenfunctions of the position operator are called Dirac delta functions. They are “spike” functions having infinitesimal width. They are normalized through the equation d (x -x0) dx =1 (6-8) where the integration range includes x0.3 On first acquaintance, these functions seem mathematically peculiar, but they make physical sense in the following way. One can interpret the actual measurement of position as a process that forces the particle to acquire a certain position at some instant. At that instant, .2 for the system (now perturbed by the measuring process) ought to give unit probability for finding the particle at that point (where it definitely is) and zero probability elsewhere, and this is just what the Dirac delta function does.4 Postulate IV, then, is in accord with a picture wherein the process of measurement forces the measured system into an eigenstate for the appropriate operator, giving the corresponding eigenvalue as the measurement. This definition of “measurement” is somewhat restrictive and can be deceptive. Often scientists refer to measurements that are really measurements of average values rather than eigenvalues. This point is discussed further below. 3The reader should avoid confusing the Dirac delta function d(x -x0) with the Kronecker delta di,j encountered earlier. They are similar in that both vanish unless x = x0 in the former and i = j in the latter. But they differ in that the value of di,j is definite (unity) while the value of d(x0 -x0) is not defined. The Dirac delta function has definite value only in integrated expressions like Eq. (6-8). The spin functions a and ß may be thought of as Dirac delta functions in the spin “coordinate” .. The Dirac delta function is admittedly unusual, and one tends to be uneasy with it at first. This function is important and useful in quantum mechanics. However, since we will make almost no use of it in this text, we will not develop the topic further. 4Notice that Eq. (6-8) does not involve d*d, but merely d. Because d is nonzero only at one point, d*d is likewise nonzero only at the same point. d and d*d are therefore not independent functions. It is convenient to view the d function as both the eigenfunction for the position operator and also as the probability distribution function for the particle. Section 6-7 Hermitian Operators 171 6-6 The Postulate for Average Values Suppose that we had somehow prepared a large number of hydrogen atoms so that they were all in the same, known, stationary state. Then we could measure the distance of the electron from the nucleus once in each atom and average these measurements to obtain an average value. We have already indicated that this average would be given by the sum of all the r values, each multiplied by its frequency of occurrence, which is given by .2 dv if . is normalized. Since r is a continuous variable, the sum becomes an integral. This is the content of Postulate V When a large number of identical systems have the same state function ., the expected average of measurements on the variable M (one measurement per system) is given by Mav = .*Mˆ . dt .*. dt (6-9) The denominator is unity if . is normalized. It is important to understand the distinction between average value and eigenvalue as they relate to measurements. A good example is the dipole moment. The dipole moment operator for a system of n charged particles is µ= n i=1 ziri where zi is the charge on the ith particle and ri is its position vector with respect to an arbitrary origin. (We get this by writing the classical formula and observing that momentum terms do not appear. Hence, the quantum-mechanical operator is the same as the classical expression.) What will the eigenfunctions and eigenvalues of µˆ be like? The charge zi is only a number, while ri is a position operator, which has Dirac delta functions as eigenfunctions. For a hydrogen atom, one eigenfunction of ri would be a delta function at r = 1 a.u., . = 0, f = 0. The corresponding eigenvalue for µˆ would be the dipole moment obtained when a proton and an electron are separated by 1 a.u., clearly a finite number. But “everybody knows” that an unperturbed atom in a stationary state has zero dipole moment. The difficulty is resolved when we recognize that measurement of a variable in postulates IV and V means measuring the value of a variable at a given instant. Hence, we must distinguish between the instantaneous dipole moment of an atom, which can have any value from among the eigenvalues of µˆ and the average dipole moment, which is zero for the atom. In everyday scientific discussion, the term “dipole moment” is usually understood to refer to the average dipole moment. Indeed, the usual measurements of dipole moment are measurements that effectively average over many molecules or long times (in atomic terms) or both. 6-7 Hermitian Operators Let f and . be any square-integrable functions and Aˆ be an operator, all having the same domain. Aˆ is defined to be hermitian if .*Aˆfdv = fAˆ*.*dv (6-10) The integration is over the entire range of each spatial coordinate. Recall that the asterisk signifies reversal of the sign of i in a complex or imaginary term. The hermitian property has important consequences in quantum chemistry. 172 Chapter 6 Postulates and Theorems of Quantum Mechanics As an example of a test of an operator by Eq. (6-10), let us take the . and f to be square-integrable functions of x and Aˆ to be i(d/dx). Then the left-hand side of Eq. (6-10) becomes, upon integration by parts, (6-11) Since . and f are square integrable, they (and their product) must vanish at infinity, giving the zero term in Eq. (6-11). We now write out the right-hand side of Eq. (6-10): +8 -8 f(i d/dx)*.*dx=-i +8 -8 f(d.*/dx) dx (6-12) where the minus sign comes from carrying out the operation indicated by the asterisk. Equation (6-12) is equal to Eq. (6-11), and so the operator i(d/dx) is hermitian. Since the effect of i was to introduce a necessary sign reversal, it is apparent that the equality would not result for Aˆ = d/dx. Clearly, any hermitian operator involving a first derivative in any Cartesian coordinate must contain the factor i. The operators for linear momenta (Chapter 2) are examples of this. It is important to realize that Eq. (6-10) does not imply that .*Aˆf = fAˆ*.*. A simple example will make this clearer. Let Aˆ be the hydrogen atom hamiltonian, Hˆ =-1 2.2 -1/r, and let f be the 1s eigenfunction: f =(1/vp)exp(-r). Also, let . =v8/p exp(-2r) which is not an eigenfunction of Hˆ . Then, since Hˆ f =-1 2f, .*Hˆ f =- 1 2 .*f (6-13) But fH*.* = f - 1 2 (1/r2)(d/dr)r2(d/dr)-1/r 8/p exp(-2r) (6-14) = f [(1/r)-2]8/p exp(-2r)=[(1/r)-2].*f (6-15) (Since . has no . or f dependence, the parts of Hˆ * that include ./.. and ./.f have been omitted in Eq. (6-14).) Here we have two functions, -1 2.*f and [(1/r)-2].*f. They are obviously different. However, by Eq. (6-10), their integrals are equal since Hˆ is hermitian. 6-8 Proof That Eigenvalues of Hermitian Operators Are Real Let Aˆ be a hermitian operator with a square-integrable eigenfunction .. Then Aˆ. =a. (6-16) Each side of Eq. (6-16) must be expressible as a real and an imaginary part. The real parts must be equal to each other and so must the imaginary parts. Taking the complex Section 6-9 Eigenfunctions of a Hermitian Operator Form an Orthogonal Set 173 conjugate of Eq. (6-16) causes the imaginary parts to reverse sign, but they remain equal. Therefore, we may write Aˆ*.* =a*.* (6-17) We multiply Eq. (6-16) from the left by .* and integrate over all spatial variables: .*Aˆ. dv =a .*. dv (6-18) Similarly, we multiply Eq. (6-17) from the left by . and integrate: .Aˆ*.*dv =a* ..* dv (6-19) SinceAˆ is hermitian, the left-hand sides of Eqs. (6-18) and (6-19) are equal by definition (Eq. 6-10). Therefore, the right-hand sides are equal, and their difference is zero: (a -a*) .*. dv =0 (6-20) Since . is square integrable the integral cannot be zero. Therefore, a - a* is zero, which requires that a be real. 6-9 Proof That Nondegenerate Eigenfunctions of a Hermitian Operator Form an Orthogonal Set Let . and f be two square-integrable eigenfunctions of the hermitian operator Aˆ: Aˆ. = a1. (6-21) Aˆ*f* = a2f* (6-22) Multiplying Eq. (6-21) from the left by f* and Eq. (6-22) from the left by ., and integrating gives f*Aˆ. dv = a1 f*. dv (6-23) .Aˆ*f* dv = a2 .f* dv (6-24) The left sides of Eqs. (6-23) and (6-24) are equal by (6-10), and (a1 -a2) f*. dv =0. (6-25) If a1 = a2, the integral vanishes. This proves that nondegenerate eigenfunctions are orthogonal. 174 Chapter 6 Postulates and Theorems of Quantum Mechanics EXAMPLE 6-2 It has been shown (Section 6-7) that i(d/dx) is a hermitian operator. We know that it has eigenfunctions exp(±ikx) with eigenvalues ±k, which are real. So far, so good. However, this operator also has eigenfunctions exp(±kx), with eigenvalues ±ik, which are imaginary. This appears to violate the proof that eigenvalues of hermitian operators are real. Explain why neither of these eigenfunction sets is covered by the proof of section 6-8, and how one of them manages to obey the rule anyway. SOLUTION  The test for Hermiticity requires that i.*f|+-88 = 0. If f is ., and if . is square-integrable, this condition is satisfied, because .*. vanishes at ±8, giving 0-0=0. But neither of the exponential functions given above is square-integrable: They are both unequal to zero at ±8, so they both fall outside of the proof as given. Despite this, exp(±ikx) does have real eigenvalues, leading us to look more closely. Is it the case that i.*.|+-88 =0 for this set of functions, even though they do not vanish at infinity? It is indeed, since .*. =1, giving i -i =0 for this term. Thus we see that our requirement that . be square integrable is more restrictive than what is necessary, namely that i.*.|+-88 =0. Note that the other set of exponentials, exp(±kx), leads to i.*. =i exp(±2kx), which does not produce a value of zero when values at x=8and x=-8are subtracted. Note also that the functions exp(±ikx) are orthogonal for different values of k, whereas the functions exp(±kx) are not.  The point of the above example is that all of our proofs about eigenvalues or eigenfunctions of hermitian operators refer to cases where the eigenfunctions satisfy the requirement that i.* .|+-88 = 0. Square-integrability guarantees this, but some nonsquare-integrable sets of functions can satisfy it too. A hermitian operator can have eigenfunctions that are associated with complex or imaginary eigenvalues, but these must result from eigenfunctions that do not satisfy the requirement. 6-10 Demonstration That All Eigenfunctions of a Hermitian Operator May Be Expressed as an Orthonormal Set If a1=a2, Eq. (6-25) is satisfied even when the integral is finite. Therefore, degenerate eigenfunctions need not be orthogonal. But they must be linearly independent or else they are the self-same function (to within a multiplicative constant), and if they are linearly independent, they can be converted to an orthogonal pair. Hence, it is always possible to express the degenerate eigenfunctions of a hermitian operator as an orthogonal set (and, as we have just proved, it is necessary that nondegenerate eigenfunctions be orthogonal). Furthermore, the functions must be square integrable, hence normalizable. In general, then, we are able to assume that all of the eigenfunctions of a hermitian operator can be expressed as an orthonormal set. Oneway to orthogonalize two nonorthogonal, linearly independent functions (which may or may not be eigenfunctions) will now be demonstrated. Let the functions be . and f (assumed normalized) and the integral of their product have the value S: .*f dv =S (6-26) Section 6-11 Proof That Commuting Operators Have Simultaneous Eigenfunctions 175 We keep one of the functions unchanged, say ., and let f=f-S. be our new second function. . and f are orthogonal since .*f dv = .*(f -S.)dv = .*f dv  S  -S .*. dv  1  =0 (6-27) (The new function f needs to be renormalized.) This process, known as Schmidt orthogonalization, may be generalized and applied sequentially to any number of linearly independent functions. EXAMPLE 6-3 Two normalized 1s AOs are located on nearby nuclei, A and B, and overlap each other enough so that  1sA1sB dv=0.500. Construct a function from these two that is orthogonal to 1sA and is normalized. SOLUTION  1sB =1sB -0.5 · 1sA is orthogonal to 1sA. It is not yet normalized because (1sB )2 dv = (1s2B +0.25 · 1s2A -2 · 0.5 · 1sA1sB)dv = 1+0.25-2 · 0.5 · 0.5=0.75= 3 4 . So the normalized function we seek is 2 v3 (1sB -0.5 · 1sA).  6-11 Proof That Commuting Operators Have Simultaneous Eigenfunctions Aˆ and Bˆ are commuting operators if, for the general square-integrable function f, AˆBˆf =BˆAˆf . This can bewritten (AˆBˆ -BˆAˆ)f =0,which requires thatAˆBˆ -BˆAˆ =0ˆ. (ˆ0 is called the null operator. It satisfies the equation, ˆ0f = 0.) This difference of operator products is called the commutator of Aˆ and Bˆ and is usually symbolized5 by [Aˆ,Bˆ]. If the commutator [Aˆ,Bˆ] vanishes, then Aˆ and Bˆ commute. We will now prove an important property of commuting operators, namely, that they have “simultaneous” eigenfunctions (i.e., that a set of eigenfunctions can be found for one of the operators that is also an eigenfunction set for the other operator). Let ßi be the eigenfunctions for Bˆ :Bˆßi =bißi . For the moment, assume all the numbers bi are different (i.e., the eigenfunctions ßi are nondegenerate). Let [Aˆ,Bˆ]=0ˆ. Then Bˆ(Aˆßi)=AˆBˆßi =Aˆbißi =bi(Aˆßi) (6-28) The parentheses emphasize that the function obtained by operating on ßi with Aˆ is an eigenfunction of Bˆ with eigenvalue bi . But that function can only be a constant times ßi itself. Hence, for nondegenerate ßi we have that Aˆßi =cßi , and so ßi is an eigenfunction of Aˆ. This proves that the nondegenerate eigenfunctions for one operator will also be eigenfunctions for any other operators that commute with it. 5Other less common conventions are [Aˆ,Bˆ]- and (Aˆ,Bˆ). 176 Chapter 6 Postulates and Theorems of Quantum Mechanics If ßi is degenerate with other functions ßi,k , then we can only go so far as to say that Aˆßi = k ckßi,k, for this general linear combination is an eigenfunction of Bˆ having eigenvalue bi . But if this is so, then ßi is evidently not necessarily an eigenfunction of Aˆ. We shall not prove it here, but it is possible to show that one can always find some linear combinations of ßi,k to produce a set of newfunctions, ßi , that are eigenfunctions of Aˆ (and remain eigenfunctions of Bˆ as well). Therefore we can state that, if Aˆ and Bˆ commute, there exists a set of functions that are eigenfunctions for Aˆ and Bˆ simultaneously. An example of this property occurred in the particle-in-a-ring system described in Chapter 2. The hamiltonian and angular momentum operators commute for that system. There we found one set of functions, the trigonometric functions, that are eigenfunctions for Hˆ but not for Lˆz . But by mixing the energy-degenerate sines and cosines we produced exponential functions that are eigenfunctions for both of these operators. Another example concerns the familiar symmetry operations for reflection, rotation, etc. If one of these operations, symbolized Rˆ, commutes with the hamiltonian, then we should expect there to be a set of eigenfunctions for Hˆ that are simultaneously eigenfunctions for Rˆ. It was proved in Chapter 2 that this means that nondegenerate eigenfunctions must be symmetric or antisymmetric with respect to Rˆ. A symmetry operator that leaves Hˆ unchanged can be shown to commute with Hˆ . That is, if RˆHˆ =Hˆ , then RˆHˆ f =Hˆ Rˆf , where f is any function. To show this, let Rˆ be, say, a reflection operator. Then Rˆ operates on functions and operators to its right by reflecting the appropriate coordinates: Rˆf (q) = f (Rq). If Hˆ is invariant under reflection Rˆ, then H(q)=H(Rq), and it follows that RˆHˆ (q)f (q)=Hˆ (Rq)f (Rq)= Hˆ (q)f (Rq) = Hˆ (q)Rˆf (q), and so RˆHˆ f = Hˆ Rˆf . We shall formally develop the ramifications of symmetry in quantum chemistry in Chapter 13. The existence of simultaneous eigenfunctions for various operators has important ramifications for the measurement of a system’s properties. This is discussed in Section 6-15. 6-12 Completeness of Eigenfunctions of a Hermitian Operator In Chapter 3 we discussed the concept of completeness in connection with the power series expansion of a function. Briefly, a series of functions6 {f} having certain restrictions (e.g., all derivatives vary smoothly) is said to be complete if an arbitrary function f having the same restrictions can be expressed in terms of the series7 f = i cifi (6-29) Proofs exist that certain hermitian operators corresponding to observable properties have eigenfunctions forming a complete set in the space of well-behaved (continuous, 6A symbol in braces is frequently employed to represent an entire set of functions. 7Equation (6-29) is overly restrictive in that it requires that the function and the series have identical values at every point, whereas it is possible for them to disagree at points of zero measure. However, at the level of this book, we can ignore this distinction and use Eq. (6-29) without encountering difficulty. Section 6-12 Completeness of Eigenfunctions of a Hermitian Operator 177 single-valued, square-integrable) functions. These proofs are difficult and will not be given here.8 Instead we shall introduce Postulate VI The eigenfunctions for any quantum mechanical operator corresponding to an observable variable constitute a complete set. (Furthermore, we have seen in Section 6-10 that we can assume that this set has been made orthonormal.) We will now use this property to investigate further the nature of the average value of an operator. Let the systembe in some state . (normalized), not an eigenfunction ofMˆ . However, Mˆ possesses eigenfunctions {µ} that must form a complete set. Therefore, we can express . in terms of µ’s: . = i ciµi (6-30) Now we calculate the average value of M for the state .: Mav = .*Mˆ .dv =  i c*i µ*i Mˆ  j cjµj dv = i  j c*i cj µ*i Mˆ µj dv (6-31) But Mˆ µi =miµi , and so Mav = i  j c*i cj µ*i mjµj dv = i  j c*i cjmj µ*i µj dv (6-32) But we are assuming that {µ} is an orthonormal set, and so Mav = i  j c*i cjmj dij = i c*i cimi (6-33) What does this expression mean? Each measurement of the property corresponding to Mˆ must give one of the eigenvalues mi (postulate IV) and the average of many such measurements must be Mav. Equation (6-33) states how the individual measurements must be weighted to give the average, so it follows that each ci*ci is a measure of the relative frequency for observing the corresponding mi . Putting it another way, the absolute squares of the mixing coefficients in Eq. (6-30) give the probabilities that a measurement of the variable M will give the corresponding eigenvalue. For example, if . happens to be equal to (1/v2)µ1 + (1/v2)µ3, it follows that Mav =(1/2)m1 +(1/2)m3. EXAMPLE 6-4 What is the average value for the z-component of orbital angular momentum for the normalized function f =(1/v5)(.2s +2 ·.2p+1 )? SOLUTION  Since we know that the 2s and 2p+1 eigenfunctions have z-components of angular momentum of 0 and +1 respectively, we can say at once that p¯z = 15 · 0 + 4 5 · 1 = 0.8 a.u.  8See, e.g., Kemble [1, Section 25]. 178 Chapter 6 Postulates and Theorems of Quantum Mechanics 6-13 The Variation Principle Many of the calculations of quantum chemistry are based on the Rayleigh-Ritz variation principle which states: For any normalized, acceptable function f, Hav = f*Hˆ f dt =E0 (6-34) where E0 is the lowest eigenvalue of Hˆ . This statement is easily proved. We expand f in terms of {.i }, the complete, orthonormal set of eigenfunctions of Hˆ : f = i ci.i (6-35) As in the preceding section, this leads to f*Hˆ fdt = i c*i ciEi (6-36) Now c*i ci is never negative, and so Eq. (6-36) is merely a weighted average of the eigenvalues Ei . Such an average can never be lower than the lowest contributing member and the principle is proved. The variation principle is sometimes stated in an equivalent way by saying that the average value ofHˆ over f is an upper bound for the lowest eigenvalue ofHˆ . Following the approach of the example at the end of the previous section, if f for a hydrogen atom happens to be a function equal to (1/v2).1s +(1/v2).2s , the average energy for f is (1/2)E1s +(1/2)E2s , which obviously lies above the lowest eigenvalue E1s . 6-14 The Pauli Exclusion Principle We have already discussed the Pauli exclusion principle in Chapter 5. In its most general form, this is: Postulate VII . must be antisymmetric (symmetric) for the exchange of identical fermions (bosons). 6-15 Measurement, Commutators, and Uncertainty If we measure the exact position of the electron in a hydrogen atom, we force it into a state having a Dirac delta function as its wavefunction. Since this function is also an eigenfunction for the dipole moment operator, it follows that we also know the (instantaneous) dipole moment for the atom at that instant. In effect, measuring position measures dipole moment too. But the delta function is not an eigenfunction for the hamiltonian operator of the atom, and so we have not simultaneously measured the electronic energy of the atom. We have earlier seen that an eigenfunction for one operator can serve also as eigenfunction for another operator when the operators commute. In the above example, the Section 6-15 Measurement, Commutators, and Uncertainty 179 operators for position and dipole moment commute with each other but not with the hamiltonian operator. This leads us to recognize that we can simultaneously measure values for two variables only if their operators commute. Let us consider this situation more deeply by imagining two successive measurements on a hydrogen atom, one immediately following the other. If we first measure position and find r =2.0 a.u., and then measure dipole moment, we will get the value (µ=2.0 a.u.) corresponding to the electron being at r =2.0 a.u. That is where we found it in the first measurement, and it has not had time to move elsewhere before the second measurement. If we immediately follow with yet another position measurement, the electron will still be found at r =2 a.u. (We are imagining that no time elapses between measurements, which is a limit we cannot actually achieve. In the present case, though, since measurement of r is also a measure of µ, both measurements are done at once, so this is really not a problem.) Hence, it makes sense to say that we knowthese two values “simultaneously.” However, if we first measure position and then measure energy, we find something very different. Suppose that we find r =2 a.u. and then, in a subsequent measurement, E = -1/2 a.u. (E must, after all, be an eigenvalue of Hˆ , according to postulate IV.) We know that the eigenfunction during the first measurement was d(r -2 a.u.), and that during the second measurement was a 1sAO. If we immediately do yet another position measurement, we can find any value of r (with probabilities given by 4pr2.2 1s dr). The processes of measuring position and energy are incompatible in the sense that there is no single function that can describe the situation that exists during both measurements. The energy-measuring process can be pictured as forcing a reconstruction of the wavefunction in such a manner that it no longer corresponds to a particular position, while measurement of position forces a state function that does not correspond to a particular energy. (In this case, separate measurements would really be necessary, so the impossibility of doing a second measurement truly immediately after the first must be recognized. Indeed, one has to allow for the fact that finding an electron in one place and then at some other place must imply a lapse of time permitting the electron to travel.) The reader may suspect that there is some connection between commutators and the uncertainty principle, and this is indeed the case. It can be shown9 that the product of widths of simultaneous measurements (i.e., the uncertainty in their values) of two variables satisfies the relation a · b= 1 2  .* Aˆ,Bˆ.dt  (6-37) where . is normalized, and the absolute value |X| is defined as the positive square root of X*X. If A and B are conjugate variables, such as position and momentum, Eq. (6-37) becomes a ·b = h¯/2, which is Heisenberg’s uncertainty relation. If Aˆ and Bˆ commute, the right-hand side of Eq. (6-37) vanishes, and the values of both variables may, in theory, be simultaneously known exactly. Among the properties of greatest interest in molecular quantum mechanics are energy, symmetry, and electron orbital angular momentum because, for many molecules, some of these operators commute. Thus, if we know that an oxygen molecule is in a nondegenerate stationary electronic state, we know that it is possible to characterize that state by a definite value of the orbital angular momentum along 9See Merzbacher [2, Section 10-5]. 180 Chapter 6 Postulates and Theorems of Quantum Mechanics the internuclear axis. Also, we know that the wavefunction must be symmetric or antisymmetric for inversion through the molecular midpoint. 6-16 Time-Dependent States Much of quantum chemistry is concerned with stationary states, for which  is a product of a space term . (an eigenfunction of Hˆ ) and a time-dependent factor exp(-iEt/¯h), which we usually ignore because it has no effect on particle probability distribution. Sometimes, however, it becomes necessary to consider time-dependent states. In this section we illustrate how some of these may be treated. There are two types of situation to distinguish. One is situations where the potential is changing as a function of time, and hence the hamiltonian operator is time dependent. An example is a molecule or atom in a time-varying electromagnetic field. The other is situations where the potential and hamiltonian operator do not change with time, but the particle is nonetheless in a nonstationary state. An example is a particle that is known to have been forced into a nonstationary state by a measurement of its position. We deal here with the second category. As our first example, consider a particle in a one-dimensional box with infinite walls. Suppose that we measure the particle’s position and find it in the left side of the box (i.e., between x = 0 and L/2; we will be more specific shortly) at some instant that we take to be t = 0. We are interested in knowing what this implies about a future measurement of the particle’s position. Our knowing that the particle is on the left side at t =0 means that the wavefunction for this state is not one of the time-independent box eigenfunctions we sawin Chapter 2, because those all predict equal probabilities for finding the particle on the two sides of the box. If the state function is not stationary, it must be time dependent, and it must satisfy Schr¨odinger’s time-dependent equation (6-1). We have, then, that the state function is time dependent, and that * =||2 is zero everywhere on the right side of the box when t =0. (We have not yet been specific enough to describe ||2 in detail on the left side of the box.) Schr¨odinger’s time-dependent equation (6-1) is not an eigenvalue equation. However, Eq. (6-2) shows that Schr¨odinger’s time-dependent equation is satisfied by timeindependent eigenfunctions of Hˆ if they are multiplied by their time-dependent factors f (t) = exp(-iEt/¯h). Furthermore, Eq. (6-2) continues to be satisfied if the term .(q)f (t) is replaced by a sum of such terms. (See Problem 6-9.) This means that we can seek to express the time-dependent state function, (x, t), as a sum of timeindependent box eigenfunctions as long as each of these is accompanied by its time factor f (t). When t = 0, all the factors f (t) equal unity, so at that point in time  becomes the same as the sum of box eigenfunctions without their time factors. Our strategy, then, is to find a linear combination of time-independent box eigenfunctions, .n, that describe  when t =0. This is easy to do because the time factors are all equal to unity. Once we have found the proper mixture of .n, we multiply each by its time factor and then observe the behavior of ||2 as t increases. In the case of our particle-in-a-box example, we can start with a very simple approximation to . at t =0 by taking a 50–50 mixture of .1 and .2: (x, t)=(1/v2) .1 exp(-iE1t/¯h)+.2 exp(-iE2t/¯h) (6-38) Section 6-16 Time-Dependent States 181 .1 0 .2 .(x, 0) 0 0 2/L 2/L 2/L Figure 6-1  Stationary eigenfunctions (n=1, 2) for the particle in a box and their normalized sum. We have included the functions f (t), even though they equal unity when t =0, because they are needed to make (x, t) a solution to Schr¨odinger’s equation (6-1) and because they will inform us of the nature of  at later times. We choose this pair of functions because, when t =0, both are positive on the left, but they differ in sign on the right, giving us some cancellation there. (See Fig. 6-1.) Obviously, we have not succeeded in describing a function that has no probability density on the right, but we already have a definite imbalance in that direction. (It is not difficult to see that some .3 with a positive coefficient should help remove much of the remaining probability density on the right.) Nowwe are in a position to examine this ||2 as time progresses—the time evolution of the square of a wavepacket that describes the probability distribution for a particle that is known to have been in the left half of the box at t =0. This is mathematically straightforward (Problem 6-20) and leads to the probability distributions sketched in Fig. 6-2 after time steps of t. The figure shows a changing distribution suggestive of the particle bouncing back and forth in the box with a cycle time of 8t. It is not difficult to see why this happens. 1 and 2 have different “frequency factors” exp(-iEnt/¯h), so they behave like two waves oscillating at different frequencies. Since E2 = 4E1 (recall E .n2 in the box), 2 oscillates four times faster than 1. This means that, by the time 1 has made half a cycle (and is equal to -1 times its starting coordinates), 2 has made two cycles and is just as it was at t =0. It is easy to see from Fig. 6-1 that this will give a  that is skewed to the right, leading to the distribution shown in Fig. 6-2(e). (This allows us to conclude that 4t equals 1/2 of the cycle time of 1. See Problem 6-21.) If we want a more accurate starting representation for the localized particle, we must mix together a larger number of stationary-state wavefunctions. In order to decide how much of each is needed, we must have a better-defined description of  at t =0. 182 Chapter 6 Postulates and Theorems of Quantum Mechanics 3 /L 2 /L 1 /L 0 0 0 L 0 L t = 0 8.t .t 7.t 6.t 5.t 2.t 3.t 4.t (a) (b) (c) (d) (e) L 0 L 0 L Figure 6-2  |(x, t)|2 from Eq. (6-38) as it appears at various times. Figure 6-3  A normalized half sine wave in the left half of a “box.” The numbers at left are values of , not of E. Suppose, for instance, we choose to describe the starting wavefunction (x, 0) as a normalized half sine wave in the left side of the box and zero at the right (Fig. 6-3). Then we can calculate the amount (cn) of each of the stationary-state functions .n present in this function as follows: cn = L 0 .n(x, 0)dx (6-39) This follows from the completeness10 and orthonormality of {.n}. (See Problem 6-4.) Evaluation of Eq. (6-39) for the first few terms gives (Problem 6-22) (x, t)=0.600.1 +0.707.2 +0.360.3 +0.000.4 +0.086.5+··· (6-40) This modifies slightly our earlier commonsense combination and also verifies our prediction that .3 times a positive coefficient would be beneficial. 10Because (x, 0) has a discontinuous derivative at the midpoint of the box, it falls outside the class for which {.n} is complete. However, because this problem is restricted to dx around one point, it should have little effect. Section 6-16 Time-Dependent States 183 This example illustrates the basic approach to such problems: 1. Find a function that represents the initial particle distribution (x, 0). 2. Expand that function as a series of eigenfunctions for the hamiltonian, and include the time-dependent factor for each term. 3. Evaluate the probability distribution at other times t by examining | (x, t)|2. As a second example, suppose one were considering the behavior of the electronic state immediately after a tritium atom emits a beta particle to become a helium ion: 31 H.32 He+0 -1 e. A crude analysis could be attempted by imagining that the nuclear charge suddenly changes from 1 to 2 a.u., and the orbiting electron (not the beta particle) suddenly finds itself in a state (the original 1s state) that is not an eigenfunction for the new hamiltonian. We would accordingly set (t =0) to be the 1sAO of hydrogen and expand this in terms of He+ eigenfunctions. Only s-typeAOs could contribute because of symmetry. The coefficients are given by cn = allspace .1s(Z =1).ns(Z =2)dv (6-41) and the time-dependent wavefunction is (in a.u.) (r, .,f, t) = c1.1s(Z =2) exp(-2it)+c2.2s(Z =2) exp(-it/2) +c3.3s(Z =2) exp(-2it/9)+··· (6-42) This function could be evaluated at various times t and would be found to give an oscillating spherical distribution, as though the electron cloud were shrinking, then rebounding to its original distance, then shrinking again, etc. Our next example is perhaps the most important: It is a particle initially localized in some region of space, say by measurement of its position, and free to move anywhere thereafter. Taking the one-dimensional case, we imagine that the particle has been detected around x =0 at t =0 (the measurement caused it to be “unfree” for an instant). We assume that the average momentum of the particle is zero. We seek to know how the probability distribution function for the particle will evolve in time. As before, we need a functional description of the wavefunction at t =0,(x, 0). We will then expand that in terms of eigenfunctions of the free-particle hamiltonian. As we have seen in Section 2-5, the free-particle eigenfunctions may be written exp(±iv2mEx/¯h), whereE is any nonnegative number. These are also eigenfunctions for the momentum operator, with eigenvalues v2mE¯h. The function usually selected to describe (x, 0) is a gaussian function: (x, 0)= 4 2a/p exp(-ax2) (6-43) The constant a affects the width of the gaussian and reflects our degree of certainty in our knowledge of position. Large a gives a tight function and small uncertainty x. The relationship between the gaussian function in x and the coefficients of the eigenfunctions as a function of v2mE/¯h is depicted in Fig. 6-4. Remarkably, the coefficient values are also described by a gaussian function (inv2mE/¯h). Furthermore, the tighter the gaussian function is in x, the broader the corresponding gaussian function 184 Chapter 6 Postulates and Theorems of Quantum Mechanics Figure 6-4  (a and c) Gaussianwave packets describing particles found to be at x=0 with differing degrees of certainty. (b and d)Values of ck (where k=v2mE/¯h) for momentum eigenfunctions that combine to express the gaussian wave packets to their left. (a, b) corresponds to relatively certain position and relatively uncertain momentum, whereas (c, d) corresponds to the opposite situation. is inv2mE/¯h (Problem 6-24). That is, we need to combine free-particle eigenfunction contributions from a wider range of momenta to create a tighter position function. This means that greater certainty in position goes with greater uncertainty in momentum, in accord with the uncertainty principle. Once we have the appropriate mixture of momentum eigenfunctions, each with its time-dependent term, we can follow the time evolution of the particle wave packet. We find that the packet spreads out more and more about x =0 as time passes, which means that our knowledge of position is decreasing as time passes. Even though the average position is not changing, the probability for finding the particle at a distance from x =0 is increasing. We can interpret this by remembering that the square of the wavefunction predicts the results of many experiments. In each of many position measurements finding the particle near x = 0, we impart some degree of momentum to the particle. Then, in a second measurement, we find that some of the particles have moved away from x = 0. The longer we wait before taking the second measurement, the greater this spread in x values. (Our assumption of zero average momentum amounts to saying that large deviations from zero momentum are equally likely for motion toward x =+8 Section 6-17 Summary 185 and-8.) The more precise our first position measurement is, the greater the likelihood of introducing momenta quite different from zero and the more rapidly the wave packet spreads out as time passes. In the first example, a packet located in half of a one-dimensional box, we saw ||2 oscillate back and forth, changing shape in the process so that the motion cannot be described with a single frequency. A related important case is that of an oscillator moving in a harmonic potential. Let us assume the oscillator’s position at t =0 to be described by a gaussian wavefunction. If this function is not centered at the oscillator’s equilibrium position, we have a time dependent situation analogous to the above particle-in-a-box case. For a harmonic potential, it can be shown11 that ||2 remains a gaussian function as time passes (i.e., does not change shape), and that the center of this gaussian oscillates back-and-forth about the equilibrium position with the classical frequency. This is a situation of interest to spectroscopists because it bears on the process of electronically exciting a sample of diatomic molecules. Suppose the molecules are initially in their ground electronic and ground vibrational states. Then their vibrational wavefunction is a simple gaussian function (the lowest-energy harmonic oscillator wavefunction) centered at Re (i.e., not off-center). If the molecules are excited by a laser pulse to a new electronic state having an equilibrium internuclear distance of Re , the vibrational wavefunction at t =0 is still the simple gaussian centered at Re, which means that it is now off-center. As time passes, the center of this function oscillates back and forth about Re in a coherent manner (i.e., describable with a single frequency). Thus, we have gone from an initial state describing an ensemble of molecules vibrating about Re with zero-point energy hv/2 and with random phases (i.e., a time-independent state wherein such measurable properties as average molecular dipole moment appear to be constant) to a final state where the molecules are vibrating in phase about Re (a time-dependent state, wherein one might expect to see time variation of such properties). If Re were quite a bit smaller than Re , for instance, then almost all the molecules would find themselves to be “too short” at t = 0, “too long” a short time later, etc., as they swing in phase about Re . As a result of this simple behavior, it is possible to take advantage of it and time a second laser pulse to strike the molecules when they are almost all at their shortest, or all at their longest, extension. Of course, in real molecules the potential is not exactly harmonic. Furthermore, phase coherence is eventually lost due to collisions. So the second pulse must come very soon after the first one. 6-17 Summary Some of the postulates and proofs described in this chapter are most important for what follows in this book, and we list these points here. 1. . describes a state as completely as possible and must meet certain mathematical requirements (single-valued, etc.). .*. is the probability density distribution function for the system. 2. For any observable variable, there is an operator (hermitian) which is constructed from the classical expression according to a simple recipe. (Operators related to 11See Schiff [3, pp. 67, 68], and Tanner [4]. 186 Chapter 6 Postulates and Theorems of Quantum Mechanics “spin” are the exception because the classical analog does not exist.) The eigenvalues for such an operator are the possible values we can measure for that quantity. The act of measuring the quantity forces the system into a state described by an eigenfunction of the operator. Once in that state, we may know exact values for other quantities only if their operators commute with the operator associated with our measurement. 3. If the hamiltonian operator for a system is time independent, stationary eigenfunctions exist of the form .(q,.) exp(-iEt/¯h). The time-dependent exponential does not affect the measurable properties of a system in this state and is almost always completely ignored in any time-independent problem. 4. The formula for the quantum-mechanical average value [Eq. (6-9)] is equivalent to the arithmetic average of all the possible measured values of a property times their frequency of occurrence [Eq. (6-33)]. This means that it is impossible to devise a function that satisfies the general conditions on . and leads to an average energy lower than the lowest eigenvalue of Hˆ . 5. The square-integrable eigenfunctions for an operator corresponding to an observable quantity form a complete set, which may be assumed orthonormal. The eigenvalues are all real. 6. Any operation that leaves Hˆ unchanged also commutes with Hˆ . 7. Wavefunctions describing time-dependent states are solutions to Schr¨odinger’s timedependent equation. The absolute square of such a wavefunction gives a particle distribution function that depends on time. The time evolution of this particle distribution function is the quantum-mechanical equivalent of the classical concept of a trajectory. It is often convenient to express the time-dependent wave packet as a linear combination of eigenfunctions of the time-independent hamiltonian multiplied by their time-dependent phase factors. 6-17.A Problems 6-1. Prove that d2/dx2 is hermitian. 6-2. Integrate the expressions in Eqs. (6-13) and (6-15) to show that their integrals are equal. 6-3. Prove that, if a normalized function is expanded in terms of an orthonormal set of functions, the sum of the absolute squares of the expansion coefficients is unity. 6-4. Show that a particular coefficient ck in Eq. (6-30) is given by ck = µ*k . dv. 6-5. A particle in a ring is in a state with wavefunction . =1/vp cos(2f). a) Calculate the average value for the angular momentum by evaluating  .*Lˆz. df, where Lˆz =(h¯/i) d/df. (Use symmetry arguments to evaluate the integral.) b) Express . as a linear combination of exponentials and evaluate the average value of the angular momentum using the formula Lz,av = i c*i ciLzi where Lzi is the eigenvalue for the ith exponential function. Section 6-17 Summary 187 6-6. Using Eq. (6-37), show that x · px = ¯h/2. 6-7. What condition must the function f satisfy for the equality part of = to hold in Eq. (6-34)? 6-8. Suppose you had an operator and a set of eigenfunctions for it that were associated with real eigenvalues. Does it necessarily follow that the operator is hermitian as defined by Eq. (6-10)? [Hint: Consider d/dr and the set of all functions exp(-ar), where a is real and positive definite.] 6-9. a) Show that the nonstationary state having wavefunction  =(1/v2).1s exp(it/2)+(1/v2).2p0 exp(it/8) is a solution to Schr¨odinger’s time-dependent equation when Hˆ is the timeindependent Hˆ of the hydrogen atom. Use atomic units (i.e., h¯ =1). b) This time-dependent state is dipolar and oscillates with a characteristic frequency .. Show that . satisfies the relation E2 - E1 = E = 2p. in a.u. (The dipole oscillates at the same frequency as that of light required to drive the 1s .. 2p transition. This is central to the subject of spectroscopy.) 6-10. From the definition that f =f -S. [see the discussion following Eq. (6-26)], evaluate the normalizing constant for f, assuming that f and . are normalized. 6-11. Given the two normalized nonorthogonal functions (1/vp)exp(-r) and v1/3pr exp(-r), construct a new function f that is orthogonal to the first function and lies within the function space spanned by these two functions, and is normalized. 6-12. If the hydrogen atom 1s AO is expanded in terms of the He+ AOs, what is the coefficient for (a) the He+ 1s AO? (b) the He+ 2p0 AO? 6-13. The lowest-energy eigenfunction for the one-dimensional harmonic oscillator is .n=0 =(ß/p)1/4 exp(-ßx2/2). a) Demonstrate whether or not momentum is a constant of motion (i.e., is a “sharp” quantity) for this state. b) Calculate the average momentum for this state. 6-14. Demonstrate whether x2d/dx and xd2/dx2 commute. What about xd/dx and x2d2/dx2? 6-15. Evaluate the following integrals over all space. In neither case should you need to do this by brute force. a)  (3dxy)Lˆ2(3dxy) dv b)  3dxyLˆz(3dxy) dv 188 Chapter 6 Postulates and Theorems of Quantum Mechanics 6-16. The operators for energy and angular momentum for an electron constrained to move in a ring of constant potential are, respectively, in a.u. -(1/2)d2/df2 and (1/i)d/df. a) Discuss whether or not there should be a set of functions that are simultaneously eigenfunctions for both operators. b) Discuss whether or not there is a set of functions that are eigenfunctions for one of these operators but not the other. c) Discuss whether it is reasonable to expect these two physical quantities to be exactly knowable simultaneously or whether the uncertainty principle makes this impossible. 6-17. Suppose a hydrogen atom state was approximated by the function f=(1/v3)1s +(1/v3)2s+(1/v3)3s, where 1s, 2s, and 3s are normalized eigenfunctions for the hydrogen atom hamiltonian. What would be the average value of energy associated with this function, in a.u.? 6-18. A function f is defined as follows: f = 0.1 · 1s + 0.2 · 2p1 + 0.3 · 3d2, where 1s is the normalized eigenfunction for the 1s state of the hydrogen atom, etc. Evaluate the average value of the z component of angular momentum in a.u. for this function. 6-19. Without looking back at the text, prove that (a) eigenvalues of hermitian operators are real, (b) nondegenerate eigenfunctions of hermitian operators are orthogonal, (c) nondegenerate eigenfunctions of Aˆ must be eigenfunctions of Bˆ if Aˆ and Bˆ commute. 6-20. Using Eq. (6-38), obtain an expression for ||2 as a function of x and t . 6-21. Evaluate t of Fig. 6-2 in terms of m, h, and L. 6-22. Verify the values of the coefficients in Eq. (6-40). How can you tell from simple inspection that (a) c2 will be largest and positive, (b) c4 will be zero, (c) ci will tend toward small values at large i? 6-23. a) Evaluate the first two coefficients in Eq. (6-42). b) What qualitative difference would you expect between  of Eq. (6-42) and one that takes explicit account of the changing potential resulting as the beta particle travels away from the nucleus? 6-24. Showby qualitative arguments based on mathematical functions why coefficients ck should drop off more rapidly with k if the position wave packet is broader. ck =  v4 2a/p exp(-ax2) exp(ikx) dx. 6-25. a) Prove that, ifV is real and(x, y, z, t) satisfies Schr¨odinger’s time-dependent equation, then (x, y, z,-t)* is also a solution. (This is called “invariance under time reversal.”) b) Show that, for stationary states, invariance under time reversal means that Hˆ .* =E.* if Hˆ . =E. and if V is real. c) Show from (b) that nondegenerate eigenfunctions of Hˆ (with real V ) must be real. Section 6-17 Summary 189 d) What becomes of the 2p-1AO(with time dependence included) upon complex conjugation and time reversal? the 2p0 AO? e) Can the statement in (c) be reworded to say that all degenerate eigenfunctions of Hˆ (with real V ) must be complex? Multiple Choice Questions (Try to answer these without referring to the text.) 1. Which one of the following statements about the eigenfunctions of a timeindependent Hamiltonian operator is true? a) Any linear combination of these eigenfunctions is also an eigenfunction for Hˆ . b) The state function for this system must be one of these eigenfunctions. c) The eigenvalues associated with these eigenfunctions must all be real. d) These eigenfunctions must all be orthogonal to one another. e) These eigenfunctions have no time dependence. 2. A hydrogen atom is in a nonstationary state having the wavefunction 1 2 {.1s exp(it/2¯h)+ .2s exp(it/8¯h)+.2p0 exp(it/8¯h)+.2p+1 exp(it/8¯h)} Which statement is true at any time t? a) A measurement of the energy has a 25% chance of giving -0.5 a.u. b) The average value of the z component of angular momentum is 0.5 a.u. c) The average energy is -78 a.u. d) A measurement of the total angular momentum has a 75% chance of giving v2 a.u. e) None of the above statements is true. 3. The function rexp(-0.3r2) cos . is expanded in terms of hydrogen atom wavefunctions. This series may have finite contributions from bound-state eigenfunctions a) of all types: s,px,py,pz, dxy, dyz, etc. b) of all types except s. c) of types px,py,pz only. d) of type pz only. e) of types pz and dz2 only. References [1] E. C. Kemble, The Fundamental Principles of Quantum Mechanics with Elementary Applications. Dover, NewYork, 1958. [2] E. Merzbacher, Quantum Mechanics, 3rd ed.Wiley, NewYork, 1998. [3] L. I. Schiff, Quantum Mechanics, 3rd ed. McGraw-Hill, NewYork, 1968. [4] J. J. Tanner, J. Chem. Ed., 67, 917 (1990). Chapter 7 The Variation Method 7-1 The Spirit of the Method The proof of the Rayleigh-Ritz variation principle (Section 6-12) involves essentially two ideas. The first is that any function can be expanded into a linear combination of other functions that span the same function space. Thus, for example, exp(ikx) can be expressed as cos(kx) + i sin(kx). An exponential can also be written as a linear combination of powers of the argument: exp(x)=1+x +x2/2!+x3/3!+···+xn/n!+··· (7-1) The second idea is that, if a function is expressed as a linear combination of eigenfunctions for the energy operator, then the average energy associated with the function is a weighted average of the energy eigenvalues. For example, if f = 1 v2 .1 + 1 v2 .2 (7-2) where ˆ H.1 =E1.1, ˆ H.2 =E2.2, E1 =E2 (7-3) then measuring the energy of many systems in states described by f would give the result E1 half of the time and E2 the other half. The average value, 1 2E1 + 1 2E2 must lie between E1 and E2. Alternatively, if f =1 3 .1 +2 3 .2 (7-4) measurements would give E1 one-third of the time, and E2 the rest of the time, for an average that still must lie between E1 and E2. It should be evident that, even when f is a linear combination of many eigenfunctions .i , the average value of E can never lie below the lowest or above the highest eigenvalue. The variation method is based on the idea that, by varying a function to give the lowest average energy, we tend to maximize the amount of the lowest-energy eigenfunction .0 present in the linear combination already discussed. Thus, if we minimize E¯ =  f*Hˆ f dv  f*f dv (7-5) the resulting f should tend to resemble .0 since we have maximized (in a sense) the amount of .0 in f by this procedure. 190 Section 7-2 Nonlinear Variation: The Hydrogen Atom 191 7-2 Nonlinear Variation: The Hydrogen Atom We have already seen (Chapter 4) that the lowest-energy eigenfunction for the hydrogen atom is (in atomic units) .1s = 1 vp exp(-r) (7-6) Suppose we did not knowthis and used the variation method to optimize the normalized trial function f = . . . 3 p . . exp(-.r) (7-7) In this example, when . =1, f becomes identical to .1s , but in more complicated systems the trial function never becomes identical to an eigenfunction of the hamiltonian. Nevertheless, this is a good example to start with since there are few mathematical complexities to obscure the philosophy of the approach. The variation method requires that we minimize E¯ = f*Hˆ f dv (7-8) by varying .. [f is normalized, so no denominator is required in Eq. (7-8).] Since the trial function f has no .- or f-dependent terms for .2 to operate on, only the radial part of .2 is needed in Hˆ . Thus [from Eq. (4-7)] Hˆ =- 1 2 1 r2 d dr r2 d dr - 1 r (7-9) According to Eq. (7-8), we need first to evaluate the quantity Hˆ f: Hˆ f = - 1 2 1 r2 d dr r2 d dr - 1 r . 3 p exp (-.r) (7-10) ... = (. -1) r - . 2 2 . 3 p exp (-.r) (7-11) Incorporating this into Eq. (7-8) gives (after integrating . and f in dv to give 4p) E¯ = 4p . 3 p  8 0 (. -1) r - . 2 2 exp (-2.r) r2 dr (7-12) = 4. 3 (. -1) 8 0 r exp (-2.r) dr -. 2 2  8 0 r2 exp (-2.r) dr (7-13) Using the integral table in Appendix 1, we obtain E¯ =. 2 2 -. (7-14) 192 Chapter 7 The Variation Method Now we have a simple expression for E¯ as a function of . . To obtain the minimum, we set the derivative of E¯ to zero: dE¯ d. =0=. -1 (7-15) As we expected, . =1. Inserting this value for . into Eq. (7-14) gives E¯ =-1 2 a.u., which is identical with the lowest eigenvalue for the hydrogen atom. This example demonstrates thatminimizingE¯ for a trial function causes the function to become like the lowest eigenfunction for the system. But it is more realistic to examine a case where the trial function is incapable of becoming exactly identical with the lowest eigenfunction. Suppose we assumed a trial form (normalized) of f =  . 5 3p r exp(-.r) (7-16) Proceeding as before, we first evaluate Hˆ f: Hˆ f = - 1 r2 + 2. -1 r - . 2 2 f (7-17) This leads to E¯ (. )= 4 3 . 2 8 - 3. 8 (7-18) so that dE¯ d. =0= 4 3 . 4 - 3 8 (7-19) and E¯ is a minimum when . = 3 2 . Thus, our energy-optimized f is f =  35 96p r exp- 3r 2  (7-20) This is obviously not identical to the eigenfunction given by Eq. (7-6), but it must be expressible as a linear combination of hydrogen atom eigenfunctions, and the amount of .1s present should be quite large unless the trial form was unwisely chosen. Since f also must contain contributions from higher energy eigenfunctions, it follows that E¯ must be higher in energy than -1 2 a.u. We test this by inserting . = 3 2 into Eq. (7-18), obtaining an E¯ of -38 a.u., (-0.375 a.u.). This value is above the lowest eigenvalue, but it is well below the second-lowest eigenvalue (-18 a.u.) associated with 2s, 2p eigenfunctions, and so we know that f does indeed contain much 1s character. We can find out exactly how much 1s eigenfunction is contained in f by calculating the overlap between f and the 1s eigenfunction. That is, since the 1s function is orthogonal to all the other hydrogen atom eigenfunctions; .1s.j dv = 0 j =1s 1 j =1s (7-21) Section 7-2 Nonlinear Variation: The Hydrogen Atom 193 and since f =c1s.1s + j =1s cj.j (7-22) it follows that (7-23) Integrating  .1sf dv, where f is given by Eq. (7-20), gives c1s=0.9775, so f does indeed “contain” a large amount of .1s, (If c1s =1, then .1s and f are identical.) This suggests another way of trying to get a “best” approximate wavefunction. We could find the value of . that maximized the overlap between f and.1s. This maximizes c1s in Eq. (7-22). If one does this (Problem 7-6), one obtains . = 53 , which corresponds to an overlap of 0.9826 and an E¯ of -0.370 a.u. At first sight, this seems puzzling. f. = 53  has a larger amount of .1s in it, but f. = 3 2  has a lower average energy. But it is really not so unreasonable. Maximizing the value of c1s causes . to take on a certain value that may cause the coefficient for some high-energy state (say, 6s) to become relatively larger, producing a tendency to raise the average energy. On the other hand, minimizing E¯ is a process that is implicitly concerned with what all the coefficients are doing. This allows for a different sort of compromise wherein c1s may be allowed to be a bit smaller if the associated energy loss is more than compensated for by a favorable shifting in values of higher coefficients (e.g., if c2s increases and c6s decreases). The purpose of this discussion is to emphasize that the variation method optimizes the trial function in a certain sense (best energy), but that other kinds of optimization are conceivable. The optimization that gives best overlap is not generally useful because it requires that we know the exact solution to begin with. If we vary . to obtain optimal agreement for r¯, or V¯ , or r2, we would find a different value of . appropriate for each property. In each case, however, we would need to know the correct value of ¯r, etc., before starting. A great virtue of the energy variation method is that it does not require foreknowledge of the exact eigenvalue or eigenfunction. However, the function that gives the lowest value for E¯ might not be especially good in describing other properties. This is demonstrated in Fig. 7-1 and Table 7-1. The figure indicates that the trial function differs from the exact function chiefly near the nucleus, where r <1. This discrepancy shows up when we compare average values for various powers of r. The operators r, r2, and r3 become large when r is large. Thus, these operators magnify .2 dv at large r. Since the two functions are fairly similar at large r, the average values show fair agreement. But r-1 and r-2 become large when r is small. Thus, the fact that the approximate function is too small at small r shows up as a marked disagreement in the average value of r-2, this average being much larger for the exact function (see Table 7-1). Any trial function that is known to be especially inaccurate in some region of space (e.g., at small r) will give unreliable average values for operators that are largest in that region of space (e.g., r-2). Choosing a trial form such as Eq. (7-16), which must vanish at r =0, might seem foolish since we know that .1s does not vanish at r =0. And if we were interested in electron density at the nucleus for, say, calculating the Fermi contact interaction, this would indeed be a self-defeating choice. But if our interest is in energies or other properties having operators that are large in regions where the trial function is not too deficient, 194 Chapter 7 The Variation Method Figure 7-1  Plots of .1s and f [Eq. (7-20)] versus r. TABLE 7-1  Comparison between Exact Values for Some Properties of (1s) Hydrogen and Values Calculated from the Function Eq. (7-20) Quantity Exact (1s) value (a.u.) Trial function value (a.u.) E¯ -1 2 -38 Electron density at nucleus 1 p 0 ¯r 1.5 1.67 r2 3.0 3.33 r3 7.5 7.78 r-1 1.0 0.75 r-2 2.0 0.75 this choice would serve. One often settles for a mathematically convenient form even though it is known to be inadequate in someway. Care must then be exercised, however, to avoid using that trial wavefunction in ways which emphasize its inadequacies. 7-3 Nonlinear Variation: The Helium Atom We mentioned in Chapter 5 that the ground-state wavefunction 1s(1)1s(2) for helium was much too contracted if the 1s functions were taken from the He+ ion without modification. Physically, this arises because, in He+, the single electron sees only a Section 7-3 Nonlinear Variation: The Helium Atom 195 doubly positive nucleus, whereas in He each electron sees a doubly positive nucleus and another electron, so that in He the repulsion between electrons prevents them from spending as much time near the nucleus as in He+. Somehow, the 1s functions should be modified to reflect this behavior. We will now show how the variation method may be used to accomplish this. The form of the hydrogenlike ion 1s solution is 1s= Z3 p exp(-Zr) (7-24) For He+, Z =2, but we have just seen that this gives a function that is too contracted. Smaller values ofZ would cause the function to die away more slowly with r. Therefore, it is reasonable to replace the atomic number Z with a variable parameter . and find the value of . that gives the lowest average energy. Hence, we let 1s (1)= . 3 p exp(-.r1) (7-25) and our trial wavefunction is1 f(1, 2)=1s(1)1s(2) 1/v2a(1)ß(2)-ß(1)a(2) (7-26) The average energy is [since f(1, 2) is normalized] E¯ = f*(1, 2)Hˆ (1, 2)f(1, 2)dt (1)dt (2) (7-27) Since Hˆ (1, 2) contains no spin operators at our level of approximation, the integral separates into an integral over the space coordinates of both electrons and an integral over the spin coordinates of both electrons. The integration over spins gives a factor of unity. There remains E¯ = 1s(1)1s(2)Hˆ (1, 2)1s(1)1s(2)dv(1)dv(2) (7-28) where Hˆ (1, 2)=- 1 2.2 1 - 1 2.2 2 - 2 r1 - 2 r2 + 1 r12  (7-29) and where the . and f parts of .2 can be ignored since f(1, 2) is independent of these variables. The calculation is easier if we recognize that Hˆ (1, 2)=HˆHe+ (1)+HˆHe+ (2)+ 1 r12 (7-30) 1In this trial function, . has the same value in each atomic orbital. This is not a necessary restriction. There is no physical reason for not choosing the more general trial function where orbitals with different . are used. Symmetry requires that such a function be written 2-1/2[1s(1)1s(2)+1s(1)1s(2)]2-1/2[a(1)ß(2)-ß(1)a(2)]. This type of function is called a “split shell” wavefunction. It gives a lower energy for He than does the function (7-26). However, for most quantum-chemical calculations split shells are not used, the gain in accuracy usually not being commensurate with the increased computational effort. 196 Chapter 7 The Variation Method This allows us to express Eq. (7-28) as the sum of three integrals, the first of these being 1s(1)1s(2)HˆHe+ (1) 1s(1)1s(2)dv(1)dv(2) (7-31) Since the operator in the integrand operates only on coordinates of electron 1, we can separate this into a product of two integrals: 1s(2)1s(2)dv(2) 1s(1)HˆHe+ (1) 1s(1)dv(1) (7-32) The 1s functions are normalized, and so the first integral is unity. The second integral is almost identical to the integral in Eq. (7-8) and has the value . 2/2-2. . Therefore, the first of the three integrals mentioned above equals . 2/2 - 2. . The second of the three integrals is identical with Eq. (7-31) except that the operator acts on electron 2 instead of 1. This integral is evaluated in the same manner and gives the same result. The third integral, in which the operator is 1/r12, is more difficult to evaluate. This interesting and instructive problem constitutes a detour from the main sequence of ideas in this chapter and is therefore discussed in Appendix 3. We here simply take the result, 58 . , and proceed with the variation calculation. We now have an expression for E¯ as a function of . : E¯ =2 . 2/2-2. + 5 8 . =. 2 - 27 8 . (7-33) Minimizing E¯ with respect to . gives dE¯ d. =0=2. - 27 8 (7-34) so that . = 27 16 (7-35) This value of . is smaller than the unmodified He+ value of 2, as we anticipated. Let us see how much the average energy has been improved. According to Eq. (7-33), when . is 2, E¯ is equal to -2.75 a.u. When . = 27 16 ,E¯ equals -2.848 a.u., so the average energy has been lowered by approximately 0.1 a.u., or 2.7 eV, or 62 kcal/mole. (The exact nonrelativistic energy for He is-2.903724377 a.u.) Further analysis would show that, by decreasing . , we have decreased the average kinetic energy (the less compressed wavefunction changes slope less rapidly), raised the nuclear-electron attraction energy from a large negative to a smaller negative value (the decreased attraction resulting from the electrons being farther from the nucleus, on the average), and decreased the interelectronic repulsion energy from a higher positive value to a lower one. The variational procedure has allowed the wavefunction to adjust to the best compromise it can achieve among these three factors. If . becomes less than 27 16 the loss of nuclear– electron attraction is too great to be offset by the loss of interelectronic repulsion and kinetic energy. Varying a parameter in the argument of an exponential produces a nonlinear change in the function, and so calculations of the type described above are referred to as nonlinear variation calculations. Such calculations tend to become mathematically complicated and are not frequently used except for fairly simple systems. The fact Section 7-4 Linear Variation: The Polarizability of the Hydrogen Atom 197 that the hamiltonian operator is a linear operator [i.e., Hˆ (c1f1 +c2f2) = (c1Hˆ f1 + c2Hˆ f2)] makes a linear variation procedure more convenient for most purposes. 7-4 Linear Variation: The Polarizability of the Hydrogen Atom Suppose we wish to express a wavefunction . (which may be approximate) as a linear combination of two known functions f1 and f2: . (c1, c2)=c1f1 +c2f2 (7-36) The question is, what values of c1 and c2 give a . that best approximates the exact wavefunction for a particular system? The usual approach is to determine which values of c1 and c2 give the . associated with the minimum average energy attainable. The technique for achieving this, called the linear variation method, is by far the most common type of quantum chemical calculation performed. An example of a problem that can be treated by this method is the polarizability of the hydrogen atom. The wavefunction for the unperturbed hydrogen atom in its ground state is spherically symmetrical. But, when a uniform z-directed external electric field of strength F is imposed, the positive nucleus and the negative electron are attracted in opposite directions, which leads to an electronic distribution that is skewed with respect to the nucleus. The wavefunction describing this skewed distribution may be approximated by mixing with the unperturbed 1s function some 2pz function: . = c11s+c22pz . As indicated in Fig. 7-2, this produces a skewed wavefunction because the 2pz function is of the same sign as the 1s function on one side of the nucleus and of the opposite sign on the other. We will work out the details of this example after developing the method for the general case. Let the generalized trial function . be a linear combination of known functions f1,f2, . . . ,fn. (This set of functions is called the basis set for the calculation.) . =c1f1 +c2f2+···+cnfn (7-37) where the coefficients c are to be determined so that  .*Hˆ . dt  .*. dt =E¯ (7-38) is minimized. Substituting Eq. (7-37) into Eq. (7-38) gives E¯ =  c*1f*1 +c*2f*2 +···+c*nf*nHˆ (c1f1 +c2f2 +···+cnfn) dt  c*1f*1 +c*2f*2 +···+c*nf*n (c1f1 +c2f2+···+cnfn) dt = num denom (7-39) Since we will be dealing with cases in which the c’s and f’s are real, we will temporarily omit the complex conjugate notation to simplify the derivation. At the minimum value of E¯, .E¯ .c1 = .E¯ .c2 =···= .E¯ .cn =0 (7-40) 198 Chapter 7 The Variation Method Figure 7-2  Values of . versus z for 1s state of H atom (—-) and for approximate wavefunction given by 0.982 1s-0.188 2pz (- - -). The nucleus is at z=0 for each case. The partial derivative of Eq. (7-39) with respect to c1 is .E¯ .c1 =  f1Hˆ (c1f1 +···+cnfn) dt denom +  (c1f1 +···+cnfn)Hˆ f1 dt denom -(num) (denom)-2 f1 (c1f1+···+cnfn) dt + (c1f1+···+cnfn)f1dt = 0 (7-41) Section 7-4 Linear Variation: The Polarizability of the Hydrogen Atom 199 Multiplying through by denom, recalling that num/denom equals E¯, and rearranging, gives c1 f1Hˆ f1 dt -E¯ f1f1 dt +c2 f1Hˆ f2 dt -E¯ f1f2 dt +···+cn f1Hˆ fn dt -E¯ f1fn dt =0 (7-42) At this point, it is convenient to switch to an abbreviated notation: fiHˆ fj dt = Hij (7-43) fifj dt = Sij (7-44) The integral Sij is normally called an overlap integral since its value is, in certain cases, an indication of the extent to which the two functions fi and fj occupy the same space. Use of this abbreviated notation produces, for Eq. (7-42), c1 H11 -E¯S11+c2 H12 -E¯S12+···+cn H1n -E¯S1n=0 (7-45) A similar treatment for .E¯/.ci gives a similar equation: c1 Hi1 -E¯Si1+c2 Hi2 -E¯Si2+···+cn Hin -E¯Sin=0 (7-46) Thus, requiring that .E¯/.ci vanish for all coefficients produces n homogeneous linear equations (homogeneous, all equal zero; linear, all ci ’s to first power). If one chooses a value for E¯, there remain n unknowns—the coefficients ci . (The integrals Hij and Sij are presumably knowable since Hˆ and the functions fi are known.) Of course, one trivial solution for Eqs. (7-46) is always possible, namely, c1 =c2=···=cn =0. But this corresponds to . =0, a case of no physical interest. Are there nontrivial solutions as well? In quantum chemical calculations, nontrivial solutions usually exist only for certain discrete values of E¯. This provides the approach for solving the problem. First, find those values of E¯ for which nontrivial coefficients exist. Second, substitute into Eqs. (7-46) whichever of these values of E¯ one is interested in and solve for the coefficients. (Each value of E¯ has its own associated set of coefficients.) But how do we find these particular values of E¯ that yield nontrivial solutions to Eqs. (7-46)? The answer is given in Appendix 2, where it is shown that the condition which must be met by the coefficients of a set of linear homogeneous equations in order that nontrivial solutions exist is that their determinant vanish. Notice that, in the standard treatment given in Appendix 2, the coefficients are known and x,y, and z are unknown. Here, however, the coefficients ci are unknown, and Hij and Sij are known. Therefore, it is the determinant of the H’s, S’s and E¯ in Eqs. (7-46) that must equal zero:  H11 -E¯S11 H12 -E¯S12 ··· H1n -E¯S1n H21 -E¯S21 H22 -E¯S22 ··· H2n -E¯S2n ... ... ... Hn1 -E¯Sn1 Hn2 -E¯Sn2 ··· Hnn -E¯Snn  =0 (7-47) 200 Chapter 7 The Variation Method Expansion of this determinant gives a single equation containing the unknown E¯. Any value of E¯ satisfying this equation is associated with a nontrivial set of coefficients. The lowest of these values of E¯ is the minimum average energy achievable by variation of the coefficients. Substitution of this value of E¯ back into Eqs. (7-46) produces n simultaneous equations for the n coefficients. Equation (7-47) is referred to as the secular equation, and the determinant on the left-hand side is called the secular determinant. This method is best illustrated by example, and we will now proceed with the problem of a hydrogen atom in a z-directed uniform electric field of strength F a.u. As mentioned earlier, a suitable choice of functions to mix together to approximate the accurate wavefunction is the 1s and 2p, hydrogenlike functions. The choice of two basis functions leads to a 2×2 secular determinantal equation:  H11 -E¯S11 H12 -E¯S12 H21 -E¯S21 H22 -E¯S22  =0 (7-48) If we arbitrarily associate the 1s function with the index 1 and the 2pz function with index 2 (consistent with . = c11s + c22pz ), then the terms in the determinant are (returning to general complex conjugate notation): H11 = 1s*Hˆ 1s dt H12 = 1s*Hˆ 2pz dt H21 = 2p*z ˆ H1s dt H22 = 2p*z Hˆ 2pz dt (7-49) (The electron label has been omitted since there is only one electron.) The corresponding S integrals are obtained from these if Hˆ is omitted in each case. The hamiltonian operator is just that for a hydrogen atom with an additional term to account for the z-directed field: Hˆ =- 1 2.2 -(1/r)-Fr cos . (7-50) [The energy of a charge -e in a uniform electric field of strength F and direction z is -eF z. In atomic units, one unit of field strength is e/a2 0 =5.142×1011 V/m. The unit of charge in atomic units is e, so this symbol does not appear explicitly in Eq. (7-50). Also, the identity z=r cos . has been used.] This may also be written Hˆ =Hˆhyd -Fr cos . (7-51) where Hˆhyd is the hamiltonian for the unperturbed hydrogen atom. The secular determinant contains four H-type and four S-type terms. However, evaluating these eight integrals turns out to be much easier than one might expect. In the first place, S12 = S21 since these integrals differ only in the order of the two functions in the integrand, and the functions commute. Also, because Hˆ is hermitian, it follows immediately that H21 =H* 12. This leaves us with three S terms and three H terms to evaluate. The three S terms are simple. Because the hydrogenlike functions are orthonormal, S11 and S22 equal unity, and S12 vanishes. These points have already reduced the secular determinantal equation to  H11 -E¯ H12 H1*2 H22 -E¯  =0 (7-52) Section 7-4 Linear Variation: The Polarizability of the Hydrogen Atom 201 Consider next the term H11. This may be written as H11 = 1s*Hˆhyd1s dt - 1s* (Fr cos .) 1s dt (7-53) But the 1s function is an eigenfunction of Hˆhyd with eigenvalue -1 2 a.u. Therefore, the first integral on the right-hand side of Eq. (7-53) is 1s*Hˆhyd1s dt =- 1 2 1s*1s dt =- 1 2 a.u. (7-54) The second term on the right-hand side of Eq. (7-53) is zero by symmetry since 1s*1s is symmetric for reflection in the xy plane while r cos .(=z) is antisymmetric. Thus, H11=-1 2 a.u. Similarly, H22 =-18 a.u. [Recall that the 8 eigenvalues of hydrogen are equal to -1/(2n2), and here n=2.] All that remains is H12: H12 = 1s*Hˆhyd2pz dt -F 1s* (r cos .) 2pz dt (7-55) The first term on the right-hand side is easily shown to be zero: 1s*Hˆhyd2pz dt = 1s* - 1 8 2pz dt =0 (7-56) Here we employ the fact that 2pz is an eigenfunction of Hˆhyd and then the fact that 1s and 2pz AOs are orthogonal. The second term on the right-hand side must be written out in full and integrated by “brute force.” It is remarkable that, of the eight terms originally considered, only one needs to be done by detailed integration. Proceeding, we substitute formulas for 1s and 2pz in this last integral to obtain -F (p)-1/2 exp(-r)r cos .(32p)-1/2r exp(-r/2) cos .(r2 sin .) dr d. df (7-57) We consider the integration over spin to have been carried out already, giving a factor of unity. Integrating over f to obtain 2p, and regrouping terms gives -2pF/(4v2p) 8 0 r4 exp(-3r/2)dr p 0 cos2 . sin. d. (7-58) =-F/2v2 4! 3 2  5  2 3 =- 215/2F 35 a.u. (7-59) This completes the task of evaluating the terms in the secular determinant.2 The final result is (in atomic units)  -1 2 -E¯ -215/2(F/35) -215/2(F/35) -18 -E¯  =0 (7-60) 2Since H12 is real, it is clear that H21 =H12. 202 Chapter 7 The Variation Method which expands to 1 16 +(5E¯/8)+E¯2 -215F2/310 =0 (7-61) This is quadratic in E¯ having roots E¯ =- 5 16 ±  9 64 +217F2/310 1/2 2 (7-62) When no external field is present, F = 0 and the roots are just -1 2 and -18 a.u., the 1s and 2pz eigenvalues for the unperturbed hydrogen atom. As F increases from zero, the roots change, as indicated in Fig. 7-3. We see that, for a given field strength, there are only two values of E¯ that will cause the determinant to vanish. If either of these two values of E¯ is substituted into the simultaneous equations related to Eq. (7-52), then nontrivial values for c1 and c2 can be found. Thus, at F =0.1 a.u., E¯ =-0.51425 a.u., and E¯ = -0.1107 a.u. are values of E¯ for which .E¯/.c1, and .E¯/.c2 both vanish. The former is the minimum, the latter the maximum in the curve of E¯ versus c1 (Fig. 7-4). (The normality requirement results in the two c’s being dependent, and so E¯ may be plotted against either of them.) Since the variation principle tells us that E¯ = Elowest exact, we can say immediately that the energy of the hydrogen atom in a uniform electric field of 0.1 a.u. is -0.51425 a.u. or lower. That is, -0.51425 a.u. is an upper bound to the true energy. Now that we have the value of the lowest E¯1 we can solve for c1 and c2 and obtain the approximate ground state wavefunction. The homogeneous equations related to the determinant in Eq. (7-52) are c1(H11 -E¯)+c2H12 = 0 (7-63) c1H12 +c2 H22 -E¯ = 0 (7-64) Substituting -0.51425 for E¯, and inserting the values for H11,H22, and H12 found earlier gives (when F =0.1 a.u.) 0.01425c1 -0.074493c2 = 0 (7-65) -0.074493c1 +0.38925c2 = 0 (7-66) Equation (7-65) gives c1 =5.2275c2 (7-67) If we substitute this expression for c1 into Eq. (7-66) we get -0.3892c2 +0.3892c2 =0 (7-68) This is useless for evaluating c2. It is one of the properties of such a set of homogeneous equations that the last unused equation is useless for determining coefficients. This arises because an equation like (7-63) still equals zero when c1 and c2 are both multiplied by the same arbitrary constant. Therefore, these equations are inherently capable of telling us the ratio of c1 and c2 only, and not their absolute values. We shall determine Section 7-4 Linear Variation: The Polarizability of the Hydrogen Atom 203 Figure 7-3  Average energies for a hydrogen atom in a uniform electric field of strength F as given by a linear variation calculation using a 1s, 2pz basis. (- - -) Results from accurate calculations. absolute values by invoking the requirement that . be normalized. In this case, this means that c2 1 +c2 2 =1 (7-69) or (5.2275c2)2 +c2 2 =1 (7-70) which gives c2=±0.18789 (7-71) If we arbitrarily choose the positive root for c2, it follows from Eq. (7-67) that c1 =0.98219. 204 Chapter 7 The Variation Method Figure 7-4  E¯ versus c1 for a hydrogen atom in a uniform electric field of strength 0.1 a.u. Thus, when F =0.1 a.u., the linear variation method using a 1s, 2pz basis set gives an upper bound to the energy of -0.51425 a.u. and a corresponding approximate wavefunction of . =0.98219 1s+0.18789 2pz (7-72) As mentioned earlier, the admixture of 2pz with 1s produces the skewed charge distribution shown in Fig. 7-2. It is important to note that the extent of mixing between 1s and 2pz depends partly on the size of the off-diagonal determinantal element H12. When H12 is zero (no external field), no mixing occurs. As H12 increases, mixing increases. Generally speaking, the larger the size of the off-diagonal element connecting two basis functions in the secular determinant, the greater the degree of mixing of these basis functions in the final solution, other factors being equal. Hij is sometimes referred to as the interaction element between basis functions i and j . If we carried through the same procedure using the maximum E¯ of -0.1107 a.u., we would obtain the approximate wavefunction . =0.98219 2pz -0.18789 1s (7-73) This wavefunction is orthogonal to .. It may be proved that the second root, E¯ = -0.1107 is an upper bound for the energy of the second-lowest state of the Section 7-4 Linear Variation: The Polarizability of the Hydrogen Atom 205 hydrogen atom in the field. However, . is probably not too good an approximation to the exact wavefunction for that state. This is partly because the wavefunction . is one that maximizes E¯. Hence, there is no particular tendency for the procedure to isolate the second-lowest state from the infinite manifold of states. Also, our basis set was chosen with an eye toward its appropriateness for approximating the lowest state. The true second-lowest state might be expected to contain significant amounts of the 2s AO, which is not included in this basis. The values of E¯ versus c1 are plotted in Fig. 7-4. The values of E¯ that we obtained by the variation procedure correspond to the extrema in this figure. The low-energy extreme corresponds to a wavefunction that shifts negative charge in the direction it is attracted by the field. The high-energy extreme corresponds to a wavefunction that shifts charge in the opposite direction. (When a calculation is performed over a more extensive basis to produce more than two roots, the highest and lowest roots correspond, respectively, to the maximum and minimum, the other roots to saddle points, on the energy hypersurface.) The detailed treatment just completed is rather involved, and so we now summarize the main points. Step 1 involved selection of a basis set of functions which is capable of approximating the exact solution. Step 2 was the construction of the secular determinant, including evaluation of all the Hij - and Sij -type integrals. Step 3 was the conversion of the determinantal equation into its equivalent equation in powers of E¯ and solution for the roots E¯. Step 4 was the substitution of an E¯ of interest into the simultaneous equations that are related to the secular determinant and solution for c1/c2. Finally, we used the normality requirement to arrive at convenient values for c1 and c2. There are many ways one could increase the flexibility of the trial function in an effort to increase the accuracy of the calculation. By adding additional basis functions, one would stay within the linear variation framework, merely increasing the size of the secular determinant. If these additional basis functions are of appropriate symmetries, they will cause the minimum energy root to be lowered further and will mix into the corresponding wavefunction to make it a better approximation to the lowest-energy eigenfunction for the system. Also, the additional functions will increase the number of roots E¯, thereby providing upper bounds for the energies of the third-, fourth-, etc. lowest states of the system. Another possibility is to allow nonlinear variation of the 1s and 2pz orbital exponents, in combination with linear variation. This would be more involved than the calculation we have shown here, but could easily be accomplished with the aid of a computer. EXAMPLE 7-1 Suppose the variational process just described were performed using.(c1, c2, c3)=c11s+c22p0+c33d0. Would all three of theseAOs be present in the lowest-energy solution? SOLUTION  We already know that 1s and 2p0 will be mixed. They differ in reflection symmetry in just the right manner to provide a wavefunction skewed in the z direction, and this is manifested as a nonzero interaction element H12. 3d0, however, is, like 1s, symmetric for reflection through the x, y plane. Therefore, it cannot skew 1s in the z direction, and its interaction element with 1s, H13, equals zero (by symmetry). Therefore, it is tempting to think that that 3d0 will not contribute. However, because 3d0 and 2p0 have opposite reflection symmetries through the x,y 206 Chapter 7 The Variation Method plane, they do interact, H23 is not zero, and so all three basis functions show up in the lowest-energy solution. 1s and 3d0 are indirectly linked because they are each directly linked to 2p0. Physically, one can argue that a 2p0 AO that has been polarized by admixture with 3d0 can better polarize the 1s AO than can the pure 2p0 AO. (One could say that 3d0 is brought in “on the coat-tails” of 2p0.) Note also that 1s has zero angular momentum along the z axis before the electric field is turned on. The z-directed field distorts the ground state, but has no effect on the z component of angular momentum. Therefore, only eigenfunctions having ml =0 can contribute to the polarized ground state.  7-5 Linear Combination of Atomic Orbitals: The H+2 Molecule–Ion We are now ready to consider using the linear variation method on molecular systems. We begin with the simplest case, H+2 . This molecule–ion has enough symmetry so that we could guess many important features of the solution without calculation. However, to demonstrate the method, we shall first simply plunge ahead mathematically, and discuss symmetry later. The H+2 system consists of two protons separated by a variable distance R, and a single electron (see Fig. 7-5). The hamiltonian for this molecule is, in atomic units Hˆ (rA, rB, r1)=- 1 2  .2 1 + .2 A 1836 + .2 B 1836  - 1 rA1 - 1 rB1 + 1 R  (7-74) Since all the particles in the system are capable of motion, the exact eigenfunction of Hˆ will be a function of the coordinates of the electron and the protons. However, the protons are each 1836 times as heavy as the electron, and in states of chemical interest their velocity is much smaller than that of the electron. This means that, to a very good degree of approximation, the electron can respond instantly to changes in internuclear separation. In other words, whenever the nuclei are separated by a given distance R, no matter how they got there, the motion of the electron will always be described in the same way by .. This means that we can separate the electronic and nuclear motions with little loss of accuracy.3 Given an internuclear separation, we can solve for the electronic wavefunction by ignoring nuclear motion [i.e., omitting .2 A and Figure 7-5  The H+2 molecule–ion. A and B are protons. The electron is numbered “1.” 3However, there are times when coupling of electronic and nuclear motions becomes important. Section 7-5 Linear Combination of Atomic Orbitals: The H+2 Molecule–Ion 207 .2 B in Eq. (7-74)]. This gives us a hamiltonian for the electronic energy and nuclear repulsion energy of the system: Hˆ (rA, rB, r1)=- 1 2.2 1 - 1 rA1 - 1 rB1 + 1 R  (7-75) For a given internuclear separation R, the internuclear repulsion 1/R is a constant, and we can omit it and merely add it on again after we have found the electronic energy. If we let Hˆelec stand for the first three terms on the right-hand side of Eq. (7-75), we can write Hˆelec.elec (r1)=Eelec.elec (r1) (7-76) and Eelec +Enuc rep =Eelec +1/R (7-77) Solving Eq. (7-76) for Eelec for every value of R allows us to plot the electronic and also the total energy of the system as a function of R. But Eelec(R)+1/R is just the potential energy for nuclear motion. Therefore, this quantity can be inserted as the potential in the hamiltonian operator for nuclear motion: Hˆnuc(rA, rB)=- 1 2  .2 A 1836 + .2 B 1836  +Eelec(R)+1/R (7-78) Hˆnuc(rA, rB).nuc(rA, rB)=Enuc.nuc (rA, rB) (7-79) The eigenfunctions of Eq. (7-79) describe the translational, vibrational, and rotational states of the molecule. Note that the eigenvalues of the hamiltonian for nuclear motion are total energies for the system because they contain the electronic energy in their potential parts. Hence, Etot =Enuc (7-80) But .tot(rA, rB, r1)=.elec(r1).nuc (rA, rB) (7-81) This approximation—that the electronicwavefunction depends only on the positions of nuclei and not on their momenta—is called the Born–Oppenheimer approximation.4 Only to the extent that this approximation holds true is it valid, for example, to separate electronic and vibrational wavefunctions and treat various vibrational states as a subset existing in conjunction with a given electronic state. We will assume the Born–Oppenheimer approximation to be valid in all cases treated in this book. Making the Born–Oppenheimer approximation for H+2 , we seek to solve for the electronic eigenfunctions and eigenvalues with the nuclei fixed at various separation distances R. We already know these solutions for the two extremes of R. When the two 4It is analogous to the concept of reversibility in thermodynamics: The piston moves so slowly in the cylinder that the gas can always maintain equilibrium, so pressure depends only on piston position; the nuclei move so slowly in a molecule that the electrons can always maintain their optimum motion at each R, so electronic energy depends only on nuclear position. 208 Chapter 7 The Variation Method nuclei are widely separated, the lowest-energy state is a 1s hydrogen atom and a distant proton. (Since there is a choice about which nucleus is “the distant proton,” there are really two degenerate lowest-energy states.) When R = 0, the system becomes He+ These two extremes are commonly referred to as the separated-atom and united-atom limits, respectively. In carrying out a linear variation calculation on H+2 , our first problem is choice of basis. In the separated-atom limit, the appropriate basis for the ground state would be a 1s atomic orbital (AO) on each proton. Then, regardless of which nucleus the electron resided at, the basis could describe the wavefunction correctly. The appropriate basis at the united-atom limit is a hydrogenlike 1s wavefunction with Z=2. At intermediate values of R, choice of an appropriate basis is less obvious. One possible choice is a large number of hydrogenlike orbitals or, alternatively, Stater-type orbitals (STOs), all centered at the molecular midpoint. Such a basis is capable of approximating the exact wavefunction to a high degree of accuracy, provided a sufficiently large number of basis functions is used.5 Calculations using such a basis are called single-center expansions. A different basis, and one that is much more popular among chemists, is the separated-atom basis—a 1s hydrogen AO centered on each nucleus. At finite values of R, this basis can produce only an approximation to the true wavefunction. One way to improve this approximation is to allow additionalAOs on each nucleus, 2s, 2p, 3s, etc., thereby increasing the mathematical flexibility of the basis. If we restrict our basis toAOs that are occupied in the separated-atom limit ground state (the 1sAOs in this case), then we are performing what is called a minimal basis set calculation. For now, we will use a minimal basis set. The wavefunction that we produce by linear variation will extend over the whole H+2 molecule, and its square will tell us how the electron density is distributed in the molecule. Hence, the one-electron molecularwavefunction is referred to as a molecular orbital (MO) just as the one-electron atomic wavefunction is referred to as an atomic orbital (AO). With a basis set of the type we have selected, the MOs are expressed as linear combinations of AOs. For this reason, this kind of calculation is referred to as a minimal basis set linear combination of atomic orbitals–molecular orbital (LCAO– MO) calculation. Our second problem, now that we have selected a basis, is construction of the secular determinant. Since we have only two basis functions (1sA, 1sB), we expect a 2 × 2 determinant:  HAA -E¯SAA HAB -E¯SAB HBA -E¯SBA HBB -E¯SBB  =0 (7-82) Here we have used the notation developed earlier, where HAB = 1s*A (1)Hˆelec (1) 1sB (1) dt (1) (7-83) etc. E¯ of Eq. (7-82) is the average value of the energy. Henceforth, the bar is omitted. If we take our basis functions to be normalized, SAA = SBB = 1. Since our basis functions and hamiltonian are all real, their integrals will be real. Therefore, SAB =SBA 5The hydrogenlike orbitals are a complete set if the continuum functions are included. Hence, this set can allow one to approach arbitrarily close to the exact eigenfunction and eigenvalue. The Slater-type orbitals do not constitute a complete set. Section 7-5 Linear Combination of Atomic Orbitals: The H+2 Molecule–Ion 209 and HAB = H*BA = HBA. Since the hamiltonian is invariant to an interchange of the labels A and B, it follows that HAA =HBB. (HAA is the energy of an electron when it is in a 1s AO on one side of the molecule, HBB when it is on the other side.) This leaves us with  HAA -E HAB -ESAB HAB -ESAB HAA -E  =0 (7-84) For Eq. (7-84) to be satisfied, the product of the diagonal terms must equal that of the off-diagonal terms, which means that HAA -E=±(HAB -ESAB) (7-85) This gives two values for E: E± = HAA ±HAB 1±SAB (7-86) To arrive at numerical values for E+ and E- requires that we choose a value for R and explicitly evaluate HAA, HAB, and SAB. We know in advance that SAB increases monotonically from zero at R=8 to unity at R = 0 because 1sA and 1sB are each normalized and everywhere positive. HAA is the average energy of an electron in a 1s AO on nucleus A, subject also to an attraction by nucleus B. Hence, HAA should be lower than the energy of the isolated H atom (-1 2 a.u.) whenever R is finite. HAB is easily expanded to HAB = 1sA - 1 2.2 -1/rB1sBdv + 1sA(-1/rA)1sB dv (7-87) The operator in the first integrand is simply the hamiltonian operator for a hydrogen atom centered at nucleus B. This operator operates on 1sB to give -1 2 1sB. Hence, the first integral becomes simply -1 2SAB. The second integral in Eq. (7-87) gives the attraction between a nucleus and the “overlap charge.” Thus, HAB is zero at R=8and negative for finite R. The formulas for these terms are (after nontrivial mathematical evaluation) SAB = 1sA1sB dv =exp(-R)1+R +R2/3 (7-88) HAA = 1sAHˆelec1sA dv =- 1 2 -(1/R) 1-e-2R (1+R) (7-89) HAB = 1sAHˆelec1sB dv =-SAB/2-e-R (1+R) (7-90) When R =2 a.u., SAB =0.586, HAA=-0.972 a.u. and HAB =-0.699 a.u. Inserting these values into Eq. (7-86) gives E+ =-1.054 a.u. and E- =-0.661 a.u. These are electronic energies. Internuclear repulsion energy (+1 2 a.u.) can be added to these values to give -0.554 a.u. and -0.161 a.u., respectively. The ways in which HAA, HAB, SAB, and 1/R contribute to the energy are illustrated for R=2 a.u. in Fig. 7-6. HAA is lower than the energy of an isolated H atom because 210 Chapter 7 The Variation Method Figure 7-6  Contributions to the energy of H+2 at R =2 a.u. in a minimal basis LCAO–MO calculation. the electron experiences additional nuclear attraction at R =2. The effect of the HAB interaction element is to split the energy into two levels equally spaced above and below HAA. The SAB term has the effect of partially negating this splitting. The internuclear repulsion energy 1/R merely raises each level by 1 2 a.u. The lower energy, E+ +1/R, has a final value that is lower than the separated-atom energy of -1 2 a.u. Since the exact energy at R=2 must be as low or lower than our value of-0.554 a.u., we can conclude that the H+2 molecule–ion has a state that is stable, with respect to dissociation into H +H+, by at least 0.054 a.u., or 1.47 eV, or 33.9 kcal/mole, neglecting vibrational energy effects. The data depicted in Fig. 7-6 are sometimes presented in the abbreviated form of Fig. 7-7. The energy levels for the pertinentAOs of the separated atoms are indicated on the left and right, and the final energies (either electronic or electronic plus internuclear) are shown in the center. Section 7-5 Linear Combination of Atomic Orbitals: The H+2 Molecule–Ion 211 Figure 7-7  Separated atom energies and energies at an intermediate R for H+2 . Figure 7-8  E± + 1/R versus R for H+2 . (—) Calculation described in text. (- - -) Exact calculation. The behavior of these energies as a function of R is plotted in Fig. 7-8. Included for comparison are the exact energies for the two lowest-energy states of H+2 . Only the lower of these shows stability with respect to molecular dissociation. Both energy levels approach infinity asymptotically as R approaches zero because of internuclear repulsion. (The zero of energy corresponds to complete separation of the protons and electron.) Having found the roots E± for the secular determinant, we can now solve for the coefficients which describe the approximate wavefunctions in terms of our basis set. Let us first find the approximatewavefunction corresponding to the lower energy. To do this, we substitute the expression for E+ [Eq. (7-86)] into the simultaneous equations associated with the secular determinant (7-84): cA(HAA -E+)+cB(HAB -E+SAB) = 0 (7-91) cA(HAB -E+SAB)+cB(HAA -E+) = 0 (7-92) 212 Chapter 7 The Variation Method Equation (7-91) leads to cA HAA -(HAA +HAB)/ 1+SAB =-cB HAB -(HAA +HAB)SAB/ 1+SAB  (7-93) which ultimately gives cA =cB (7-94) The same procedure for E- produces the result cA=-cB (7-95) The normality requirement is (7-96) so that c2 A +c2 B +2cAcBSAB =1 (7-97) For cA =cB, this gives cA =1/ [2 (1+SAB)]1/2 =cB (7-98) For cA=-cB, cA =1/ [2 (1-SAB)]1/2=-cB (7-99) Therefore, the LCAO–MO wavefunction corresponding to the lower energy E+ is .+ = 1 v2(1+SAB) (1sA +1sB) (7-100) The higher-energy solution is .- = 1 v2(1-SAB) (1sA -1sB) (7-101) Just aswas true forAOs, there are a number ofways to display these MOs pictorially. One possibility is to plot the value of .± or .2± along a ray passing through both nuclei. (Other rays could also be chosen if we were especially interested in other regions.) Another approach is to plot contours of . or .2 on a plane containing the internuclear axis, Still another way is to sketch a three dimensional view of a surface of constant value of . or .2 containing about 90–95% of the wavefunction or the electron charge. Finally, one can plot the value of . as distance above or below the plane containing the internuclear axis. All of these schemes are shown in Fig. 7-9 for .+ and .-. Section 7-5 Linear Combination of Atomic Orbitals: The H+2 Molecule–Ion 213 Figure 7-9  (a) Plot of .+, along the z axis. [Eq. (7-100)]. (b) Plot of .- along the z axis [Eq. (7-101)]. The wavefunctions .+ and .- may be seen from Fig. 7-9 to be, respectively, symmetric and antisymmetric for inversion through the molecular midpoint. [They are commonly called gerade (German for even) and ungerade, respectively.] This would be expected for nondegenerate eigenfunctions of the hamiltonian, since it is invariant to inversion. But .+ and .- are not eigenfunctions. They are only approximations to eigenfunctions. How is it that they show the proper symmetry characteristics of eigenfunctions? The reason is that the symmetry of the H+2 molecule is manifested as a symmetry in our secular determinant of Eq. (7-84). Note that the determinant is symmetric for reflection across either diagonal. The symmetry for reflection through the principal diagonal (which runs from upper left to lower right) is due to the hermiticity of Hˆ , and is always present in the secular determinant for any molecule regardless of symmetry.6 Symmetry for reflection through the other diagonal is due to the fact that the hamiltonian is invariant to inversion and also to the fact that the basis functions at 6Hermiticity requires that Hij =Hji *. If Hij is imaginary, the determinant will be antisymmetric for reflection through the principal diagonal. 214 Chapter 7 The Variation Method Figure 7-9  (Continued) (c) Contour diagram of .+. (d) Contour diagram of .-. the two ends of the molecule are identical. When theAOs in a basis set are interchanged (times±1) by a symmetry operation of the molecule, the basis set is said to be balanced for that operation. Thus, a 1sA and 1sB basis is balanced for inversion in H+2 , but a 1sA and 2sB basis is not. Whenever a symmetry-balanced basis is used for a molecule, the symmetry of the molecule is manifested in the secular determinant and ultimately leads to approximate MOs that show the proper symmetry characteristics. Since the eigenfunctions for H+2 must be gerade or ungerade, and since we started with the simple balanced basis 1sA and 1sB, it should be evident that it is unnecessary to go through the linear variation procedure for this case. With such a simple basis set, there is only one possible gerade linear combination of AOs, namely 1sA +1sB. Similarly, 1sA -1sB is the only possible ungerade combination. Therefore, we could have used symmetry to guess our solutions at the outset. Usually, however, we are not so limited in our basis. We shall see that, while symmetry is useful in such circumstances, it does not suffice to produce the variationally best solution. Section 7-5 Linear Combination of Atomic Orbitals: The H+2 Molecule–Ion 215 Figure 7-9  (Continued) (e) Three-dimensional sketch of contour envelope for .+ and (f) for .-. According to the theorem mentioned earlier (but not proved), the nth lowest root of a linear variation calculation for a state function must lie above the nth lowest exact eigenvalue for the system. However, the two states we are dealing with have different symmetries. In such a case, a more powerful boundedness theorem holds— one that holds even if we are not using a linear variation procedure. To prove this, we first recognize that every H+2 eigenfunction is either gerade (i.e., symmetric for inversion) or ungerade. Since the lowest-energy approximate wavefunction .+ is gerade, it must be expressible as a linear combination of these gerade eigenfunctions. Hence, its average energy E+ cannot be lower than the lowest eigenvalue for the gerade eigenfunctions. Similarly, E- cannot be lower than the lowest eigenvalue for the ungerade eigenfunctions, and so we have a separate lower bound for the average energy of trial functions of each symmetry type. For this reason, our approximate energies in Fig. 7-8 must lie above the exact energies for both states. If we were to make further efforts to lower the average energy of the ungerade function, even by going outside the linear variation procedure, we could never fall below the exact energy for the lowest-energy ungerade state unless we somehow allowed our trial function to change symmetry. This means that a lowest average energy criterion can be used in attempting to find the lowest-energy state of each symmetry type for a system, by either linear or nonlinear variation methods. 216 Chapter 7 The Variation Method Figure 7-9  (Continued) (g)Value of .+ and (h) of .- versus position in plane containing the nuclei at points a and b. EXAMPLE 7-2 Suppose one were to do a variational calculation on the hydrogen atom using the unnormalized trialwavefunction f=exp(-ar2) cos ., with a being varied. What lower bound could we expect for the average energy? SOLUTION  This function has a node in the x, y plane (where . =p/2), and this nodal plane exists regardless of the value of a. Therefore, our trial function is like the 2p0 AO in symmetry. It will be represented by a linear combination of p0 AOs–a combination that changes as a is varied. The average value of energy cannot be lower than the lowest eigenvalue in the set, which is-1/8 a.u. (for the 2p0 AO).  Inspection of Fig. 7-9 shows that the charge distribution for the state described by .+ is augmented at the molecular midpoint compared to the charge due to unperturbed Section 7-5 Linear Combination of Atomic Orbitals: The H+2 Molecule–Ion 217 atoms. This state is also the one that gives H+2 stability at finite R. Because this MO puts electronic charge into the bond region and stabilizes the molecule, it is commonly called a bonding MO. In the state described by .-, charge is shifted out of the bond region and the molecule is unstable at finite R, and so .- is called an antibonding MO. Because the potential in H+2 (or in any linear molecule) is independent of f, the angle about the internuclear axis, the f dependence of the wavefunctions is always of the form (f)=(1/v2p)exp(imf), m=0,±1,±2, . . . (7-102) This fact may be arrived at in two ways. One way is to write down the Schr¨odinger equation for H+2 using spherical polar coordinates or elliptical coordinates. (f is a coordinate in each of these coordinate systems.) Then one attempts to separate coordinates and finds that the f coordinate is indeed separable from the others and yields the equation d2 (f) df2 =-m2 (f) (7-103) The acceptable solutions of this are the functions (7-102). The other approach is to note that, since the potential in Hˆ has no f dependence, Hˆ commutes with Lˆz , the angular momentumoperator, so that the eigenfunctions ofHˆ are simultaneously eigenfunctions of Lˆz .We know the eigenfunctions of Lˆz are the functions (7-102), and thus, we know that these functions must also give the f dependence of the eigenfunctions of Hˆ . Two important conclusions emerge. First, each nondegenerate H+2 wavefunction must have a f dependence given by one of the functions (7-102). This tells us something about the shapes of the wavefunctions. Second, the nondegenerate H+2 wavefunctions are eigenfunctions of Lˆz , which means that an electron in any one of these states has a definite, unvarying (i.e., sharp) component of angular momentum of value m¯h mks units (m a.u.) along the internuclear axis. It is thus useful to know the m value associated with a given one-electron wavefunction. A standard notation is used, which is analogous to the atomic orbital notation wherein s, p, d, f, correspond to l values of 0, 1, 2, 3, respectively. The corresponding Greek lower-case letters s, p, d, f indicate values of |m| of 0, 1, 2, 3, respectively, in one-electron orbitals of linear molecules. Thus, for the case at hand, .+ and .- are both s MOs because they are cylindrically symmetrical (i.e., no f dependence), which requires that m be zero. Because .+ is a gerade function, it is symbolized sg. .-, then, is a su MO. (A simple way to determine whether anMOis s, p, or d is to imagine viewing it (in its real form) end-on (i.e., along a projection of the internuclear axis.) If the MO from this view looks like an s AO, it is a s MO. If it looks like a p AO, it is a p MO. A d MO has the four-leaf-clover appearance of a d AO.) Let us nowexamine the dependence of our LCAO–MOresults on our original choice of basis. The 1s AOs we have used are capable of giving the exact energy when R=8, but become increasingly inadequate as R decreases. As a result, Fig. 7-8 shows that, for both states, the approximate energy deviates more and more from the exact energy as R decreases. At R =0, our sg function becomes a single 1s H atom (Z = 1) AO centered on a doubly positive nucleus. Yet we know that the lowestenergy eigenfunction for that situation is a single 1s He+(Z = 2) AO. An obvious way to improve our wavefunction, then, is to allow the 1s basis functions to change 218 Chapter 7 The Variation Method their orbital exponents as R changes. This adds a nonlinear variation, and the calculation is more complicated. It is very easily performed with the aid of a computer, however, and we summarize the results in Figs. 7-10 and 7-11. The internuclear repulsion has been omitted from the energies in Fig. 7-10. The lowest approximate energy curve is now in perfect agreement with the exact electronic energy both at R =0 and R=8, and shows improved, though still not perfect, agreement at intermediate R. The R dependence of the orbital exponent for this wavefunction (Fig. 7-11) varies smoothly from 1 at R=8 to 2 at R = 0, as expected. In contrast, the su function fails to reach a well-defined energy at R = 0 because the function becomes indeterminate at that point. [At R = 0, 1sA - 1sB becomes (1sA - 1sA) = 0.] However, Figs. 7-10 and 7-11 indicate that the exact energy E- is -1 2 a.u. at R = 0, corresponding to the n = 2 level of He+, and that the orbital exponent in our 1s basis functions approaches 0.4 in the effort to approximate this state function at small R. To understand this behavior, we must once again consider the symmetries of these functions. We have already seen that the nondegenerate eigenfunctions of H+2 must be either gerade or ungerade, and we note in Fig. 7-10 that the energy curve for the gerade state is continuous as is the one for the ungerade state. In other words, as we move along a given curve, we are always referring to a wavefunction of the same symmetry. This continuity of symmetry along an energy curve is central to many applications of quantum chemistry. The reason for continuity of symmetry can be seen by considering a molecule having some element of symmetry, and having a nondegeneratewavefunction or molecular orbital (which must be symmetric or antisymmetric with respect to the symmetry operations of the molecule). If we change the molecule infinitesimally (without destroying its symmetry), we expect thewavefunction to change infinitesimally also. Figure 7-10  E+ and E- for H+2 from (- - -) exact, (···) variable . , and (—) fixed . (. =1) treatments. Section 7-5 Linear Combination of Atomic Orbitals: The H+2 Molecule–Ion 219 Figure 7-11  Values of . minimizing E+ and E- as a function of R. In particular it should not change symmetry because this is generally not an infinitesimal change. (To change symmetry requires adding or removing nodes, changing signs in parts of the function. Such changes have more than infinitesimal effects on . and on kinetic and potential parts of the energy.) The entire curve can be traversed by an infinite number of such infinitesimal but symmetry conserving steps. The continuity of symmetry enables the sg state of H+2 to correlate with an s-type AO of He+ as R goes to zero. This correlation is symmetry allowed because the s-type AOs of He+ have the proper symmetry characteristics—they are gerade and have no dependence on angle about the axis that is the internuclear axis when R>0 (see Fig. 7-12). In contrast, the su MO cannot correlate with an s-type AO. It must correlate with an AO that is cylindrically symmetrical about the old internuclear axis but is antisymmetric for inversion. A p-type AO pointing along the old internuclear axis (called a ps AO) satisfies these requirements. Figure 7-12  Sketches demonstrating howseparated atom functions can be related to united-atom functions through symmetry invariance. These functions are not drawn to a common scale. 220 Chapter 7 The Variation Method These symmetry requirements help us understand the su curves of Figs. 7-10 and 7-11. The exact energy for E- goes to -1 2 a.u. at R =0 because the su state of H+2 correlates with a 2p AO of He+. Our basis set is incapable of reproducing a 2p AO at R =0, and so the calculated energy curve fails to rejoin the exact curve at R =0. At small R, our basis set is attempting to approximate the two lobes of an evolving 2p AO. Apparently the orbital exponent that best enables the basis to accomplish this is around 0.4. 7-6 Molecular Orbitals of Homonuclear Diatomic Molecules We have already seen how one produces the ground configurations for manyelectron atoms by placing pairs of electrons of opposite spin into AOs, starting with the lowest-energy AO and working up. Subsequent manipulation of this product function to produce proper space and spin symmetry yields the approximate wavefunction. Precisely the same procedure is used for molecules. Thus, the electronic configurations for H+2 , H2, and H-2 are 1sg, (1sg)2, and (1sg)21su, respectively, and the approximate wavefunction for H2 is provided by the Slater determinant  1sg (1) 1 ¯sg (2) . When we come to consider heavier homonuclear diatomic molecules, such as O2, we must place electrons in higher energy MOs. Such MOs are still provided by the minimal basis set, which now includes 1s, 2s, and three 2p AOs on each atom since these AOs are occupied in the separated atoms. We now consider the natures of the additional MOs produced by this larger basis. We begin by making a change to a basis set that is mathematically equivalent to the starting set but is more convenient for discussing and analyzing the problem. This new set is the set of symmetry orbitals (SOs) (1sA ±1sB), (2sA ±2sB), etc. From our original ten AOs, we thus produce ten SOs. These may be normalized, if desired. Each of these SOs has definite symmetry. The SOs built from 2sAOs must be of sg and su symmetry since the 2sAOs act like 1sAOs for all the symmetry operations of the molecule. The 2p AOs pointing along the internuclear axis have cylindrical symmetry and hence also give rise to a sg and a su SO. (We will take the internuclear axis to be coincident with the z axis, and so these SOs are constructed from 2p0 (or 2pz )AOs.) The functions (2p+1A ±2p+1B ) are p SOs because |m|=1. Since it is difficult to visualize complex functions, however, the usual practice is to take linear combinations of the complex p functions to produce a corresponding set of real functions. (This is completely analogous to forming real px and py AOs from complex p+1 and p-1 AOs.) Thus, we obtain (2pxA ± 2pxB ) and (2pyA ± 2pyB ), which are not eigenfunctions for the Lˆz operator anymore, but are still given the symbol p. From Fig. 7-13, we see that the positive combinations give ungerade p SOs. This is just the opposite of the case for s-type SOs. We conclude from Fig. 7-13 that sg and pu SOs will tend to place charge into the bond and hence contribute to bonding, whereas su and pg SOs will contribute antibonding character. Even though these SOs are only a basis set, the reader may nevertheless recognize that conversion to this symmetrized basis goes a longway toward producing our ultimate MOs. Indeed, our (1sA±1sB) SOs, if normalized, are the same as the MOs we obtained Section 7-6 Molecular Orbitals of Homonuclear Diatomic Molecules 221 Figure 7-13  Symmetry orbitals constructed from s- and p-type AOs. (a) Sketches according to an idealized convention that ignores overlap between AOs on A and B. (b) Effects of overlap. Note that the psA -psB combination is bonding. This depends on our having chosen a common z axis for both atoms. Sometimes the z axes are chosen to point from each atom toward the other. In that case, psA -psB becomes antibonding. for H+2 . The essential advantage of a symmetrized basis set is that it simplifies the secular determinant and makes it easier to understand and describe the mixing of the basis functions by the hamiltonian. For instance, since our MOs must have pure s, p, d, . . . and also g or u symmetry, and since our SOs are already of pure symmetry, we expect no further mixing to occur between SOs of different symmetry in forming MOs. This suggests that the interaction elementHij between SOs fi and fj of different symmetry should vanish. This is easily proved by noting that Hˆ is symmetric for all symmetry operations of the molecule, and, if fi and fj differ in symmetry for some operation, their product is antisymmetric for that operation, and therefore f*i Hfj is antisymmetric and its integral vanishes. Similarly, Sij vanishes, and Hij - ESij vanishes except in positions connecting basis functions of identical symmetry. As a result, our secular determinant over SOs has the form (7-104), where the notation sg[1s] indicates a sg SO made from 1sA, 1sB AOs, etc. By placing basis functions of like symmetry together, we have emphasized the block-diagonal form of our determinant, all elements in the nonshaded areas being zero by symmetry. (The pg[2px] and pg[2py] do not interact because of symmetry disagreements for reflection in the xz and yz planes.) Each of the nonzero blocks of (7-104) is a separate determinant (which is just a number), and the value of determinant (7-104) is simply the product of these six 222 Chapter 7 The Variation Method smaller determinants. Hence, if any one of these small determinants is zero, the large determinant is zero, thereby satisfying our determinantal equation. Therefore, each of these small determinants may be employed in a separate determinantal equation, and the problem is said to be partitioned into six smaller problems. It follows immediately that we can get mixing among the three sg SOs to produce three sg MOs and likewise for the su set, and that the p SOs can undergo no further mixing and hence are already MOs. (Notice that SOs in one block are not linked by off-diagonal elements either directly or indirectly, unlike the case described in Example (7-1).) (7-104) What will be the nature of the lowest-energy sg MO? It will not be pure sg[1s] because admixture of sg[2s] and sg[2p] SOs can produce charge shifts that will lower the energy. But the sg[2s] and sg[2p] SO energies are much higher than the sg[1s] (mainly because the 2s and 2p AOs are higher in energy in the atoms, and the atomic contributions still dominate in the molecule). Any energy decrease to be gained by charge shifting must be weighed against the energy increase due to the mixing in of such high energy components. The former very quickly become overbalanced by the latter, so the lowest energy sg MO is almost pure sg[1s] SO, the sg[2s] and sg[2p] SOs coming in only very slightly. This exemplifies an important general feature of quantum chemical calculations: mixing between basis orbitals tends to be small if they have widely different energies in the system. Thus, we now have two factors governing the extent of mixing of functions fi and fj—the size of the interaction element Hij , and the difference in energy between them in the system (Hii -Hjj ). A label we can use for this lowest-energyMOthat avoids implying that it is identical to the sg[1s] SO is 1sg. This stands for “the lowest-energy sg MO.” Because of the low energy of the 1s AOs, the next-lowest MO is almost pure su[1s], and we label it 1su. When considering the remaining two sg MOs, we can expect substantial mixing between sg[2s] and sg[2p] SOs because these functions are not very different in energy. In the hydrogen atom, the 2s and 2p AOs are degenerate, and, as we move along the periodic table, they become split farther and farther apart in energy. Therefore, we might expect to find the greatest mixing for B2 and C2, and to find less mixing for O2 and F2. Figure 7-14 is a schematic diagram of the MO energy levels we should expect for F2. Here the 2sg MOis primarily the sg[2s] SO and the 3sg MOis mainly the sg[2p] Section 7-6 Molecular Orbitals of Homonuclear Diatomic Molecules 223 Figure 7-14  Schematic of MO energy level order for F2. The vertical axis is not an accurate energy scale. For instance, the 1s levels are very much lower in energy relative to the other levels than is suggested by the drawing. (- - -) indicates that the orbital is unoccupied in the ground state. SO. Similarly 2su and 3su are mainly su[2s] and su[2p], respectively. The 1p and 3s MOs are degenerate at the separated atom limit, where they are all 2p AOs. As the atoms come together and interact, the p levels split apart less than the s levels because the 2px and 2py AOs approach each other side to side, whereas the 2ps AOs approach end to end. The latter mode produces larger overlap and leads to larger interaction elements and greater splitting. (Because of the symmetry of the molecule, the 1pu MOs are always degenerate, and they may be mixed together in any way. In particular, we can regard them as being 1pux and 1puy or 1pu+1 and 1pu-1 with equal validity. The same situation holds for the 1pg pair.) From the ordering of energy levels in Fig. 7-14 we obtain for F2 the configuration F2 : (1sg)2(1su)2(2sg)2(2su)2(3sg)2(1pu)4(1pg)4 (7-105) Now we will consider what happens for lighter molecules. Recall that here the 2s and 2pAOs are closer together in energy. This allows greater mixing between the SOs containing these AOs and produces increased energy level splitting. Thus, the sg[2s] and sg[2p] SOs mix together more, and the resulting splitting causes an additional lowering in energy of the 2sg level and an increase for the 3sg level, compared to the F2 case. In a similar way the 2su and 3su levels are lowered and raised, respectively, by increased mixing between su[2s] and su[2p] SOs. The resulting energy level pattern for C2 is shown in Fig. 7-15. Note that the extra splitting has pushed the 3sg level above the 1pu level. No other change in the orbital energy ordering has occurred. As a result, C2 has, the configuration 224 Chapter 7 The Variation Method Figure 7-15  Aschematic diagram of theMOenergy level order forC2. The 2s and 2ps symmetry functions mix more strongly in producing 2s and 3s MOs than is the case in F2. The level shifts discussed in the text are indicated by dashed lines and arrows. (-------) represent F2 levels. Note the inversion of the order of the 1pu and 3pg levels. The orbital ordering is deduced from spectra. (See Mulliken [1].) (---) represent energies of orbitals not occupied by electrons. Note that there is no well-defined energy ordinate for this figure, and no accurate relationship between absolute values of orbital energies within a molecule or between molecules is implied. C2 : (1sg)2(1su)2(2sg)2(2su)2(1pu)4 (7-106) The above discussion is an effort to rationalize orbital energies obtained from calculations or deduced from molecular spectra. We have not yet described the details of how one goes about carrying out MO calculations on these molecules, nor will we in this chapter. However, even in the absence of precise numerical results, it is possible for such qualitative arguments to be very useful in understanding and predicting features of a wide range of chemical reactions. The molecular configurations obtained from the patterns of Figs. 7-14 and 7-15 predict molecular properties that are in strikingly good qualitative agreement with experimental observations. The data in Table 7-2 show that, when we go from H+2 to H2, adding a second electron to a bonding MO, the bond length decreases and the dissociation energy increases. Adding a third electron (He+2 ) causes partial occupation of the antibonding 1su MO and causes the bond length to increase and the dissociation energy to decrease. The four electron molecule He2 is not observed as is consistent with its configuration. The relative inertness of N2 as a chemical reactant becomes understandable from the fact that it has six more “bonding electrons” than antibonding electrons, giving a net of three bonds—one s and two p bonds. For N2 to react, it Section 7-6 Molecular Orbitals of Homonuclear Diatomic Molecules 225 TABLE 7-2  Some Properties of Homonuclear Diatomic Molecules and Ions in their Ground Electronic States Molecule MO configuration Net number of bonding electrons Binding energy, De (eV) Equilibrium internuclear separation, Re(Å) Termc H + 2 1sg 1 2.7928 1.06 2 g H2 1sg 2 2 4.747745 0.7414 1 g+ H - 2 1sg 21su 1 1.7a 0.8 2 u + He2+ 1sg 21su 1 2.5 1.08 2 u + He2 1sg 21su 2 0 0.001b 2.88 (1 g+) He2- [He2]2sg 1 No data 2 g+ Li2+ [He2]2sg 1 1.29 3.14 2 g+ Li2 [He2]2sg 2 2 1.05 2.673 1 g+ Li2- [He2]2sg 22su 1 ~1.3(?) 3.2 2 u + Be2+ [He2]2sg 22su 1 No definitive data 2 u + Be2 [He2]2sg 22su 2 0 0.1 2.49 1 g+ Be2- [Be2]1pu 1 ~0.3 2.4 2u B2+ [Be2]1pu 1 1.8 — 2u B2 [Be2]1pu 2(?) 2 ~3 1.589 3 g- B2- [Be2]1pu 3 3 No data 2u C2+ [Be2]1pu 3 3 5.3 1.301 2u C2 [Be2]1pu 4 4 6.36 1.2425 1 g+ C2- [Be2]1pu 43sg 5 8.6 — 2 g+ N2+ [Be2]1pu 43sg 5 8.86 1.116 2 g+ N2 [Be2]1pu 43sg 2 6 9.90 1.098 1 g+ N2- [Be2]1pu 43sg 21pg 5 ~8.3 — 2g O2+ [Be2]1pu 43sg 21pg 5 6.7796 1.1171 2g O2 [Be2]3sg 21pu 41pg 2 4 5.2132 1.2075 3 g- O2- [Be2]3sg 21pu 41pg 3 3 4.14 1.32 2g F2+ [Be2]3sg 21pu 41pg 3 3 3.39 1.32 2g F2 [Be2]3sg 21pu 41pg 4 2 1.65 1.42 1 g+ F2- [Be2]3sg 21pu 41pg 43su 1 ~1.3 1.9 2 u + Ne2+ [Be2]3sg 21pu 41pg 43su 1 ~1.1 1.7 2 u + Ne2 [Be2]3sg 21pu 41pg 43su 2 0 0.003b 3.09 (1 g+) aThis state is unstable with respect to loss of an electron, but is stable with respect to dissociation into an atom and a negative ion. bFrom Hirschfelder et al. [2]. It may be shown that any two neutral atoms will have some range of R where the attractive part of the van der Waals’ interaction dominates. For He2, this minimum is so shallow and the nuclei so light that a stable state (including vibrations) probably cannot exist. For Ne2, a stable state should exist. The data for He2 and Ne2 are calculated from considerations of intermolecular forces. cThe term symbol corresponds to the configuration of column 2. 226 Chapter 7 The Variation Method is necessary to supply enough energy to at least partially break these bonds prior to forming new bonds. The reversal of the orbital energy order between 1pu and 3sg can be seen to occur between N2 and O2. The configurations of Table 7-2 indicate that some molecules will have closed shells, whereas others will have one or more unpaired electrons. For example, O2 has a configuration in which the degenerate 1pg level “contains” two electrons. Several states can be produced from such a configuration, just as several states could be produced from the 1s2s configuration for He. Hund’s first rule has been found to hold for molecules, and so the state of highest multiplicity is lowest in energy. For n unpaired electrons, the highest multiplicity achievable is n+1. For O2, this is three, and we expect the ground state of the O2 molecule to be a triplet. This is conveniently demonstrated to be so by observing that liquid oxygen is attracted into the gap between poles of a magnet, consistent with the existence of uncanceled electron spin magnetic moments. Close perusal of Table 7-2 indicates that some of the data are not in accord with the simple qualitative ideas just presented. For example, Li+2 is more strongly bonded than is Li2, even though the former has fewer bonding electrons. However, the Li+2 ion-molecule is longer than Li2. H-2 is less strongly bound than isoelectronic He+2 , yet it has a shorter equilibrium internuclear separation. Irregularities such as these require more detailed treatment. However, one of the useful characteristics of a qualitative approach is that it enables us to recognize cases that deviate from our expectations and therefore warrant further study. In Figs. 7-14 and 7-15, we saw how MOs are related to SOs for the separated atoms. Let us now consider how the separated atom SOs correlate with the united atomAOs. Recall that these orbitals are correlated by requiring them to be of identical symmetry. In Fig. 7-16 some of the possible SOs and united-atom AOs, together with their symmetry labels, are shown. Note that the sg SOs can correlate with s or ds AOs, su SOs with ps AOs, pu SOs with pp AOs and pg SOs with dp AOs. This gives us all the information we need except for resolving the ambiguities within a given symmetry type. For instance, which of the 1s, 2s, 3s, 3ds , . . . in theAOs correlates with which of the sg[1s], sg[2s], sg[2p], sg[3s], . . . in the SOs? This question is resolved by use of the noncrossing rule, which states that, in correlation diagrams, energy levels associated with orbitals or states of the same symmetry will not cross. This requires that we match up the lowest-energy united-atom AO of a given symmetry with the lowest energy SO of that symmetry, and so on up the ladder. This leads to the diagram in Fig. 7-17. The line interconnecting the 1s AO of the united atom with the 1s AOs of the separated atoms refers, at intermediate R, to the 1sg MO.We have already seen that thisMOmay contain contributions from sg[2s] and sg[2p] SOs. Thus, the correlation diagram tells us what orbitals the 1sg MO “turns into” at the limits of R, but does not imply that, at other R values, this MO is comprised totally of 1s AOs. Study of this correlation diagram reveals that the antibondingMOs(su andpg) are the MOs that correlate with higher energy united-atom AOs and hence favor the separated atoms in terms of energy. This results from the fact that antibonding MOs have a nodal plane bisecting the bond which is preserved as we proceed to the united atom, yielding a united atomAO having one more node than the separatedAOs we started with. Thus, 1ssu goes to p, 2ppg goes to d, etc. The MO is said to be “promoted.” We have now seen several ways to explain the effects of an orbital. We may focus on energies, and note that bonding and antibonding MOs correlate with low-energy and high-energy united-atom orbitals, respectively. Or, as we saw earlier, we can focus Section 7-6 Molecular Orbitals of Homonuclear Diatomic Molecules 227 Figure 7-16  (a) Symmetry orbitals for homonuclear diatomic molecules. (b) United-atom AOs characterized by symmetry with respect to z axis. on charge distributions and their attractions for nuclei, and note that bonding MOs concentrate charge in the bond region, attracting the nuclei together, whereas antibonding MOs shift charge outside the bond, attracting the nuclei apart. Alternatively, we can recognize that bonding MOs result from AOs on each atom coming together in phase, resulting in positive overlap, while antibonding MOs result from opposite phases coming together to give negative overlap. 228 Chapter 7 The Variation Method Figure 7-17  Correlation diagram between separated-atom orbitals and united-atom orbitals for homonuclear diatomic molecules. Energy ordinate and internuclear separation abscissa are only suggestive. No absolute values are implied by the sketch. [Note: For H+2 sg2p correlates with 3dsg, sg3s with 3ssg. This arises because the separability of the H+2 hamiltonian (in the Born– Oppenheimer approximation) leads to an additional quantum number for this molecule. In essence, the H+2 wavefunction in elliptical coordinates may be written . =L(.)M(µ) eimf. The function L may have nodal surfaces of elliptical shape. M may have nodes of hyperbolic shape. In the correlation diagram for H+2 , it is necessary that ellipsoidal nodes correlate with spherical nodes in the united atom, while hyperboloid nodes correlate with hyperboloid nodes (which may be planar). Sketching sg2p and sg3s and comparing them with 3ssg and 3dsg (i.e., 3d3z2-r2 ) makes clear how this “nodal control” results in what, at first sight, appears to be a violation of the noncrossing rule. For H+2 , modifications for higher-energy states will also be required. For example, 4ssg will correlate with sg4s, not sg3p.] Section 7-6 Molecular Orbitals of Homonuclear Diatomic Molecules 229 Three common labeling conventions are used in Fig. 7-17. A level may be labeled with reference to the separated atom AOs to which it correlates. The separated atom AO symbol is placed to the right of the MO symmetry symbol (e.g., sg2s). Note the absence of square brackets, which we used to symbolize the SO (2sA+2sB). The symbol sg2s means “the MO of sg symmetry that correlates with 2s AOs at R=8.” An alternative label indicates the united-atom orbital with which the MO correlates. Here theAO label is placed to the left of the symmetry symbol (e.g., 3psu). The u and g subscripts in the united-atom notation are redundant and are often omitted. However, they are helpful in drawing correlation diagrams. Finally, the MOs may be simply numbered in their energy order within each symmetry type, as mentioned earlier (e.g., 2sg). EXAMPLE 7-3 A symmetry orbital is produced by taking 3dyz, a -3dyz, b, where a and b are points on the z axis. Sketch the situation. What is the sign of the overlap between these AOs? Is this SO bonding or antibonding? What is the symmetry symbol for this SO? SOLUTION  The sketch would show two four-leafed AOs, each like the 3dyz AO shown in Fig. 7-16, except that the phase signs of one would be minus those in the matching lobes of the other. TheseAOs are positioned side by side, and their phases are such that the inner lobes pointing towards each other are in phase. Hence the overlap is positive and the SO is bonding. Looking at this SO along the z axis, it appears like a py AO, so it is p. Inversion causes interchange of lobes of opposite sign, so it has u symmetry. Its symbol, then, is pu.  Term symbols for electronic states of homonuclear diatomic molecules are much like those in atoms. The main symbol gives information about the component of electronic angular momentum along the z axis. If ML =0,±1,±2, etc., then the main symbol is , , , etc. ML is the sum of ml values for the electrons. When identifying ml, we assume that the p MOs are complex, with ml =+1 and -1 rather than being mixed to give real px and py MOs. The main symbol is decorated with a superscript at left, giving spin multiplicity, and a g or u subscript at right for overall inversion symmetry. As an example, C+2 , with configuration 1s2 g 1s2 u 2s2 g 2s2 u 1p3 u has ML=±1 (zero for all s electrons and either +1,+1,-1 or +1,-1,-1 for the three p electrons) so its main symbol is . The inversion symmetry is u because there is an odd number of electrons in ungerade MOs. There is one unpaired electron, so the multiplicity is 2. The term, symbol is 2u (“doublet-pi-you”). terms are given, in addition, a+or-superscript at the right, indicating whether the wavefunction is symmetric or antisymmetric for reflection through a plane containing the nuclei. Such a reflection has the effect of reversing the direction of f: Clockwise motion around the internuclear axis becomes counterclockwise when viewed in a mirror containing that axis. This transforms exp(if) and exp(-if) into each other, so that p+ and p- turn into (minus) each other. To achieve a MO in the first place requires that mz values sum to 0. This happens for occupied equivalent p MOs only if p+ and p- are equally occupied, which means that both have one electron or both are full. The full case must yield 1 +g , i.e., must be symmetric for every operation, because each MO occurs twice in the electron list, and even antisymmetric functions give symmetric results when multiplied by themselves. For the half-filled case there are only two possibilities: p+(1)p-(2)±p-(1)p+(2). The “plus” case is symmetric for electron 230 Chapter 7 The Variation Method exchange, hence goes with the antisymmetric spin function aß -ßa, hence is a singlet state. Reflection causes p+.p-, but this returns the same function, so this case goes with a 1 + term. Similar reasoning shows that the “minus” case goes with a 3 - term. Hund’s rule predicts the latter term to lie below the former in energy. EXAMPLE 7-4 What term symbols represent possible excited states ofN2 produced by promoting an electron from the highest occupied MO to the lowest unoccupied MO? SOLUTION  When an electron is promoted, the possible resulting 1pu, 1pg configurations can be pictured as follows: 1pg 1pu +1 –1 +1 –1 +1 –1 +1 –1 The excited molecule has two half-filled MOs, for which the electron spins can be opposite (singlet) or the same (triplet). The sum of electron orbital momenta will remain zero (giving if the electron is excited fromml =-1 to-1 or from+1 to+1, but will change to+2 or-2 (giving) if changed from -1 to +1 or from +1 to -1. In all cases, the excited state has an odd number of electrons in the MOs of ungerade symmetry, so all terms will have a u subscript. The term symbols, then, are, respectively 1 u, 3 u, 3u, 1u.  These conventions for term symbols apply for any linear molecule having inversion symmetry (e.g., HCCH, CO2). For linear molecules lacking inversion symmetry (CO, HCN) all is the same except that there is no g–u symbol. The noncrossing rule mentioned above is an important aid in constructing correlation diagrams for many processes. It is called a rule rather than a law because it can only be shown to be highly improbable, not impossible, for two levels of the same symmetry to cross. Thus, imagine that we have a molecule with some variable parameter . and also with a symmetry operation R which is not lost as . varies. For example, . might be the H–O–Hangle inwater, andR could be reflection through the plane bisecting theH–O–H angle. Suppose that we have a complete set of basis functions and that, at each value of ., we manage to express exactly all but two of the eigenfunctions for the molecule. This uses up all but two dimensions of our function space, leaving us, at each value of ., with two eigenfunctions to determine and two functions in terms of which to express them. (These functions change with ., but the above argument has nevertheless served to reduce our problem to two dimensions.) Now let the two functions remaining from our original basis be mixed to become orthonormal and also individually either symmetric or antisymmetric for R. We label these symmetrized basis functions .1 and .2. Because we began with a complete basis, it must be possible to express the as yet undetermined wavefunctions .1 and .2 exactly as linear combinations of .1 and .2 at each value of .. Furthermore, if .1 and .2 are, say, both antisymmetric for R, it is necessary that .1 and .2 also both be antisymmetric. If .1 and .2 have opposite symmetries, however, .1 and .2 also have opposite symmetries. (In the latter case, .1 and .2 can only mix to produce unsymmetric functions, and so we know that .1 and .2 Section 7-7 Basis Set Choice and the VariationalWavefunction 231 are already identical with .1 and .2.) To determine the mixing coefficients and state energies for .1 and .2, we solve the 2×2 secular equation over the basis .1, .2:  H11 -E H12 H12 H22 -E  =0 (7-107) The roots are E± = 1 2 H11 +H22 ±4H2 12 +(H11 -H22)2 1/2 (7-108) The crossing of energy levels for .1 and .2 requires that, at some value of ., E+ equals E-. From Eq. (7-108), we see that this requires that the term in square brackets vanish, which requires that H12 and H11 - H22 vanish. Now, if .1 and .2 (and hence .1 and .2) have opposite symmetries for R, H12 vanishes for all values of ., and the curves will cross whenever H11 equals H22. But if .1 and .2 (and hence .1 and .2) have the same symmetry for R, H12 is not generally zero. In this situation, the curve crossing requires that both H12 and H11-H22 happen to pass through zero at the same value of .. This simultaneous occurrence of two functions passing through zero is so unlikely that it is safe to assume it will not happen. If the molecule possesses several elements of symmetry, H12 will vanish at all . if .1 and .2 disagree in symmetry for any one of them, so the noncrossing rule applies only to states having wavefunctions of identical symmetry for all symmetry operations of the molecule. A similar treatment for orbitals and orbital energy levels is possible, and the noncrossing rule applies for orbital energies as well as for state energies. 7-7 Basis Set Choice and the VariationalWavefunction One of the places where human decision can effect the outcome of a variational calculation is in the choice of basis. Some insight into the ways this choice effects the ultimate results is necessary if one is to make a wise choice of basis, or recognize which calculated results are “physically real” and which are artifacts of basis choice. One question we can ask is this: Is a minimal basis set equally appropriate for calculating an MO wavefunction for, say, B2 as F2? In each case we use 10 AOs and 2 spin functions producing a total of 20 spin MOs. With B2, however, we have 10 electrons to go into these spin MOs, and in F2 we have 18 electrons. In all but the crudest MO calculations, the total energy is minimized in a manner that depends on the natures of only the occupied MOs. In effect, then, the calculation forB2 produces the 10 “best” spin MOs from a basis set of 20 spin-AOs, whereas that for F2 produces the 18 best MOs from a different basis set of 20 spin-AOs. In a sense, then, the basis for F2 is less flexible than that for B2. Of course, the use of separated atom orbitals is a conscious effort to choose that basis that best spans the same function space as the best MOs. To the extent that this strategy is successful, the above problem is obviated (i.e., if both sets are perfect, additional flexibility is useless). The strategy is not completely successful, however, and comparison of results of minimal basis set calculations down a series of molecules such as B2,C2,N2,O2, and F2 may be partially hampered by this 232 Chapter 7 The Variation Method ill-defined inequivalence in basis set adequacy. In contrast, comparison of calculated results in a series of molecules such as CnH2n+2 is much less likely to suffer from this particular problem because the minimal basis set grows with increasing n in a way to keep pace with the number of electrons. Let us now briefly consider how basis sets might vary in adequacy for different states of a given molecule. We will compare the wavefunction for the ground state of a molecule with the wavefunction for a Rydberg state. Rydberg states are so named because their spectral lines progress toward the ionization limit in a manner similar to the spectral pattern for hydrogenlike ions (called a Rydberg series).7 Hence, the Rydberg states of molecules are in some way like excited states of the hydrogen atom. This can be understood by visualizing an excited state for, say, N2 wherein one electron is, on the average, very far away from the rest of the molecule, which is now an N+2 “core.” As the excited electron moves to orbitals farther and farther out, the N+2 core becomes effectively almost like a point positive charge. As a result, the coupling between the angular momentum of this orbital and the internuclear axis grows progressively weaker, so that the motion of the Rydberg electron becomes more and more independent of orientation of the core. It is not surprising that a hydrogen-likeAO centered in the bond becomes more and more appropriate as a basis for describing this orbital. In contrast, such a “single-center” basis normally requires many terms to accurately describe MOs in ground or non-Rydberg excited states. Thus, for a Rydberg state of N2, one would do well to choose a basis set of AOs located on the nuclei to describe the MOs of the N+2 core, and to use an AO (or several AOs) centered between the nuclei to describe the orbital for the Rydberg electron. Thus far we have kept the discussion within the framework of homonuclear diatomic molecules. When we come to heteronuclear diatomics, for example, CO, we lose inversion symmetry and we can no longer symmetry balance our basis. This means that a given basis may be more inadequate for representing the wavefunction on one end of the molecule than on the other. As a result, the electronic charge will be shifted toward the end where the basis set is best able to minimize the energy. This charge shift is an artifact of basis set imbalance, but, since we have no way to evaluate this imbalance, it is difficult to tell how much it affects our results. Mulliken has published some calculations on the HF molecule that illustrate this problem in a striking way. Table 7-3 is a list of total energies and dipole moments calculated for HF using a variety of basis sets. The first column of data arises from a minimal basis set of STOs (1sH, 1sF, 2sF, 2psF , 2ppxF , 2ppyF ) with orbital exponents evaluated from Slater’s rules for atoms. The second column results if the orbital exponents are allowed to vary independently to minimize the molecular energy. The basis set for the third column is obtained by augmenting the previous basis with additional STOs centered on the H nucleus (2sH, 2psH, 2ppxH, 2ppyH). Finally, the fourth column results from use of a basis set that has been augmented (over the minimal basis) at both nuclei in a way thought to be appropriately balanced. As the basis set grows increasingly flexible, the average energy becomes lower, but the expectation value for the dipole moment does not converge uniformly toward the observed value. In particular, by augmenting the basis on hydrogen only, we create a very unbalanced basis, which causes charge to shift too much toward the hydrogen end of the molecule. 7See A. B. F. Duncan [3]. Section 7-8 Beyond the Orbital Approximation 233 TABLE 7-3  Energies and Dipole Moments for Hydrogen Fluoride Calculated by the Variation Method Using Different Basis Setsa Min STO Slater . Min STO best . Min STO F; Aug. STO H (very unbalanced) Aug. STO F and H (balanced) Exp E (a.u.) -99.4785 -99.5361 -99.6576 -100.0580 -100.527 µ(H+F-) 0.85D 1.44D 0.92D 1.98D 1.82Db aSee Mulliken [4]. bData fromWeiss [5]. These problems with basis set adequacy are difficult to overcome completely. Fortunately, with a certain amount of experience, insight, and caution, it is nevertheless possible to carry out variational calculations and interpret their results to obtain reliable and useful information. 7-8 Beyond the Orbital Approximation Most of our discussion of the variation method has been restricted to calculations within the orbital approximation. To avoid leaving an inaccurate impression of the capabilities of the variation method, we shall briefly describe some calculations on some small (two-electron) systems where the method can be employed to its fullest capabilities. These calculations are listed in Table 7-4. The calculation on He by Kinoshita expresses the spatial part of the wavefunction as .(ks, kt, ku)=e-ks/2 8 l,m,n=0 n,even cl,m,n(ks)l-m(ku)m-n(kt)n (7-109) where s =r1 +r2, u=r12, t=-r1 +r2 (7-110) and k and cl,m,n are variable parameters. The exponential term causes the wavefunction to vanish as either electron goes to infinite r, and the terms in the sum build up a polynomial in one- and two-electron coordinates, reminiscent of the form of eigenfunctions for the harmonic oscillator and the hydrogenlike ion. Kinoshita carried out his calculation to as many as 39 terms, obtaining an energy that he estimated to differ from the exact result by no more than 1.2 × 10-6 a.u. A subsequent calculation by Pekeris, using a related approach, required solving a secular determinant of order 1078 and yielded an energy estimated to be accurate to 1.0×10-9 a.u. Applying corrections for coupling between electronic and nuclear motions, and also for relativistic effects, Pekeris arrived at a theoretical value for the ionization energy of He of 198310.687cm-1 compared to the experimental value of 198310.82 ±0.15cm-1. 234 Chapter 7 The Variation Method TABLE 7-4  Results of Some Very Accurate Variational Calculations on Two-Electron Systems System Minimized energya (a.u.) Estimated maximum energy error E¯ -Eexact (a.u.) He (ground singlet)b -2.9037225 0.0000012 He (ground singlet)c -2.903724375 0.000000001 He (lowest triplet)c -2.17522937822 0.00000000001 H2 (ground singlet)d -1.17447498302e 0.000000001 aUncorrected for nuclear motion and relativistic effects. bFrom Kinoshita [6]. cFrom Pekeris [7]. dFrom Kolos andWolniewicz [8]. eAt R =1.401078 a.u. In the 35 years since Pekeris work was reported, even more extensive calculations have been reported. For example, Drake et al. [9] have calculated an upper bound for the ground state of helium having 22 significant figures using a wavefunction having 2358 terms. Extremely accurate variational calculations have been carried out onH2 byKolos and Wolniewicz. They used elliptic coordinates and an r12 coordinate and expressed their wavefunction as an expansion in powers of these coordinates, analogous in spirit to the Kinoshita wavefunction described above. Their most accurate wavefunctions contain 100 terms and are calculated for a range of R values. After including corrections for relativistic effects and nuclear motion, Kolos and Wolniewicz arrived at a theoretical value for the dissociation energy in H2 of 36117.4cm-1 compared to what was then the best experimental value 36113.6±0.5cm-1. Subsequent redetermination of the experimental value gave 36117.3±1.0cm-1.8 The dissociation energy, D0, is the energy required to separate a molecule into its constituent atoms, starting with a molecule in its lowest vibrational state. The binding energy, De, is the energy for the corresponding process if we omit the vibrational energy of the molecule (see Fig. 7-18). These quantities are often much more sensitive measures of the accuracies of calculations than are total energies. The reason for this is easily understood when we recognize that the binding energy is a fairly small difference between two large numbers—the total energy of the molecule and the total energy of the separated atoms. Unless our errors in these two large energies are equal, the residual error is magnified (in terms of percentage) when we take the difference. Thus, the best total energy forH2 in a certain orbital approximation is-1.133629 a.u., which is 96.7% of the total energy. However, the corresponding binding energy is -0.133629 a.u., which is 76.6% of the correct value. The need to calculate accurate binding energies is sometimes referred to as the need to achieve “chemical accuracy.” The variational calculations cited above are among the most accurate performed, and they give an indication of the capabilities of the method. Properties other than 8See Herzberg [10]. Section 7-8 Beyond the Orbital Approximation 235 Figure 7-18  Schematic drawing showing the distinction between D0, the dissociation energy from the lowest vibrational level, and De, the binding energy, which does not take vibrational energy into account. The zero of energy is the energy of the separated atoms. energy predicted from suchwavefunctions are also very accurate. For example, Pekeris’ best wavefunction for the first triplet state of helium gives an electron density at the nucleus of 33.18416 electrons per cubic bohr compared with the experimental value 33.18388±0.00023 deduced from hyperfine splitting. For most systems of chemical interest, calculations of this sort become much too impractical to be considered. For this reason the orbital approximation, with all its limitations, is used in most quantumchemical calculations on systems having more than two electrons. 7-8.A Problems 7-1. Given the following two functions, f (x) and g(x), for the range 0=x =L: Figure P7-1  236 Chapter 7 The Variation Method Can these functions be expressed accurately as linear combinations of particlein- a-box eigenfunctions (box walls at x =0,L)? Indicate your reasoning. If yes, what is the expression for the first two coefficients in the expansion? Can you evaluate any of these by inspection? 7-2. Consider a particle in a box with a biased potential that is higher at x =L than at x =0. An approximate solution for the ground state could be f =v v 0.9.1 + 0.1.2, where .1 and .2 are the first and second eigenfunctions for the unbiased box. (a) Make a rough sketch of f, showing how it skews the particle distribution. (b) What is the average kinetic energy for f, in terms of h,m, and L? 7-3. The normalized function f =(2/45p)1/2r2 exp(-r) can be expanded in terms of hydrogen atom eigenfunctions: f =c1.1s +c2.2s +c3.2p0 +··· where .1s=(1/vp)exp(-r) and .2p0 =(1/v32p)r exp(-r/2) cos .. Evaluate c1 and c3. 7-4. Given the approximate wavefunction for the lowest state of a particle in a onedimensional box (Fig. P7-4): Figure P7-4  f =3/L(2x/L), 0=x =L/2 f =3/L[2(L-x)/L], L/2=x =L f =0, x<0, x>L a) Resolve f into the box eigenfunctions. That is, evaluate cn in the expression f = 8 n=1 cn.n, where .n =2/Lsin (npx/L) , 0=x =L, .n =0, x<0, x>L b) Using the coefficients from part (a) compare the value of f at x =L/2 with the values one obtains from the fapprox = m  n=1 cn.n, with m=1, 3, 5, 7, and 9 Section 7-8 Beyond the Orbital Approximation 237 c) Use the coefficients from part (a) to obtain an expression for E¯ appropriate for f. Estimate the value of the infinite series and thereby estimate E¯. Compare this value to Eexact. 7-5. Let f =exp(-ar2) be a trial function (not normalized) for the ground state of the hydrogen atom. Use the variation method to determine the minimum energy attainable from this form by variation of a. Find the average value of r and the most probable value of r for this wavefunction. Compare these r values and the average energy with the exact values. 7-6. Let f(a)=(a5/3p)1/2r exp(-ar) be a trial function for the ground state of the hydrogen atom: a) Verify that the variation method gives a = 3 2 ,E¯ =-38 a.u. b) Verify that f 3 2  has an overlap of 0.9775 with the 1s function. c) Find the value of a that maximizes the overlap of f with the 1s function and determine the average energy of this new f. 7-7. A normalized approximate wavefunction f for the hydrogen atom is projected into its components and found to be f = (1/v2).1s + (1/v4).2s + (1/v8).3s +c4s.4s. There are no higher terms. a) What is the value of c4s? b) What is the value of the average energy? 7-8. A normalized, spherically symmetric variationally optimized function for the hydrogen atom is analyzed by projecting out coefficients c1s, c2s, c2p0 , etc. It is found that c1s is equal to v0.80 and c2s =v0.15. a) What is the maximum possible value for c2p0? Explain your reasoning. b) What is the minimum possible value for E¯ that could correspond to this function, based on the above data? Explain your reasoning. 7-9. A normalized trial wavefunction of the form (in a.u.) f =(2. )7/(4p6!) 1/2 r2 exp (-.r) is variationally optimized to give the lowest possible energy for the hydrogen atom. The results part way along this process are  fHˆ f dv =. 2/10-./3. a) Complete the variational process to obtain the optimum . and the minimized average energy. b) Calculate the value of c1 in the expression fopt =c1.1s +c2.2s+···. c) Produce a new, normalized function . that is orthogonal to .1s but is otherwise as similar as possible to fopt. (Use the unexpanded symbolic terms .1s and f in your answer.) d) Suppose you are told that the average energy associated with your new function . is -0.133 a.u. Do you find this reasonable? Explain your answer. 7-10. f=(a5/3p)1/2r exp(-ar) is a normalized trial function for the hydrogen atom. The energy is minimized when a = 3/2. Calculate the average value of the potential energy predicted by this function if a =3/2. 238 Chapter 7 The Variation Method 7-11. Prove that optimized trial function (7-20) must contain contributions from continuum wavefunctions. 7-12. Compare the orbital exponent for a 1sAO in He as found by the variation method [Eq. (7-35)] with that given by Slater’s rules (Chapter 5). 7-13. A different trial function for calculating the polarizability of the hydrogen atom in a uniform electric field of strength F is .trial =.1s(c1 +c2z) This is somewhat similar to the example in the text, since z.1s gives a p-like function, but not exactly the 2pz eigenfunction. a) Use this form to find an expression for the minimum E¯ as a function of F. What value of E¯ does this give for F =0.1 a.u.? Can you suggest why this trial function is superior to the one used in the text? b) The polarizability is defined to be a in the expression E=- 1 2 - 1 2 aF2 What value of a do you obtain? [Exact a =4.5 a.u.] 1 a.u. of field strength is equal to e/a2 0 . Deduce the value of 1 a.u. of polarizability. 7-14. Which hydrogen atom state should be more polarizable, the 1s or 2s? [Consider the factors that determine the extent of mixing between basis functions.] Explain your reasoning. 7-15. fa and fb are chosen to be a normalized set of basis functions for an LCAO wavefunction for a one-electron homonuclear diatomic system. It is found that the values for the integrals involving these functions are f*aHˆ fa dv = -2 a.u., f*bHˆ fb dv =-2 a.u., f*aHˆ fb dv = -1 a.u., f*afb dv = 1 4 . Find an upper bound for the exact lowest electronic energy for this system. Find the corresponding LCAO normalized approximate wavefunction. 7-16. fa and fb are chosen as the normalized basis functions for an LCAO wavefunction for a one-electron, heteronuclear, diatomic molecule. It is found that the values for some integrals involving these functions are faHˆ fa dv = -2 a.u., fbHˆ fb dv =-1 a.u., faHˆ fb dv = - 1 2 a.u., fafb dv = 1 3 . whereHˆ is themolecular hamiltonian. Set up the secular determinantal equation and find the lowest electronic energy that can be computed from an LCAO wavefunction cafa + cbfb. Find ca and cb such that E¯ is minimized and the wavefunction is normalized. Section 7-8 Beyond the Orbital Approximation 239 7-17. Apossible basis function for representing the 1sg wavefunction ofH+2 is a 1s-like AO . 3/p 1/2 exp(-.r) located at the bond center. Assuming an internuclear separation of 2 a.u., find the . value that minimizes E¯. Is this basis function adequate to predict a bound H+2 molecule? [Use Appendix 3 to help you develop your formulas.] 7-18. Without referring to the text, and by inspection, what is the united atom limit for the 1su molecular orbital’s energy (in a.u.) for H+2 ? 7-19. Show that, at R=8, the .+ and .- wavefunctions for H+2 are capable of describing a state wherein the electron is in a 1s orbital on atom A. 7-20. Evaluate Eqs. (7-89) and (7-90) at R =0 to show that HAA =HAB at this point. 7-21. Examining Eq. (7-86), and letting HAB = kHAA what relationship between k and SAB is necessary if the sg MO is to be lower in energy than the su MO? [Assume that HAA is negative, and that k and SAB are positive.] 7-22. Consider the one-electron molecule–ion HeH2+: a) Write down the hamiltonian operator (nonrelativistic, Born–Oppenheimer approximation) for the electronic energy in atomic units for this system. b) Calculate the electronic energies for the lowest energy state of this system in the separated atom and united atom limits. 7-23. For a homonuclear diatomic molecule aligned as shown in Fig. P7-23, characterize each of the following MOs as s, p, d, and g or u, and bonding or antibonding. a) 2pya +2pyb b) 2pza +2pzb c) 3dz2 a +3dz2 b d) 3dxya +3dxyb e) 3dxza -3dxzb Figure P7-23  7-24. Characterize each of the following atomic orbitals with the symbols s, p, d, and also g or u. Let the z axis be the reference axis for angular momentum. 1s 2pz 3py 3dxy 2s 2px 3dz2 3dxz 7-25. Indicate whether you expect each of the following homonuclear diatomic MOs to be bonding or antibonding. Sketch the MO in each case: (a) su (b) pu (c) dg 240 Chapter 7 The Variation Method 7-26. A homonuclear diatomic molecule MO of pu symmetry is to be expressed as a linear combination ofAOs centered on the nuclei, which lie on the z axis. Which of the AOs in the following list can contribute to the MO? 1s 2s 2px 2py 2pz 3s 3px 3py 3pz 3dxy 3dxz 3dyz 3dz2 3dx2-y2 7-27. Use sketches and symmetry arguments to decide which of the following integrals vanish for diatomic molecules (the x, y, and z axes are shown in Fig. P7-23): a)  2pza 1sb dv b)  2pya1sb dv c)  2pza2pyb dv d)  2pya3dyzb dv e)  2pza3dyzb dv f)  1saHˆ 2pxa dv g)  1saHˆ 2pza dv 7-28. Show that, for reflection through a plane containing the nuclei, if the MO dx2-y2 is symmetric, then the MO dxy is antisymmetric. Show that the same is true for p MOs constructed from dxz and dyz AOs. (The internuclear axis is assumed to lie along the z coordinate.) 7-29. Assuming the internuclear axis to lie along the z coordinate, what are the possible ML quantum numbers for an MO constructed from 3dz2 a -3z2 b ? 7-30. In a homonuclear diatomic correlation diagram, what MO symmetry symbols (s, p, d, g, u) could correlate with each of the united atom AOs listed below? Assume z to be the “old” internuclear axis. Indicate for each case whether this united atom orbital is the terminus for a bonding or an antibonding MO. (a) 2pz (b) 2px (c) 3dxz (d) 3dxy 7-31. A homonuclear diatomic system has the ground-state MO configuration 1s2 g 1s2 u 2s2 g 2s2 u 3s2 g 1p4 u 1p2 g : a) What is the net number of bonding electrons? b) What spin multiplicity would you expect for the ground state? c) What would you expect the effect to be on the dissociation energy of this molecule of ionization (1) from the 1pg MO? (2) from the 3sg MO? d) Upon ionization (one-electron) from the 1pg level, what would be the spin multiplicity of the resulting ion? e) To what type of united atom AO does the pu MO correlate? 7-32. a) Without referring to the text, write out the ground state configuration for O+2 using MO symmetry symbols (1s2 g etc.) b) What is the net number of bonding electrons? c) How does the dissociation energy for this ion compare to that for O2? d) What is the ground state term symbol for this ion? e) Which occupied MOs may contain contributions from 2pz AOs, assuming z to be the internuclear axis? Section 7-8 Beyond the Orbital Approximation 241 7-33. The reduced symmetry of heteronuclear (compared to homonuclear) diatomic molecules results in their having a different correlation diagram. Set up a correlation diagram for heteronuclear diatomics. Be sure to indicate that the energy levels of each type of AO are not identical for the separated atoms. Comparing correlation diagrams for homonuclear and heteronuclear molecules, does it seem reasonable that He2 is unstable, whereas the isoelectronic LiH and LiHe+ are stable molecules? 7-34. Following are some Slater orbital coefficients for some MO’s of F2 calculated by Ransil [11] (the 2ps STOs are defined according to z axes pointing from each atom toward the other): 1sg c1s,A =c1s,B =0.70483 2sg c1s,A =c1s,B =0.17327 c2s,A =c2s,B =0.00912 c2s,A =c2s,B =-0.67160 c2ps ,A =c2ps ,B =-0.00022 c2ps ,A =c2ps ,B =-0.08540 We see that the 1sg MO is almost entirely made from 1s AOs on A and B. However, the 2sg MO contains what appears to be an anomalously large amount of 1s AO. This turns out to be an artifact of the fact that Slater-type 2s orbitals are not orthogonal to 1sAOs on the same center. For F2, the STO 1s, 2s overlap is 0.2377. Use this fact to construct a new orbital, 2s, that is orthogonal to 1s. Express the 2sg MO of Ransil in terms of the basis functions 1s, 2s, and 2ps on centers A and B. You should find the 1s coefficients much reduced. 7-35. Re for H+2 equals 2.00 a.u. At this distance, Eelec =-1.1026 a.u. What is the value of De for H+2 ? Multiple Choice Questions (Try to answer these without referring to the text.) 1. A homonuclear diatomic MO is given by f =2pz,a +2pz,b, where the z axis is the same as the internuclear axis. Which one of the following statements about f is correct? a) f is an antibonding MO, symbolized su. b) f is a bonding MO, symbolized pu. c) f is an antibonding MO, symbolized pg. d) f is a bonding MO, symbolized sg. e) f is an antibonding MO, symbolized pu. 2. For the three species N2, N+2 , N-2 , which one of the following orders for the bond energy (i.e., bond strength) is most reasonable? a) N2 >N+2 >N-2 b) N+2 >N2 >N-2 c) N-2 >N2 >N+2 d) N-2 >N+2 >N2 e) Only N2 forms a bond; N+2 and N-2 do not. 242 Chapter 7 The Variation Method 3. According to the LCAO-MO model, which one of the following second period diatomic molecules has a double bond in the ground electronic state? a) Li2 b) Be2 c) B2 d) C2 e) N2 4. Which one of the following statements concerning H+2 is false? a) The nondegenerate LCAO-MOs (without spin) must be either symmetric or antisymmetric for inversion. b) The lowest energy MO (without spin) of the molecule is antisymmetric for inversion. c) The MOs transform into the AOs of the helium ion as the two nuclei are fused together. d) The ground state has a multiplicity of two. e) The Born-Oppenheimer approximation permits finding the solution for the purely electronic wave function at each value of the internuclear distance. 5. Which one of the following statements is false for bonding MOs formed from linear combinations of AOs on atoms a and b? a) Only AOs that have nonzero overlap can form bonding MOs. b) Only AOs that have similar energies can form strongly bonding MOs. c) Bonding MOs cannot have a nodal plane perpendicular to the internuclear axis and midway between a and b. d) A p AO on b can combine with a p AO on a to form s,p, or d MOs. e) A maximum of three bonding MOs can be formed from 2p AOs on a and b. References [1] R. S. Mulliken, Rev. Mod. Phys. 2, 60, 506 (1930); 3, 90 (1931); 4, 1 (1932). [2] J. O. Hirschfelder, C. F. Curtiss, and R. B. Bird, Intermolecular Forces.Wiley, New York, 1964. [3] A. B. F. Duncan, Rydberg Series in Atoms and Molecules. Academic Press, NewYork, 1971. [4] R. S. Mulliken, J. Chem Phys. 36, 3428 (1962). [5] R.Weiss, Phys. Rev. 131, 659 (1963). [6] T. Kinoshita, Phys. Rev. 150, 1490 (1957). [7] C. L. Pekeris, Phys. Rev. 115, 1216 (1959). [8] W. Kolos and L.Wolniewicz, J. Chem. Phys. 49, 404 (1968). Section 7-8 Beyond the Orbital Approximation 243 [9] G.W. F. Drake, M. M. Cassar, and R. A. Nistor, Phys. Rev. A 65, 54501 (2002). [10] G. Herzberg, J. Mol. Spectrosc. 33, 147 (1970). [11] B. J. Ransil, Rev. Mod. Phys. 32, 245 (1960). Chapter 8 The Simple H¨uckel Method and Applications 8-1 The Importance of Symmetry Our discussions of the particle in a box, the harmonic oscillator, the hydrogen atom, and homonuclear diatomic molecules have all included emphasis on the role that symmetry plays in determining the qualitative nature of the eigenfunctions. When we encounter larger systems, detailed and accurate solutions become much more difficult to perform and interpret, but symmetry continues to exert strong control over the solutions. In this chapter, we will describe a rather simple quantum chemical method that was formulated in the early 1930s by E. H¨uckel. One of the strengths of this method is that, by virtue of its crudeness and simplicity, the effects of symmetry and topology on molecular characteristics are easily seen. Also, the simplicity of the model makes it an excellent pedagogical tool for illustrating many quantum chemical concepts, such as bond order, electron densities, and orbital energies. Finally, the method and some of its variants continue to be useful for certain research applications. Indeed, it is difficult to argue against the proposition that every graduate student of organic and inorganic chemistry should be acquainted with the H¨uckel molecular orbital (HMO) method. 8-2 The Assumption of s–p Separability The simple H¨uckel method was devised to treat electrons in unsaturated molecules like ethylene and benzene. By 1930 it was recognized that unsaturated hydrocarbons are chemically more reactive than are alkanes, and that their spectroscopic and thermodynamic properties are different too. The available evidence suggested the existence of loosely held electrons in unsaturated molecules. We have already seen that, when atoms combine to form a linear molecule, we can distinguish between MOs of type s,p, d, . . . depending on whether the MOs are associated with anmquantum number of 0,±1,±2, . . . Thus, in acetylene (C2H2), the minimal basis set ofAOs on carbon and hydrogen lead to s and p MOs. Let us imagine that our acetylene molecule is aligned along the z Cartesian axis. Then the px p-type AOs on the carbons are antisymmetric for reflection through a plane containing the molecular axis and the y axis. Similarly, the py p-type AOs are antisymmetric for reflection through a plane containing the molecular axis and the x axis. The pz AOs, which are s-type functions, are symmetric for reflection through any plane containing 244 Section 8-2 The Assumption of s–p Separability 245 the molecular axis. It has become standard practice to carry over the s-p terminology to planar (but nonlinear) molecules, where m is no longer a “good” quantum number. In this expanded usage, a p orbital is one that is antisymmetric for reflection through the plane of the molecule, a s orbital being symmetric for that reflection. H¨uckel found that, by treating only the p electrons explicitly, it is possible to reproduce theoretically many of the observed properties of unsaturated molecules such as the uniform C–C bond lengths of benzene, the high-energy barrier to internal rotation about double bonds, and the unusual chemical stability of benzene. Subsequent work by a large number of investigators has revealed many other useful correlations between experiment and this simple HMO method for p electrons. Treating only the p electrons explicitly and ignoring the s electrons is clearly an approximation, yet it appears to work surprisingly well. Physically, H¨uckel’s approximation may be viewed as one that has the p electrons moving in a potential field due to the nuclei and a “s core,” which is assumed to be frozen as the p electrons move about. Mathematically, the s-p separability approximation is Etot =Es +Ep (8-1) where Etot is taken to be the electronic energy Eel plus the internuclear repulsion energy Vnn. Let us consider the implications of Eq. (8-1). We have already seen (Chapter 5), that a sum of energies is consistent with a sum of hamiltonians and a product-type wavefunction. This means that, if Eq. (8-1) is true, the wavefunction of our planar molecule should be of the form (see Problem 8-1) .(1, . . . ,n)=.p (1, . . . , k).s (k +1, . . . ,n) (8-2) and our hamiltonian should be separable into p and s parts: Hˆ(1, 2, . . . ,n)=Hˆp (1, 2, . . . , k)+Hˆs (k +1, . . . ,n) (8-3) Equations (8-2) and (8-3) lead immediately to Eq. (8-1): E¯ =  .*p.*s Hˆp +Hˆs .p.s dt(1, . . . ,n)  .*p.*s.p.s dt(1, . . . ,n) =  .*pHˆp.pdt(1, . . . , k)  .*p.pdt(1, . . . , k) +  .*sHˆs.s dt(k +1, . . . ,n)  .*s.s dt(k +1, . . . ,n) = Ep +Es (8-4) If these equations were valid, one could ignore .s and legitimately minimize Ep by varying .p , But the equations are not valid because it is impossible to rigorously satisfy Eq. (8-3). We cannot define Hˆp and Hˆs so that they individually depend completely on separate groups of electrons and still sum to the correct total hamiltonian. Writing these operators explicitly gives Hˆp (1, . . . , k) = - 1 2 k  i=1 .2 i + k  i=1 Vne (i)+ 1 2 k  i=1 k  j=1,j =i 1 rij (8-5) Hˆs (k +1, . . . ,n) = - 1 2 n  i=k+1 .2 i + n  i=k+1 Vne (i)+ 1 2 n  i=k+1 n  j=k+1,j =i 1 rij +Vnn (8-6) 246 Chapter 8 The Simple H¨uckel Method and Applications where Vne (i) represents the attraction between electron i and all the nuclei. These hamiltonians do indeed depend on the separate groups of electrons, but they leave out the operators for repulsion between s and p electrons: Hˆ-Hˆp -Hˆs = k  i=1 n  j=k+1 1 rij (8-7) In short, the s and p electrons really do interact with each other, and the fact that the HMO method does not explicitly include such interactions must be kept in mind when we consider the applicability of the method to certain problems. Some account of s-p interactions is included implicitly in the method, as we shall see shortly. 8-3 The Independent p-Electron Assumption TheHMOmethod assumes further that thewavefunction.p is a product of one-electron functions and that the hamiltonian Hˆp is a sum of one-electron operators. Thus, for np electrons, .p (1, 2, . . . ,n) = fi(1)fj (2) . . . fl(n) (8-8) Hˆp (1, 2, . . . , n) = Hˆp (1)+Hˆp (2)+···+Hˆp (n) (8-9) and  f*i(1)Hˆp (1)fi(1)dt (1)  f*i(1)fi(1)dt (1) =Ei (8-10) It follows that the total p energy Ep is a sum of one-electron energies: Ep =Ei +Ej +···+El (8-11) This means that the p electrons are being treated as though they are independent of each other, since Ei depends only on fi and is not influenced by the presence or absence of an electron in fj . However, this cannot be correct because p electrons in fact interact strongly with each other. Once again, such interactions will be roughly accounted for in an implicit way by the HMO method. The implicit inclusion of interelectronic interactions is possible because we never actually write down a detailed expression for the p one-electron hamiltonian operator Hˆp (i). (We cannot write it down because it results froma p–s separability assumption and an independent p-electron assumption, and both assumptions are incorrect.) Hˆp (i) is considered to be an “effective” one-electron operator—an operator that somehow includes the important physical interactions of the problem so that it can lead to a reasonably correct energy value Ei . A key point is that the HMO method ultimately evaluates Ei via parameters that are evaluated by appeal to experiment. Hence, it is a semiempirical method. Since the experimental numbers must include effects resulting from Section 8-4 Setting up the H¨uckel Determinant 247 all the interelectronic interactions, it follows that these effects are implicitly included to some extent in the HMO method through its parameters. It was pointed out in Chapter 5 that, when the independent electron approximation [Eqs. (8-8)–(8-11)] is taken, all states belonging to the same configuration become degenerate. In other words, considerations of space-spin symmetry do not affect the energy in that approximation. Therefore, the HMO method can make no explicit use of spin orbitals or Slater determinants, and so .p is normally taken to be a single product function as in Eq. (8-8). The Pauli principle is provided for by assigning no more than two electrons to a single MO. EXAMPLE 8-1 If O2 were treated by the HMO method, what would be the form of the wavefunction and energy for the ground state? SOLUTION  The ground state configuration for O2 is 1s2 g 1s2 u 2s2 g 2s2 u 3s2 g 1p2 u,x1p2 u,y × 1pg,x1pg,y, where we have shown the degenerate members of p levels explicitly and in their real forms. The HMO wavefunction is simply a product of the pi MOs, one for each of the six pi electrons: 1pu,x (1)1pu,x (2)1pu,y (3)1pu,y (4)1pg,x(5)1pg,y(6). The HMO energy is 2Ep,u,x +2Ep,u,y +Ep,g,x +Ep,g,y, which reduces to 4Ep,u +2Ep,g. Note that, becauseO2 is linear, there is no unique molecular plane containing the internuclear axis. Therefore this molecule has two sets of p MOs, one pair pointing in the x direction, the other pair pointing along y. For a planar molecule, only one of these pairs would qualify as p MOs, as will be seen in the next section.  8-4 Setting up the H¨uckel Determinant 8-4.A Identifying the Basis Atomic Orbitals and Constructing a Determinant The allyl radical, C3H5, is a planar molecule1 with three unsaturated carbon centers (see Fig. 8-1). The minimal basis set of AOs for this molecule consists of a 1s AO on each hydrogen and 1s, 2s, 2px, 2py, and 2pz AOs on each carbon. Of all these AOs only the 2pz AOs at the three carbons are antisymmetric for reflection through the molecular plane. Figure 8-1  Sketch of the nuclear framework for the allyl radical. All the nuclei are coplanar. The z axis is taken to be perpendicular to the plane containing the nuclei. 1The minimum energy conformation of the allyl system is planar. We will ignore the deviations from planarity resulting from vibrational bending of the system. 248 Chapter 8 The Simple H¨uckel Method and Applications Figure 8-2  The three p-type AOs in the minimal basis set of the allyl radical. Following H¨uckel, we ignore all the s-type AOs and take the three 2pz AOs as our set of basis functions. Notice that this restricts us to the carbon atoms: the hydrogens are not treated explicitly in the simpleHMOmethod. We label our three basis functions .1,.2,.3 as indicated in Fig. 8-2. We will assume these AOs to be normalized. Suppose that we now perform a linear variation calculation using this basis set. We know this will lead to a 3×3 determinant having roots that are MO energies which can be used to obtain MO coefficients. The determinantal equation is  H11 -ES11 H12 -ES12 H13 -ES13 H21 -ES21 H22 -ES22 H23 -ES23 H31 -ES31 H32 -ES32 H33 -ES33  =0 (8-12) where Hij =  .iHˆp.j dv (8-13) Sij =  .i.j dv (8-14) Since Hij and Sij are integrals over the space coordinates of a single electron, the electron index is suppressed in Eqs. (8-13) and (8-14). 8-4.B The Quantity a We have already indicated that there is no way to write an explicit expression for Hˆp that is both consistent with our separability assumptions and physically correct. But, without an expression for Hˆp , how can we evaluate the integrals Hij ? The HMO method sidesteps this problem by carrying certain of theHij integrals along as symbols until they can be evaluated empirically by matching theory with experiment. Let us first consider the integrals H11, H22 and H33. The interpretation consistent with these integrals is that H11, for instance, is the average energy of an electron inAO .1 experiencing a potential field due to the entire molecule. Symmetry requires that H11 =H33. H22 should be different since an electron in AO .2 experiences a different Section 8-4 Setting up the H¨uckel Determinant 249 environment than it does when in .1 or .3. It seems likely, however, that H22 is not very different from H11. In each case, we expect the dominant part of the potential to arise from interactions with the local carbon atom, with more distant atoms playing a secondary role. Hence, one of the approximations made in the HMO method is that all Hii are identical if .i is on a carbon atom. The symbol a is used for such integrals. Thus, for the example at hand, H11 =H22 =H33 =a. The quantity a is often called the coulomb integral.2 8-4.C The Quantity ß Next, we consider the resonance integrals or bond integrals H12, H23, and H13. (The requirement that Hˆp be hermitian plus the fact that the .’s and Hˆp are real suffices to make these equal to H21, H32, and H31, respectively.) The interpretation consistent with these integrals is thatH12, for instance, is the energy of the overlap charge between .1 and .2. Symmetry requires thatH12=H23 in the allyl system. However, even when symmetry does not require it, the assumption is made that all Hij are equal to the same quantity (called ß) when i and j refer to “neighbors” (i.e., atoms connected by a s bond). It is further assumed that Hij =0 when i and j are not neighbors. Therefore, in the allyl case, H12 =H23 =ß,H13 =0. 8-4.D Overlap Integrals Since the .’s are normalized, Sii =1. The overlaps between neighbors are typically around 0.3. Nevertheless, in the HMO method, all Sij (i = j ) are taken to be zero. Although this seems a fairly drastic approximation, it has been shown to have little effect on the qualitative nature of the solutions. 8-4.E Further Manipulation of the Determinant Our determinantal equation for the allyl system is now much simplified. It is  a -E ß 0 ß a-E ß 0 ß a-E  =0 (8-15) Dividing each row of the determinant by ß corresponds to dividing the whole determinant by ß3. This will not affect the equality. Letting (a -E)/ß =x, we obtain the result  x 1 0 1 x 1 0 1 x  =0 (8-16) 2The term “coulomb integral” for a is unfortunate since the same name is used for repulsion integrals of the form  .1(1).2(2)(1/r12).1(1).2(2)dv. The quantity a also contains kinetic energy and nuclear–electronic attraction energy. 250 Chapter 8 The Simple H¨uckel Method and Applications Figure 8-3  HMO determinants for some small systems. which is the form we will refer to as the HMO determinantal equation. Notice that x occurs on the principal diagonal, 1 appears in positions where the indices correspond to a bond, 0 appears in positions (e.g., 1,3) corresponding to no bond. This gives us a simple prescription for writing the HMO determinant for any unsaturated hydrocarbon system directly from a sketch of the molecular structure. The rules are (1) sketch the framework defined by the n unsaturated carbons; (2) number the atoms 1, . . . ,n (the ordering of numbers is arbitrary); (3) fill in the n×n determinant with x’s on the diagonal, 1’s in positions where rowcolumn indices correspond to bonds, 0’s elsewhere. See Fig. (8-3) for examples. As a check, it is useful to be sure that the determinant is symmetric for reflection through the diagonal of x’s. This is necessary since, if atoms i and j are neighbors, 1’s must appear in positions i, j and j , i of the determinant. Since the H¨uckel determinant contains only information about the number of unsaturated carbons and how they are connected together, it is sometimes referred to as a topological determinant. (Topology refers to properties that are due to the connectedness of a figure, but are unaffected by twisting, bending, etc.) 8-5 Solving the HMO Determinantal Equation for Orbital Energies The HMO determinantal equation for the allyl system (8-16) can be expanded to give x3 -2x =0 (8-17) or x(x2 -2)=0 (8-18) Thus, the roots are x =0, x =v2, and x=-v2. Recalling the definition of x, these roots correspond respectively to the energies E =a, E =a -v2ß, E =a +v2ß. How should we interpret these results? Since a is supposed to be the energy of a pi electron in a carbon 2p AO in the molecule, we expect this quantity to be negative (corresponding to a bound electron). Since ß refers to an electron in a bond region, it too should be negative. Therefore, the lowest-energy root should be E1 = a + v2ß, followed by E2 = a, with E3 = a - v2ß being the highest-energy root. Section 8-6 Solving for the Molecular Orbitals 251 Figure 8-4  p-Electron configurations and total energies for the ground states of the allyl cation, radical, and anion. (It is convenient to number the orbital energies sequentially, starting with the lowest, as we have done here.) We have just seen that bringing three 2pp AOs together in a linear arrangement causes a splitting into three MO energy levels. This is similar to the splitting into two energy levels produced when two 1sAOs interact, discussed in connection with H+2 . In general, n linearly independent separatedAOs will lead to n linearly independent MOs. The ground-state p-electron configuration of the allyl system is built up by putting electrons in pairs into the MOs, starting with those of lowest energy. Thus far, we have been describing our system as the allyl radical. However, since we have as yet made no use of the number of p electrons in the system, our results so far apply equally well for the allyl cation, radical, or anion. Configurations and total p energies for these systems in their ground states are depicted in Fig. 8-4. The total p-electron energies are obtained by summing the oneelectron energies, as indicated earlier. EXAMPLE 8-2 For a planar, unsaturated hydrocarbon having formula CxHy, where all the carbons are part of the unsaturated framework, how many pi MOs are there? SOLUTION  Each carbon atom brings one 2pp AO into the basis set, so there are x basisAOs. These x independent AOs mix to form x independent MOs.  8-6 Solving for the Molecular Orbitals We still have to find the coefficients that describe the MOs as linear combinations of AOs. Recall from Chapter 7 that this is done by substituting energy roots of the secular determinant back into the simultaneous equations. For the allyl system, the simultaneous equations corresponding to the secular determinant (8-16) are c1x +c2 =0 (8-19) c1 +c2x +c3 =0 (8-20) c2 +c3x =0 (8-21) (Compare these equations with the secular determinant in Eq. (8-16) and note the obvious relation.) As we noted in Chapter 7, homogeneous equations like these can give us only ratios between c1, c2, and c3, not their absolute values. So we anticipate using only two of these equations and obtaining absolute values by satisfying the normality 252 Chapter 8 The Simple H¨uckel Method and Applications condition. Because we are neglecting overlap betweenAOs, the latter step corresponds to requiring c2 1 +c2 2 +c2 3 =1 (8-22) The roots x are, in order of increasing energy, -v2, 0,+v2. Let us take x =-v2 first. Then -v2c1+ c2 =0 (8-23a) c1-v2c2+ c3 =0 (8-23b) c2-v2c3 =0 (8-23c) Comparing Eqs. (8-23a) and (8-23c) gives c1 =c3. Equation (8-23a) gives c2 =v2c1. Inserting these relations into the normality equation (8-22) gives c2 1 +v2c1 2 +c2 1 =1 (8-24) 4c2 1 =1, c1=± 1 2 (8-25) It makes no difference which sign we choose for c1 since anywavefunction is equivalent to its negative. (Both give the same .2.) Choosing c1=+1 2 gives c1 = 1 2 , c2 = 1 v2 , c3 = 1 2 (8-26) These coefficients define our lowest-energy MO, f1: f1 = 1 2 .1 + 1 v2 .2 + 1 2 .3 (8-27) A similar approach may be taken for x =0 and x=+v2. The results are (x =0) : f2 = 1 v2 .1 - 1 v2 .3 (8-28) x=+v2 : f3 = 1 2 .1 - 1 v2 .2 + 1 2 .3 (8-29) The allyl system MOs are sketched in Fig. 8-5. The lowest-energy MO, f1, has no nodes (other than the molecular-plane node common to all p MOs) and is said to be bonding in the C1-C2 and C2 -C3 regions. It is reasonable that such a bonding MO should have an energy wherein the bond-related term ß acts to lower the energy, as is true here. The second-lowest energy MO, f2, has a nodal plane at the central carbon. Because there are no p AOs on neighboring carbons in this MO, there are no interactions at all, and ß is absent from the energy expression. This MO is said to be nonbonding. The high-energy MO, f3 has nodal planes intersecting both bonds. Because the p AOs showsign disagreement across both bonds, this MO is everywhere antibonding and ß terms act to raise the orbital energy above a. EXAMPLE 8-3 According to HMO theory, do the p electrons favor a linear, or a bent allyl radical? Section 8-7 The Cyclopropenyl System: Handling Degeneracies 253 Figure 8-5  Sketches of the allyl system MOs. (a) emphasizes AO signs and magnitudes. (b) resembles more closely the actual contours of the MOs. SOLUTION  HMO theory favors neither. The difference between linear and bent allyl shows up as a difference in C1 - C2 - C3 angle and in C1 to C3 distance. The HMO method has no angular-dependent features and explicitly omits interactions between non-neighbor carbons, like C1 and C3.  8-7 The Cyclopropenyl System: Handling Degeneracies The allyl system results when three p AOs interact in a linear arrangement wherein H12 = H23 = ß, but H13 = 0. We can also treat the situation where the three p AOs approach each other on vertices of an ever-shrinking equilateral triangle. In this case, each AO interacts equally with the other two. This triangular system is the cyclopropenyl system C3H3 shown in Fig. 8-6. The HMO determinantal equation for this system is  x 1 1 1 x 1 1 1 x  =0, x3 +2-3x =0 (8-30) Figure 8-6  The cyclopropenyl system (all nuclei are coplanar). 254 Chapter 8 The Simple H¨uckel Method and Applications This equation can be factored as (x +2)(x -1)(x -1)=0 (8-31) Therefore, the roots are x=-2,+1,+1. Since the root x =1 occurs twice, we can expect there to be two independent HMOs having the same energy–a doubly degenerate level. The energy scheme and ground state electron configuration for the cyclopropenyl radical (three p electrons) (I) gives a total Ep of 3a +3ß. We can surmise from these orbital energies that f1 is a bonding MO, whereas f2 and f3 are predominantly antibonding. To see if this is reflected in the nodal properties of the MOs, let us solve for the coefficients. The equations consistent with the HMO determinant and with orbital normality are c1x +c2 +c3 =0 c1 +c2x +c3 =0 (8-32) c1 +c2 +c3x =0 c2 1 +c2 2 +c2 3 =1 Setting x=-2 and solving gives f1 = 1 v3 .1 + 1 v3 .2 + 1 v3 .3 (8-33) For this MO, the coefficients are all of the same sign, and so the AOs show phase agreement across all bonds and all interactions are bonding. To find f2 and f3 is trickier. We begin by inserting x =±1 into our simultaneous equations. This gives c1 +c2 +c3 = 0 (three times) (8-34) c2 1 +c2 2 +c2 3 = 1 (8-35) With three unknowns and two equations, an infinite number of solutions is possible. Let us pick a convenient one: c1=-c2, c3 =0. The normalization requirement then gives c1 =1/v2, c2=-1/v2, c3 =0. Let us call this solution f2: f2 = 1 v2 .1 - 1 v2 .2 (8-36) We still need to find f3. There remain an infinite number of possibilities, so let us pick one: c1 =1/v2, c2 =0, c3=-1/v2. We have used our experience with f2 to choose c’s that guarantee a normalized f3. Also, it is clear that f3 is linearly independent of Section 8-7 The Cyclopropenyl System: Handling Degeneracies 255 f2 since they contain different AOs. But it is desirable to have f3 orthogonal to f2. Let us test f2 and f3 to see if they are orthogonal: Since S = 0, f2 and f3 are nonorthogonal. We can project out that part of f3 that is orthogonal to f2 by using the Schmidt orthogonalization procedure described in Section 6-10. We seek a new function f3 given by f3 =f3 -Sf2 (8-38) where S = f2f3dv = 1 2 (8-39) Therefore, f3 =f3 - 1 2 f2 = 1 2v2 (.1 +.2 -2.3) (8-40) This function is orthogonal to f2 but is not normalized. Renormalizing gives f3 = 1 v6 (.1 +.2 -2.3) (8-41) In summary, to produce HMO coefficients for degenerate MOs, pick any two independent solutions from the infinite choice available, and orthogonalize one of them to the other using the Schmidt (or any other) orthogonalization procedure. The MOs for the cyclopropenyl system as seen from above the molecular plane are sketched in Fig. 8-7. The MO f2 can be seen to have both antibonding (C1–C2) and nonbonding (C1–C3,C2–C3) interactions. f3 has antibonding (C1–C3,C2–C3) and bonding (C1–C2) interactions. The interactions are of such size and number as to give an equal net energy value (a-ß) in each case. Since nodal planes produce antibonding or nonbonding situations, it is not surprising that higher and higher-energy HMOs in a Figure 8-7  The HMOs for the cyclopropenyl system: (a) f1 =(1/v3)(.1 +.2 +.3); (b) f2 = (1/v2)(.1 -.2) (c) f3 =(1/v6)(.1 +.2 -2.3). The nodal planes intersect the molecular plane at the dashed lines. 256 Chapter 8 The Simple H¨uckel Method and Applications system display more and more nodal planes. Notice that the MOs f2 and f3 have the same number of nodal planes (one, not counting the one in the molecular plane) but that these planes are perpendicular to each other. This is a common feature of some degenerate, orthogonal MOs in cyclic molecules. It is important to notice the symmetry characteristics of these MOs. f1 is either symmetric or antisymmetric for every symmetry operation of the molecule. (It is antisymmetric for reflection through the molecular plane, symmetric for rotation about the threefold axis, etc.) This must be so for any nondegenerate MO. But the degenerate MOs f2 and f3 are neither symmetric nor antisymmetric for certain operations. (f2 is antisymmetric for reflection through the plane indicated by the dashed line in Fig. 8-7, but is neither symmetric nor antisymmetric for rotation about the threefold axis by 120..) In fact, one can easily show that, given a cycle with an odd number of centers, each with oneAO of a common type, there is but one way to combine theAOs (to form a realMO)so that the result is symmetric or antisymmetric for all rotations and reflections of the cycle. Hence, an HMO calculation for a three-, five, seven-, . . . membered ring can give only one nondegenerate MO. However, for a cycle containing an even number of centers, the analogous argument shows that two nondegenerate MOs exist. 8-8 Charge Distributions from HMOs Now that we have a method that provides us with orbitals and orbital energies, it should be possible to get information about the way the p-electron charge is distributed in the system by squaring the total wavefunction .p . In the case of the neutral allyl radical, we have (taking .p to be a simple product of MOs) .p =f1(1)f1(2)f2(3) (8-42) Hence, the probability for simultaneously finding electron 1 in dv(1), electron 2 in dv(2) and electron 3 in dv(3) is .2 p (1, 2, 3)dv(1)dv(2)dv(3)=f2 1 (1)f2 1 (2)f2 2 (3)dv(1)dv(2)dv(3) (8-43) For most physical properties of interest, we need to know the probability for finding an electron in a three-dimensional volume element dv. Since the probability for finding an electron in dv is the sum of the probabilities for finding each electron there, the one-electron density function . for the allyl radical is . =2f2 1 +f2 2 (8-44) where we have suppressed the index for the electron. If we integrate . over all space, we obtain a value of three. This means we are certain of finding a total p charge corresponding to three p electrons in the system. To find out how the p charge is distributed in the molecule, let us express . in terms of AOs. First, we write f2 1 and f2 2 separately: f2 1 = 1 4 .2 1 + 1 2 .2 2 + 1 4 .2 3 + 1 v2 .1.2 + 1 v2 .2.3 + 1 2 .1.3 f2 2 = 1 2 .2 1 + 1 2 .2 3 -.1.3 (8-45) Section 8-8 Charge Distributions from HMOs 257 If we were to integrate f2 1 we would obtain  f2 1 dv = 1 4 -.1   .2 1 dv+ 1 2 -.1   .2 2 dv+ 1 4 -.1   .2 3 dv+ 1 v2 -.0   .1.2 dv + 1 v2 -.0   .2.3 dv+ 1 2 -.0   .1.3 dv = 1 4 + 1 2 + 1 4 =1 (8-46) Thus, one electron in f1 shows up, upon integration, as being “distributed” 1 4 at carbon 1, 1 2 at carbon 2, and 1 4 at carbon 3. We say that the atomic p-electron densities due to an electron in f1 are 1 4 , 1 2 , 1 4 at C1, C2, and C3, respectively, If we accumulate these figures for all the electrons, we arrive at a total p-electron density for each carbon. For the allyl radical, Table 8-1 shows that each atom has a p-electron density of unity. Generalizing this approach gives for the total p-electron density qi on atom i qi = all MOs  k nkc2 ik (8-47) Here k is the MO index, cik is the coefficient for anAO on atom i in MO k, and nk, the “occupation number,” is the number of electrons (0, 1, or 2) in MO k. (In those rare cases where cik is complex, c2 ik in Eq. (8-47) must be replaced by c*ikcik.) If we apply Eq. (8-47) to the cyclopropenyl radical, we encounter an ambiguity. If the unpaired electron is assumed to be inMOf2 of Fig. 8-7, we obtain q1=q2= 76 , q3= 4 6 . On the other hand, if the unpaired electron is taken to be in f3, q1=q2= 56 , q3= 86 . The HMO method resolves this ambiguity by assuming that each of the degenerate MOs is occupied by half an electron. This has the effect of forcing the charge distribution to show the overall symmetry of the molecule. In this example, it follows that q1 =q2 = q3 =1. The general rule is that, for purposes of calculating electron distributions, the electron occupation is averaged in any set of partially occupied, degenerate MOs. TABLE 8-1  HMO p Electron Densities in the Allyl Radical Carbon atom Electron 1 2 3 1 in f1 1 4 1 2 1 4 2 in f1 1 4 1 2 1 4 3 in f2 1 2 0 1 2 – – – Sum 1 1 1 258 Chapter 8 The Simple H¨uckel Method and Applications Figure 8-8  When the equilateral structure is distorted by decreasing R12 and increasing R13, R23, the energies associated with f1, f2, f3 shift as shown. In actuality, the equilateral triangular structure for the cyclopropenyl radical is unstable, and therefore the above-described averaging process is only a theoretical idealization. It is fairly easy to see that a distortion from equilateral to isosceles form will affect the MO energies E1, E2, and E3 differently. In particular, a distortion of the sort depicted in Fig. 8-8 would have little effect on E1 but would raise E2 (increased antibonding) and lowerE3 (decreased antibonding and increased bonding). Thus, there is good reason for the cyclopropenyl radical to be more stable in an isosceles rather than equilateral triangular form. This is an example of the Jahn–Teller theorem, which states, in effect, that a system having an odd number of electrons in degenerate MOs will change its nuclear configuration in a way to remove the degeneracy.3 The preference of the cyclopropenyl radical for a shape less symmetrical than what we might have anticipated is frequently called Jahn–Teller distortion.4 Many times we are interested in comparing the p-electron distribution in the bonds instead of on the atoms. In the integrated expression (8-46) are cross terms that vanish under the HMO assumption of zero overlap. But the overlaps are not actually zero, especially betweenAOs on nearest neighbors. Hence, we might view the factors 1/v2 as indicating how much overlap charge is being placed in the C1–C2 and C2–C3 bonds by an electron in f1. The C1–C3 bond is usually ignored because these atoms are not nearest neighbors and therefore have much smaller AO overlap. Since S12 =S23 =Sij for neighbors i and j in any p system (assuming equal bond distances), we need not include Sij , explicitly in our bond index. If we proceed in this manner, two electrons in 3Linear systems are exceptions to this rule. Problems are also encountered if there is an odd number of electrons and spin-orbit coupling is substantial. The reader should realize that the above statement of the theorem is a little misleading inasmuch as it makes it sound like the molecule finds itself in a symmetric geometry that produces denerate MOs and then “distorts” to a lower-energy geometry. It is actually we who have guessed a geometry that is too symmetric. When our calculations reveal that this results in degenerate orbital energies containing an odd number of electrons, we are alerted that we have erred in our assumption, and that the molecule is really in a less symmetric, lower energy geometry. 4See Salem [1, Chapter 8]. Section 8-9 Some Simplifying Generalizations 259 f would then give us a “bond order” of 2/v2=1.414. It is more convenient in practice to divide this number in half, because then the calculated p-bond order for ethylene turns out to be unity rather than two. Since ethylene has one p-bond, this can be seen to be a more sensible index. As a result of these considerations, the p-bond order (sometimes called mobile bond order) of the allyl radical is 1/v2 = 0.707 in each bond. (Electrons in f2 make no contribution to bond order since c2 vanishes. This is consistent with the nonbonding label for f2.) Generalizing the argument gives, for pij , the p-bond order between nearestneighbor atoms i and j : pij = all MOs  k nkcikcjk (8-48) where the symbols have the same meanings as in Eq. (8-46). In cases in which partially filled degenerateMOsare encountered, the averaging procedure described in connection with electron densities must be employed for bond orders as well. EXAMPLE 8-4 Calculate p13 for the cyclopropenyl radical, using data in Fig. 8-7. SOLUTION  There are 2 electrons in f1 and the coefficients on atoms 1 and 3 are 1 v3 , so this MO contributes 2 × ( 1 v3 )2 = 2/3. We allocate 1 2 electron to f2. Since c3 = 0 in this MO, the contribution to p13 is zero. The remaining 1 2 electron goes to f3, yielding a contribution of 1 2 × 1 v6 × -2 v6 = -1 6 . So p13 = 2 3 - 16 = 1 2 .  8-9 Some Simplifying Generalizations Thus far we have presented the bare bones of the HMO method using fairly small systems as examples. If we try to apply this method directly to larger molecules, it is very cumbersome. A ten-carbon-atom system leads to a 10 ×10 HMO determinant. Expanding and solving this for roots and coefficients is tedious. However, there are some short cuts available for certain cases. In the event that the system is too complicated to yield to these, one can use computer programs which are readily available. For straight chain and monocyclic planar, conjugated hydrocarbon systems, simple formulas exist for HMO energy roots and coefficients. These are derivable from the very simple forms of the HMO determinants for such systems.5 We state the results without proof. For a straight chain of n unsaturated carbons numbered sequentially, x = -2 cos[kp/(n+1)], k=1, 2, . . . ,n (8-49) clk = [2/(n+1)]1/2 sin[klp/(n+1)] (8-50) where l is the atom index and k the MO index. 5See Coulson [2]. 260 Chapter 8 The Simple H¨uckel Method and Applications For a cyclic polyene of n carbons, x = -2 cos(2xk/n), k =0, 1, . . . ,n-1 (8-51) clk = n-1/2 exp[2pik(l -1)/n], i=v-1 (8-52) The coefficients derived from Eq. (8-52) for monocyclic polyenes will be complex when theMOis one of a degenerate pair. In such cases one may take linear combinations of these degenerate MOs to produce MOs with real coefficients, if one desires. There is also a diagrammatic way to find the energy levels for linear and monocyclic systems.6 Let us consider monocycles first. One begins by drawing a circle of radius 2 |ß|. Into this circle inscribe the cycle, point down, as shown in Fig. 8-9 for benzene. Project sideways the points where the polygon intersects the circle. The positions of these projections correspond to the HMO energy levels if the circle center is assumed to be at E =a (see Fig. 8-9). The number of intersections at a given energy is identical to the degeneracy. The numerical values for E are often obtainable from such a sketch by inspection or simple trigonometry. For straight chains, a modified version of the above method may be used: For an ncarbon chain, inscribe a cycle with 2n+2 carbons into the circle as before. Projecting out all intersections except the highest and lowest, and ignoring degeneracies gives the proper roots. This is exemplified for the allyl system in Fig. 8-10. Examination of the energy levels in Figs. 8-9 and 8-10 reveals that the orbital energies are symmetrically disposed about E =a. Why is this so? Consider the allyl system. The lowest-energy MO has two bonding interactions. The highest-energy MO differs only in that these interactions are now antibonding. [See Fig. 8-5 and note that the coefficients in f1 and f3 are identical except for sign in Eqs. (8-27) and (8-29).] The role of the ß terms is thus reversed and so they act to raise the orbital energy for f3 just as much as they lower it for f1. A similar situation holds for benzene. As we will see shortly, the lowest energy corresponds to an MO without nodes between atoms, so this is a totally bonding MO. The highest-energyMOhas nodal planes between all neighbor carbons, and so every interaction is antibonding. An analogous argument holds for the degenerate pairs of benzene MOs. These observations suggest that the energy of an Figure 8-9  HMO energy levels for benzene produced by projecting intersections of a hexagon with a circle of radius 2 |ß|. 6See Frost and Musulin [3]. Section 8-9 Some Simplifying Generalizations 261 Figure 8-10  HMO energy levels for the allyl system (n=3) produced by projecting the intersections of an octagon (n=2×3+2) with a circle of radius 2 |ß|. MO should be expressible as a function of the net bond order associated with it, and this is indeed the case. The energy of the ith MO is given by the expression Ei =  fiHˆpfi dv =  k cki.kHˆp l cli.l dv (8-53) = k  l ckicli  .kHˆp.l dv (8-54) When the atom indices k and l are identical, the integral is equal to a; when k and l are neighbors, it equals ß. Otherwise it vanishes. Hence, we may write Ei = k c2 kia + neighbors  k,l ckicliß (8-55) However, c2 ki is qk,i , the electron density at atom k due to one electron in MO fi , and ckicli is pkl,i , the bond order between atoms k and l due to an electron in fi . Therefore, Ei = k qk,ia +2 neighbors  kEBm , but compound B should have the lower reduction potential. A plot of oxidation potential versus Em in units of ß for a series of aromatic hydrocarbons (Fig. 8-20) yields a correlation that is remarkably free of scatter. Figure 8-21 indicates a similar correlation for reduction potential versus Em+1. Since oxidation-reduction potentials correlate with HOMO–LUMO energies, and since these energies are paired in even alternant systems, we should expect a plot of oxidation vs. reduction potential for such compounds to be linear also. (Problem 8-14.) The data in Figs. 8-20 and 8-21 provide a connection between theoretical energy differences in units of ß, and experimental energies. From the slope in Fig. 8-20 we 276 Chapter 8 The Simple H¨uckel Method and Applications Figure 8-20  Oxidation potentials in acetonitrile solution versus energy of HOMO (in units of ß). ( 1/2 from Lund [10].) obtain that ß~= -2.03 eV, or -46.8 kcal/mole. Similarly, the reduction potential data of Fig. 8-21 give ß =-2.44eV=-56.3 kcal/mole. One must be cautious in interpreting the above values of ß. The problem is that the experimental numbers include effects of physical processes not included in the theory. For example, when a neutral molecule in solvent becomes oxidized or reduced, solvation energy changes occur. One might argue that the fit of the data to straight lines in Figs. 8-20 and 8-21 implies this sort of contribution to be small, but the sensitivity of redox potentials to solvent nature indicates that this is not the case. However, in larger molecules, solvation energy change upon ionization tends to be smaller, and larger molecules also tend to have Em and Em+1 closer to the E = a level. In other words, we expect both solvation energy change and redox potential to be proportional to Em or Em+1. Therefore, they can be combined in a single linear relation. There Section 8-14 Orbital Energies and Oxidation-Reduction Potentials 277 Figure 8-21  First reduction potential in 2-methoxyethanol versus energy of LUMO (in units of ß). ( 1/2 from Bergman [11].) is, in general, no guarantee that “extra” contributing effects will be of a nature to be correctly assimilated into the theoretical formula, but, for this effect, it happens that this is at least partially the case. Another extra effect to consider is the p-electron repulsion energy. Because the energies of the HOMO and LUMO are both associated with the neutral molecule, their energies fail to reflect the decrease or increase of p-electron repulsion energy resulting from loss or gain of a p electron. Here again, however, we expect the magnitude of the effect to be larger for smaller molecules, where the change in p densities is greatest, and also where Em and Em+1 deviate most from a. Thus, as before, this extra effect will not necessarily upset the linear relation expected from simpler considerations. In view of the crudity of the HMO method together with the fact that the empirical value of ß includes the effects of several extra processes, the ß values cited above for oxidation and reduction are considered to be in fairly good agreement, especially considering that the electron repulsion change operates to make oxidation easier and reduction harder. 278 Chapter 8 The Simple H¨uckel Method and Applications 8-15 Orbital Energies and Ionization Energies Suppose that monochromatic light is beamed into a gaseous sample of a compound. If the light is of sufficient energy, electrons will be “knocked out” of the molecules. The kinetic energy of such a photoelectron will be equal to the kinetic energy of the incident photon (h.) minus the energy needed to remove the electron from the molecule, that is, the ionization energy. Measurement of the kinetic energies of photoelectrons emitted in this manner is known as “photo-electron spectroscopy” [12]. In measuring the kinetic energies of photoelectrons from, say, benzene, it is found that a large number are near a particular energy value, another large number of electrons are near a different value, and so on for several kinetic energy values. Because the photoelectrons tend to clump near several kinetic energy values, it follows that we are, in effect, measuring several ionization energies. It is reasonable to associate these with removal of electrons from different MOs of the molecule.13 A correlation plot (Fig. 8-22) between HMO orbital energies and experimentally measured ionization energies for a number of alternant and nonalternant hydrocarbons has been produced by Brogli and Heilbronner [13]. Their best fit was achieved using a=-6.553±0.340eV,ß =-2.734±0.333 eV, where the limits define a range for the predicted ionization energy (IE) that will include the experimental value nine HMO values for IE (eV) Experimental IE (eV) 13 7 7 13 12 12 11 11 10 10 9 9 8 8 Figure 8-22  Experimental ionization energies for alternant and nonalternant hydrocarbons versus HMO orbital energies using a=-6.553eV,ß=-2.734 eV. 13Note that each ionization energy is associated with removal of an electron from a different MO of the neutral molecule. This differs from the first, second, etc. ionization energies produced by successive ionization. Section 8-16 p-Electron Energy and Aromaticity 279 times out of ten (a 90% confidence level). There is a fair degree of scatter in the correlation plot. By making some additional refinements in the theory, Brogli and Heilbronner succeeded in substantially improving the correlation. We will describe these refinements in Section 8-18. In this section, we have dealt only with ionization energies assigned to removal of a p electron. Removal of electrons from s orbitals is also observed. If the impinging photons are of X-radiation frequencies, the inner shell electron ionization energies are seen. Quantum-chemical treatments of these ionization energies are possible using theoretical methods described later. 8-16 p-Electron Energy and Aromaticity When propene is hydrogenated to form propane, the increase in heat content H. 298 (called the heat of hydrogenation) is-30.1 kcal/mole. The heat of hydrogenation for 1- butene is -30.3 kcal/mole. In general, the heat of hydrogenation of an isolated double bond is about-30 kcal/mole. Asimilar constancy holds for the contribution of a double bond to the heat of formation of a molecule. Therefore, isolated double bonds fit easily into the usual chemical device of estimating the energy of a molecule by adding together contributions from the substituent parts. This additivity appears to be violated when double bonds are conjugated. Thus, the heat of hydrogenation for trans-1,3-butadiene is -57.1 kcal/mole compared with the value of -60.6 kcal/mole for a pair of butene double bonds. Since the hydrogenation product in each case is butane, the energy difference of 3.5 kcal/mole must be due to the greater stability of conjugated double bonds. There is a theoretical parallel to this. TheHMOenergy for butadiene is 4a+4.472ß. For a pair of isolated double bonds, we double the energy of ethylene to obtain 4a+4ß. Therefore, the HMO method indicates that the conjugated double bonds are stabilized by 0.472ß. Because the p electrons in butadiene are delocalized over all three C–C bonds, this 0.472ß has often been referred to as delocalization energy. It was common practice for many years to equate this theoretical delocalization energy for a molecule to its experimentally measured “extra” stability (e.g., for butadiene, 0.472 |ß|=3.5 kcal/mole). In 1969, Dewar and co-workers [14, Chapter 5; 15; 16] demonstrated that conjugated double bonds may be successfully included in an additivity scheme. Hess and Schaad [17, 18] have considered this idea in the context of the HMO method. They distinguish between several kinds of C–C single and double bond energy as indicated in Table 8-2. Using these values of bond energy, the p energy of butadiene is calculated to be 2× 2.000ß + 0.4660ß + 4a = 4a + 4.4660ß compared with the HMO result of 4a + 4.472ß. The difference is 0.006ß, less than 0.002ß per p electron. Hess and Schaad show that this level of agreement holds for acyclic polyenes in general, even when there is much branching. Thus, it seems that conjugated double bonds in acyclic molecules fit into an additivity scheme after all. There is an important difference between the energy additivity scheme for conjugated systems described above and the familiar additivity scheme used for C–H, C–C, and isolated C–C bonds. In the latter cases the energy “contributed” by the bond is generally thought of as being the same as the energy of that bond in the molecule (at least in an averaged way–some care must be exercised with definitions), and, furthermore, the bond is thought to be fairly independent of the identity of the molecule. For instance, 280 Chapter 8 The Simple H¨uckel Method and Applications TABLE 8-2  p-Bond Types and Effective Binding Energies for Carbon–Carbon Double and Single Bonds Bond type Effective binding energy in units of ß 2.0000 (cis or trans) 2.0699 2.0000 2.1083 2.1716 (cis or trans) 0.4660 (cis or trans) 0.4362 (cis or trans) 0.4358 aThe eight numbers associated with these bonds are not unique. They satisfy six simultaneous equations. Hence, any two of them may be given arbitrary values and the remaining six found by solving the simultaneous equations. Different arbitrary assignments lead to different sets of numbers, all equally valid and all giving identical p-electron energies. These numbers can only be applied when a bond has an unambiguous formal identity, which means we must deal with acyclic polyenes (even number of centers and electrons). a C–H bond in butane is very similar to one in heptane. But this is not the case for conjugated molecules. Inspection of Fig. 8-23 shows that a single C–C bond between (formal) double bonds varies significantly in bond order from molecule to molecule. Since the total p-electron energy depends on bond order [Eq. (8-58)], we can tell at once that the actual theoretical energy contribution due to such a “single” bond varies from 2×0.4472=0.8944ß in butadiene to 1.088ß in decapentaene. However, examination of Fig. 8-23 reveals that, in going from molecule to molecule, as the “single” bonds increase in p-bond order, the “double” bonds decrease in order. Thus, adding an additional C–C–C group to a chain adds a constant amount of bond energy (about 2.54ß) to the total p energy, but this energy increment contains contributions from bond order changes over the whole molecule. We have, then, an additive scheme for Section 8-16 p-Electron Energy and Aromaticity 281 Figure 8-23  HMO bond orders for butadiene, hexatriene, octatetraene, and decapentaene. a delocalized effect. For this reason, the bond energy contributions of Table 8-2 are called effective bond energies. We are nowin a position to consider the concept of aromaticity. The term “aromatic” originally referred to organic molecules having pleasant odors. Later it referred to a class of molecules having a high degree of unsaturation. Benzene was recognized as the parent compound for many such molecules, and the term aromatic has come to mean having chemical properties peculiar to benzene and some of its relatives. The chemical stability of these molecules, their relatively low heats of combustion or of hydrogenation, and their tendency to prefer substitution rather than addition (thereby preserving their p systems intact) distinguish these molecules from ordinary polyenes and have come to be called aromatic properties, or manifestations of aromaticity.14 These properties suggest that the p electrons in aromatic systems are unusually low in energy, contributing to both the thermodynamic and the kinetic stability of the systems. We can test whether this is the case by calculating an expected p energy for benzene using the bond energies of Table 8-2 for an alternating single-double bond, or Kekul´e, structure. The result, 6a +7.61ß, is significantly less stable (by 0.39ß) than the HMO energy of 6a +8ß. We shall refer to this difference as the resonance energy (RE) of the system. RE =En(HMO)-En (from Table 8-2) (8-63) By this definition, a positive RE (in units of ß) corresponds to extra molecular stabilization. If we divide the RE by the number of p electrons, we obtain the RE per electron (REPE). Hence, the REPE for benzene is 0.065ß. Following Dewar we shall refer to a system having significantly positive REPE as aromatic, significantly negative REPE as antiaromatic, and negligible REPE as nonaromatic, or polyolefinic. (Recall that ß is a negative quantity. When REPE is tabulated in eV, it is conventional to use the absolute value of ß in eV so as to retain positive REPE for aromatic molecules.) Cyclic polyenes differ from acyclic polyenes in that many of them show significant values for REPE. Some results of Schaad and Hess [18] are reproduced in Fig. 8-24. 14The definition of the term “aromatic” is not generally agreed upon. Various criteria based on structural, magnetic, or reactivity properties have been proposed. See Minkin et al. [19] for a general review, and Schleyer et al. [20] for consideration of ring-current-induced nmr chemical shifts as an aromaticity probe. 282 Chapter 8 The Simple H¨uckel Method and Applications It is evident that a strong correlation exists between REPE and the presence or absence of aromatic properties. The change from very aromatic to very antiaromatic nature as we go from a sixmembered to a four-membered system is striking, If we examine the HMO energy levels, it is not difficult to see why these monocycles are so different. All six p electrons in benzene occupy bonding MOs, whereas cyclobutadiene has two bonding and two nonbonding electrons (XII). The situation is optimal for benzene not only because all of its p electrons are bonding, but also because all of the bonding levels are fully occupied. Because all monocycles have a nondegenerate lowest level followed by higher-energy pairs of degenerate levels, it is not hard to describe the conditions that should produce maximum stability. There should be 4n+2 electrons (2 for the lowest level and 4 for each of the n higher bonding levels), and there should be more than 4n centers (either 4n+1, 4n+2, or 4n+3) to force the n doubly degenerate, occupied levels to all be bonding. The stability of cyclopentadienyl anion (6 electrons, 5 centers) and cycloheptatrienyl cation (6 electrons, 7 centers) was correctly predicted from these simple considerations (XIII). Extensive research has gone on in efforts to find examples Figure 8-24  Resonance energy per electron for a number of molecules. (From Schaad and Hess [18].) Section 8-16 p-Electron Energy and Aromaticity 283 where n = 1, but results are often complicated by angle strain and deviation from planarity in the molecule.15 EXAMPLE 8-7 What would the above approach predict for the relative stabilities of cyclic ions C5H+5 and C5H-5 ? SOLUTION  C5H-5 , the cyclopentadienyl anion shown in XIII, satisfies the 4n+2 rule and should be unusually stable. C5H+5 has only four p electrons, hence should be less stable. (It has two fewer bonding electrons than C5H-5 .)  In a molecule containing both cyclic and acyclic parts, does the RE arise only from the cyclic part? For certain cases, the answer is yes. Dewar has recognized that side or connecting chains contribute nothing to the RE of a system when the chain in question is the same in all formal structures for the molecule. For example, stilbene can be written in four equivalent formal ways, as shown below. The linking chain is identical in all cases, and it should not contribute to the RE. Calculations confirm that the RE for stilbene is just double that for benzene. The intimate relation between aromaticity and the possibility for more than one equivalent formal structure for a molecule has long been recognized. These “mobile” bonds tend to favor equal bond lengths in contrast to the strong alternation characteristic of acyclic polyenes. In benzene, the extra stabilization may be viewed as resulting from the fact that all six bonds are identical and have a higher bond order (0.667) than the average of 0.634 for double and single acyclic bonds. In antiaromatic cyclobutadiene, the four identical bonds have a bond order of 0.5, which is significantly below the 15Research on aromaticity has been extended to Mobius conjugated hydrocarbons, where aromatic behavior should be associated with 4n electrons (see Herges [21]), and also to clusters of boron or silicon atoms (see Ritter [22]), as well as metal atoms (see Chen et al. [23]). 284 Chapter 8 The Simple H¨uckel Method and Applications average for acyclics. The very different bond orders in these two molecules is fully consistent with the energy argument based on the 4n+2 rule described earlier.16 To summarize, the p bonds in acyclic polyenes exhibit delocalization in the sense that bond orders are not transferable from one molecule to another, additivity in effective bond energies, immobility in formal bonds. Cyclic molecules do not exhibit additivity of effective energies or immobility of formal bonds. Their energy deviations from energy calculated assuming additivity and immobility are good indicators of kinetic and thermodynamic stability. Aromatic molecules possess extra stability because their p electrons17 are more bonding than those in acyclic polyenes. Antiaromatics are unstable because their p electrons are less bonding. From a consideration of experimental heats of atomization, Schaad and Hess have evaluated ß to be -1.4199 eV. The physical processes involved in dissociating a gasphase molecule into constituent atoms are quite different from those involved in adding or removing a p electron from a molecule in a solvent. Therefore, it is not surprising that the ß value obtained from heats of atomization differs substantially from values obtained from redox experiments. Indeed, it is this variability of ß as we compareHMO theory with different types of experiment that compensates for many of the oversights and simplifications of the approach. It is remarkable that, with but one such parameter, HMO theory does as well as it does. 8-17 Extension to Heteroatomic Molecules The range of application of the HMO method could be greatly extended if atoms other than carbon could be treated. Consider pyridine as an example (XV). A p electron at a carbon atom contributes an energy a to Ep . The contribution due to a p electron at nitrogen is presumably something different. Let us take it to be a = a +hß, where h is a parameter that will be fixed by fitting theoretical results to experiment. If the p electron is attracted more strongly to nitrogen than to carbon, h will be a positive 16It has been noted that bond length equalization associated with bond mobility results in it energy lowering when the C–C–C angles are near 120.. When the angle is very different from this, p energies are higher than expected. This has led to suggestions that “strain energy” may be an important factor in aromaticity. Because of the present lack of a quantum mechanical quantity equivalent to strain energy, and because the HMO method may include effects of s electrons in an implicit but poorly understood way, it is very hard to know whether such suggestions are at variance with other statements or are simply equivalent to them but stated from a different viewpoint. 17It is not necessarily true that all the “extra” stability of aromatic molecules is attributable to p-electron effects; s-electron energies also depend on bond lengths and bond angles. Hence, we may be seeing, once again, a situation where the p-electron treatment includes other effects implicitly. Schaad and Hess [18] indicate that s energies and p energies are indeed simply related over the bond-length range of interest. Section 8-17 Extension to Heteroatomic Molecules 285 number. In a similar spirit, we will take the energy of a p electron in a C–N bond to be ß =kß and evaluate k empirically. Not surprisingly, the values of h and k appropriate for various heteroatoms depend somewhat on which molecules and properties are used in the evaluation procedure. A set of values compiled and critically discussed by Streitwieser [7] is given in Table 8-3. Other sets have been published.18 The dots over each symbol indicate the number of p electrons contributed by the atom. In pyridine, the formal bond diagram indicates a six p-electron system, implying that the nitrogen atom contributes one p electron. We also can argue that, of the five TABLE 8-3  Parameters for Heteroatoms in the H¨uckel Methoda Heteroatom h Heteroatomic bond k N. 0.5 C.—. .N¨ 1.0 N¨ 1.5 C—N¨ 0.8 +N 2.0 . O 1.0 C—— O. 1.0 O¨ 2.0 C—O¨ 0.8 +O 2.5 N—O¨ 0.7 ¨F 3.0 C—¨F 0.7 C¨l 2.0 C—C¨l 0.4 B¨r 1.5 C—B¨r 0.3 Sb 0.0 C—S 0.8 S 0.0 C—S 0.8 S—S 1.0 Methyl (inductive C. a—Me)-0.5 hCa =-0.5 none — Methyl (heteroatom C. a—M¨e)0.2 hMe = 0.2 Ca—Me 0.7 Methyl (conjugative C. a—C—¨ H3) hCa =-0.1 Ca—C 0.8 hc =-0.1 C—H3 3.0 hH3 =-0.5 aConsistent with the philosophy of this approach is a distinction between single, double, and intermediateC—C bonds. Streitwieser recommends kC-C =0.9, kC... -C =1.0, kC=C =1.1. bSulfur is treated as a pair of AOs with a total of two p electrons, i.e., a sulfur in an aromatic ring is formally treated as two adjacent atoms S and S with the indicated parameters. 18See McGlynn et al. [24, p. 87]. 286 Chapter 8 The Simple H¨uckel Method and Applications valence electrons of nitrogen, two are involved in s covalent bonds with neighboring carbons, two more are in a s lone pair, leaving one for the p system. Therefore, the atom parameter to use for this molecule is h=0.5. The pyridine ring, like benzene, admits two equivalent structural formulas, and so the C–N bonds should be intermediate between double and single, symbolized C.–..N. in Table 8-3. Since k=1.0 in this case, ß=ß, and pyridine will have an HMO determinant differing from the benzene determinant only in the diagonal position corresponding to the nitrogen atom–the 1, 1 position according to our (arbitrary) numbering scheme. For this position, instead of x, we will have x =(a -E)/ß =(a +0.5ß -E)/ß =(a -E)/ß +0.5ß/ß =x +0.5 (8-64) The pyrrole molecule has a nitrogen atom of the type N¨ (XVI). Since three valence electrons of nitrogen are in covalent s bonds, two remain for inclusion in the p system. Therefore, pyrrole has a total of six p electrons. The unique structural formula indicates that theC–N¨ bond is formally single, and k=0.8, h=1.5 are the appropriate parameters here. Also, the carbon–carbon bonds are nowformally single or double. If we choose to distinguish among these bonds using the parameters in note a of Table 8-3, the resulting HMO determinant is  x +1.5 0.8 0 0 0.8 0.8 x 1.1 0 0 0 1.1 x 0.9 0 0 0 0.9 x 1.1 0.8 0 0 1.1 x  The methyl group can also be incorporated into the HMO method. Several approaches have been suggested. One is simply to modify the coulomb integral a for the carbon to which the methyl group is attached. A methyl group is thought to release sigma electrons to the rest of the molecule as compared to a substituent hydrogen. This suggests that an atom having a methyl group attached to it will be a bit electron rich and hence will be less attractive to p electrons. Use of a negative h parameter for this carbon is appropriate. This method is called the inductive model. Use of the inductive model does not add any new centers or any more p electrons to the conjugated system to which the methyl group is attached: The carbon to which the methyl is bonded is merely treated as a less attractive atom. A second approach is to treat the methyl group itself as a heteroatom. As we shall see shortly, the methyl group has two electrons that Section 8-18 Self-Consistent Variations of a and ß 287 Figure 8-25  Three MO symmetry solutions for a methyl group attached to a benzene ring. can participate (to a slight extent) in the p system, and so use of this heteroatom model adds one more center and two more p electrons for each methyl group included in this way. A third approach is the conjugative model. Because the methyl group has local threefold symmetry, one of the s MOs for the methyl group resembles the p MOs of cyclopropenyl in symmetry characteristics. The three types of symmetry solutions are given in Fig. 8-25 for a methyl group on a benzene ring (compare with Fig. 8-7). Notice that the MO at the right of the figure has its phases arranged so that it has nonzero overlap with a p AO on the benzene ring. This means that the two electrons in this “methyl group MO” can participate in the p system of the molecule. To emulate this picture in our HMO determinant, we must add two p electrons and two more centers to our system (one for C and one for H3) and find h and k values for the two new centers and bonds. The inductive effect of the methyl group on the neighboring ring carbon is often included in this model. In most cases, it is probably safe to say that the conjugative model is superior to the heteroatom model, which is, in turn, better than the inductive model, but no extensive critical comparison of these three models has been made. Parameters for all three approaches are included in Table 8-3. (Ca is the ring carbon.) 8-18 Self-Consistent Variations of a and ß Efforts have been made to improve the HMO method by taking account of molecular p charge distribution. Suppose that we carry out anHMOcalculation on a nonalternant molecule and find an electron density of 1.2 at one carbon and 0.8 at another. It is reasonable to argue that a p electron at the latter carbon is more strongly bound because it experiences less repulsion from other p electrons there. We can try to account for this by making a at that atom more negative. Thus, we could take ai =ai +.(1-qi)ß (8-65) where qi is the p-electron density at atom i and . is a parameter (assumed positive) to be fixed empirically. If q <1, then a is more negative than a. If q >1, a is less negative. Having now modified a (using a trial value for .), we must set up our new HMO determinant and solve it again. This yields new MOs, new values of qi , and therefore new values of a. We repeat this process over and over until electron densities remain essentially unchanged for two successive iterations. At this point, the electron densities leading to the HMO determinant are the same as those produced by the determinant, and the solution is said to be self-consistent with respect to electron densities. This procedure, often referred to as the “. technique,” discourages extreme deviations of electronic densities from the “norm” of unity at each carbon and thereby helps to compensate for the lack of explicit inclusion of p electron repulsion 288 Chapter 8 The Simple H¨uckel Method and Applications in the HMO method. Streitwieser’s calculations have led him to favor a value for . of 1.4. A similar idea has been applied to variations of the bond integral ß. Suppose that we carry out an HMO calculation and find a p-bond order of 0.5 in one bond and 0.9 in another. We expect that the latter bond is in fact shorter than the former. We could roughly predict how much shorter it is by using the bond-order–bond-length relation described earlier. It is reasonable to modify ß in these bonds on the basis of predicted length differences, set up a new HMO determinant, solve again, find new bond orders, and iterate until self-consistency is achieved with respect to bond orders. These modifications to the simple HMO method improve predictions of some properties but not others. For example, Brogli and Heilbronner [13] have found that orbital energy correlation with ionization energy, determined by photoelectron spectroscopy, is significantly improved through inclusion of the effects of bond length variations in the neutral molecule and the cation. This improved correlation, shown in Fig. 8-26, showed no additional improvement upon subsequent variation of a as a function of electron density. This is reasonable, since ß variation affects primarily bond order, hence MOenergy, and that is the property measured by photoelectron spectroscopy. Variation of a shifts charge from atom to atom, but has smaller energy effects. On the other hand, p-electron contributions to dipole moments, calculated from electronic excess or deficiency at each center, are very sensitive to variation of a, and quite insensitive 13 12 11 10 HMO values for IE (eV) 9 8 7 7 8 9 10 Experimental IE (eV) 11 12 13 Figure 8-26  Experimental ionization energy for alternant and nonalternant hydrocarbons versus orbital energy using a modified HMO technique which includes provisions for bond length variation in molecule and cation. (Compare with Fig. 8-22.) Section 8-19 HMO Reaction Indices 289 to variation of ß. Again, this is reasonable because dipole moments are sensitive to electronic distribution rather than MO energy. 8-19 HMO Reaction Indices In this section, we discuss some applications of the HMO method to reactivities of conjugated molecules. The reactions of conjugated molecules that have received most of this theoretical treatment are: 1. electrophilic aromatic substitution (XVII) 2. nucleophilic aromatic substitution (XVIII) 3. radical addition (XIX) For any of these reactions, we imagine there to be a path of least energy connecting reactants with products. For the two distinct reaction positions, 1 and 2 on naphthalene, the activation energies may differ, as indicated in Fig. 8-27. The problem is somehow to relate the differences in (inferred from relative rate data) to a number based on quantum chemical calculations. To do this in a sensible way requires that we have some idea of the detailed way in which the reaction proceeds– we have to know what the reaction coordinate is. In some cases, this is fairly well known. For electrophilic aromatic substitution reactions, evidence suggests that a 290 Chapter 8 The Simple H¨uckel Method and Applications Figure 8-27  Generalized energy versus reaction coordinate for reaction at two positions in naphthalene. positive electrophile (e.g., Cl+) approaches the substrate (say, naphthalene). As it draws closer, it causes a significant polarization of the p-electron charge distribution, drawing it toward the site of attack. Ultimately, it forms a partial bond with the carbon. At this stage, the carbon has already begun to loosen its bond to hydrogen, but it is at least partially bonded to four atoms (XX). This means that the ability of the carbon to participate in the aromatic system is temporarily hampered. At this point, the system is at or near the transition state. Thereafter, as H+ leaves, the potential energy decreases and Cl moves into the molecular plane. A similar detailed mechanism is thought to apply for nucleophilic reactions except that the attacking group is negative. For attack by a neutral radical, electrostatic attraction and charge polarization should not be significant factors. The radical bonds to the site of attack to produce a more-orless tetrahedrally bonded carbon. This again leads to an interruption of the p system, but now it is not temporary as it was in the substitution reactions. Based on these simple pictures, a number of MO quantities, often referred to as reaction indices, have been proposed as indicators of preferred sites for reaction. It is useful to divide these into two categories–those purporting to relate to early stages of the reaction, and those specifically related to the intermediate stage. Perhaps the most obvious reaction index to use for the earliest stages of electrophilic or nucleophilic reactions is the p-electron density. If Cl+ is attracted to p charge, it should be attracted most to those sites where p density is greatest. (Such an ion should be attracted to sites having excessive s charge density also, but our basic HMO assumptions ignore any variations in s density.) For an alternant hydrocarbon like naphthalene, all p densities are unity, so this index is of no use. For nonalternant molecules, however, it can be quite helpful. Azulene has varying HMO p densities (XXI). (More sophisticated calculations described in future chapters are in qualitative Section 8-19 HMO Reaction Indices 291 agreement with these p-electron density variations.) Experimentally, it is found that electrophilic substitution by Cl occurs almost entirely at position 1 (or 3). Nucleophilic substitution by CH3 (from CH3 Li) occurs at the position of least p-electron density, namely 4 (or 8). The charge density index refers to the nature of the molecule before allowance is made for perturbing effects due to the approaching reactant. Such a method is often called a “first-order” method, a terminology that is discussed more fully in Chapter 12. For alternant molecules, it is necessary to proceed to a high-order method, one that reflects the ease with which molecular charge is drawn toward some atom, or pushed away from it, as approach by a charged chemical reactant makes that atom more or less attractive for electrons. An index which measures this is called atom self-polarizability, symbolized pr,r . The formulas for this and related polarizabilities are derived in Chapter 12. For now, we simply note that the formula is pr,r = .qr .ar =4 occ  j unocc  k c2 rj c2 rk Ej -Ek (8-66) A larger absolute value of prr means that a larger change in p density qr occurs as a result of making atom r more or less attractive for electrons. (pr,r is negative since Ej - Ek is negative. This makes physical sense because it means that if dar is negative, making atom r more attractive, dqr is positive, indicating that charge accumulates there.) Since the most polarizable site should most easily accommodate either a positive- or a negative-approaching reactant, this index should apply for both electrophilic and nucleophilic reactions. For naphthalene, the values are p11=-0.433/ |ß|,p22=-0.405/ |ß|. This agrees with the experimentally observed fact that the 1 position of naphthalene is more reactive for both types of reaction. Examination of Eq. (8-66) indicates that the MOs near the energy gap between filled and empty MOs will tend to contribute most heavily to pr,r because, for these, Ej - Ek is smallest. For this reason, the highest occupied and lowest unfilled MO (HOMO and LUMO) are often the determining factor in relative values of prr . Fukui19 named these the frontier orbitals and suggested that electrophilic substitution would occur preferentially at the site where the HOMO had the largest squared coefficient. In nucleophilic substitution, the approaching reagent seeks to donate electronic charge to the substrate, so here the largest squared coefficient for theLUMOshould determine the preferred site. For even alternants like naphthalene, the pairing theorem forces these two MOs to have their absolute maxima at the same atom. The HOMO–LUMO coefficients for naphthalene are 0.425 and 0.263 for atoms 1 and 2, respectively, in accord with our expectations. For the nonalternant molecule azulene, discussed above, the largest HOMO coefficient occurs at atoms 1 and 3, which have already been mentioned to be the preferred sites for electrophilic attack. The largest LUMO coefficient occurs at 19See Fujimoto and Fukui [25]. 292 Chapter 8 The Simple H¨uckel Method and Applications atom 6, with atoms 4 and 8 having the second-largest value (see Appendix 6). Atoms 4 and 8 are the preferred sites for nucleophilic attack. Here, then, is a case where the charge density and frontier MO indices are not in agreement. The results suggest that, when significant p-density variations occur, this factor should be favored over higher-order indices. However, the whole approach is so crude that no ironclad rule can be formulated. For radical attack, some other index should be used, for we do not expect electrostatic or polarization effects to be important in such reactions. An index called the free valence20 has been proposed for free radical reactions. One assumes that the free radical begins bonding to a carbon atom in early stages of the reaction and that the ease with which this occurs depends on how much residual bonding capacity the carbon has after accounting for its regular p bonds. Thus, free valence is taken to be the difference between the maximum p bonding a carbon atom is capable of and the amount of p bonding it actually exhibits in the unreacted substrate molecule. The extent of p bonding is taken as the sum of all the orders of p bonds involving the atom in question. A common choice of reference for a maximally bonded carbon is the central atom in trimethylenemethane in its planar conformation (which is not the most stable). (XXII). Each bond in this neutral system has a p-bond order of 1/v3, and so the total p-bond order associated with the central carbon is v3.21 Using this as reference, the free valence for some atom r in any unsaturated hydrocarbon is defined as Fr =v3 - neighbors of r  z prs (8-67) A common way of representing the situation schematically is indicated in (XXIII) for butadiene. Bond orders are indicated on the bonds and free valences by arrows. It is clear that butadiene has a good deal more “residual bonding capacity” on its terminal atoms, and this is consistent with the fact that free radical attack on butadiene occurs predominantly on the end atoms. Other examples of correlation between free valence and rate of free radical addition have been reported.22 A plot of rate data for methyl 20See Coulson [26]. 21Sometimes the three sigma bonds are included in this calculation, giving 3+v3. This has no effect on the question of relative values of Fr . 22See Streitwieser [7] and Salem [1]. Section 8-19 HMO Reaction Indices 293 Figure 8-28  Rate data for methyl radical addition are plotted against the maximum free valence found in each molecule. The original “methyl affinity” (Levy and Szwarc [27]) has been multiplied by 6/m, where m is the number of sites having maximum free valence. (The asterisks in the figure identify these sites.) radical addition to conjugated molecules versus the largest free valence of the molecule is shown in Fig. 8-28. (We assume that the kinetics is dominated by the atom(s) having the maximum free valence.) The indices described above are most appropriate for indicating the relative ease of reaction in the early stages. By the time the reactants have reached the transition state, the substrate is quite far from its starting condition, so charge densities, polarizabilities, free valences calculated from the wavefunction of the unperturbed molecule may no 294 Chapter 8 The Simple H¨uckel Method and Applications longer be very appropriate. If the energy curves being compared through our indices behave in the simple manner described in Fig. 8-27, so that the higher-energy curve in early stages is also the higher-energy curve in the region of the transition state, such indices can be useful. Also, such simple behavior is very likely to occur when we compare a single type of reaction down a series of molecules of similar type, as in Fig. 8-28 (see also Problem 8-25). Experience, however, has indicated that indices more closely linked to the nature of the transition state are more generally reliable. We now describe one such reactivity index. We mentioned earlier that addition and substitution reactions are expected to interrupt the p system at the site of attack. For substitution reactions, this interruption is only temporary and is presumably most severe in the transition state. For addition reactions, it is permanent. The localization energy is defined as the p energy lost in this process of interrupting the p system. As an example, let us return to the naphthalene molecule. The situations resulting from interruption of the p system by neutral radical attack at positions 1 and 2 are illustrated in (XXIV). The remaining unsaturated fragment is, in each case, a neutral radical. [If attack were by a negative ion (nucleophilic), the fragment would be topologically the same but would be negatively charged. Likewise, attack by a positive electrophilic reagent leads to a positively charged fragment.] No matter where attack occurs, our p energy must go from 10a+···ß to 9a+···ß. This decrease by a is thus not expected to differ from case to case and hence is ignored in our localization energy calculation. The decrease in p energy is thus 2.299 |ß| for attack at position 1 and 2.480 |ß| for position 2. These localization energies indicate that attack at position 1 should be favored since the energy cost is smaller there, and this is in accord with observation. (Localization energies are symbolized L·r , L+r , L-r depending, respectively, on whether attack is by a free radical, an electrophilic cation, or a nucleophilic anion.) It is interesting to compare the various indices we have discussed for a single molecule to see how well they agree. Data for azulene are collected in Table 8-4. Experimentally, azulene is known to preferentially undergo electrophilic substitution at positions 1 (and 3), nucleophilic substitution at positions 4 (and 8), and radical addi- Section 8-20 Conclusions 295 TABLE 8-4  Reactivity Indices for Azulene r HOMO LUMO (atom no) qr -|ß|pr,r c2 r c2 r Fr Lr+(|ß|) Lr•(|ß|) Lr-(|ß|) 1 1.173 0.425 0.2946 0.0040 0.480 1.924 2.262 2.600 2 1.047 0.419 0.0000 0.0997 0.420 2.362 2.362 2.362 4 0.855 0.438 0.0256 0.2208 0.482 2.551 2.240 1.929 5 0.986 0.429 0.1126 0.0104 0.429 2.341 2.341 2.341 6 0.870 0.424 0.0000 0.2610 0.454 2.730 2.359 1.988 tion in positions 1 (and 3). Consider first electrophilic reaction. Examining the table indicates that position 1 is heavily favored by qr , HOMO distribution, and L+r . The only other index relevant for this process, prr , favors position 4. For nucleophilic reaction, qr,prr,L-r all favor position 4. The LUMO index favors position 6, but not decisively over position 4. For radical addition, LUMO favors position 6, whereas Fr and L·r favor position 4. The latter two indices, however, favor 4 over 1 by only a slight margin. Thus, for a nonalternant molecule like azulene, these numbers are not completely trustworthy and must be interpreted with caution. One difference between a molecule like anthracene and one like azulene is that all the C–C–C angles in the former molecule are similar (~120.) whereas in azulene they differ. One might anticipate that the smaller angles in the five-membered subunit, being already closer to the tetrahedral angle characteristic of saturated carbons, would allow easier substitution or addition than would be the case in the seven-membered subunit. This factor is ignored in our calculations of L·r and might easily tip the balance to favor position 1 over position 4 for radical attack since L·1 and L·4 are so close in value. In short, these HMO reactivity index approaches are once again techniques that ignore many aspects of the physical processes being followed. It seems likely that many of these will cancel out of comparisons among similar molecules, but dissimilarities between or within molecules (most often encountered in nonalternant systems) will cause such cancellations to be less complete. For more discussion of these and otherHMOreaction indices, the reader is referred to more specialized discussions.23 Application of some of these indices to carcinogenicities of polycyclic aromatic hydrocarbons has been reviewed.24 8-20 Conclusions In this chapter, we have seen how certain basic features of molecular structure manifest themselves in molecular properties. The connectedness, or s bond network, defines the bond positions where p electrons can congregate to lower the energy of the system. 23See Streitwieser [7], Salem [1], Dewar [14], and Klopman [28]. 24See Lowe and Silverman [29]. For a dialogue for for nonspecialists by the same authors, see [30]. 296 Chapter 8 The Simple H¨uckel Method and Applications The extent of congregation is a useful measure of bond length and also is directly contributory to the total energy of the system. The symmetry restrictions for MOs have been emphasized. The fairly successful correlation of HMO results with certain experimental measurements suggests that the method effectively accounts for the controlling factors in some molecular properties. However, we have omitted discussion of other properties that correlate only poorly with HMO theory. Notable in this regard are spectral energies. The natural idea of relating a spectral transition energy to a difference between orbital energies has not been very successful in HMO theory, except when one restricts attention to a particular band in a series of related molecules. This is at least partly due to improper handling of electron exchange symmetry in the HMO method leading to an inability to distinguish between different states of a given configuration. The HMO method can be an extremely instructive way to approach a problem since it can describe the manner in which certain important factors are operating. Also, for some situations, its predictive power is rather good. However, the limitation to conjugated systems, the reliance on an increasing number of parameters as extensions are made, the inability to conform to some kinds of experimental measurement, and the conceptual slipperiness of the quantities used in the method have all contributed to a decline of interest in further development of HMO theory. More powerful computers have made it possible for more complicated but better defined methods to be used. 8-20.A Problems 8-1. Show that, if Hˆ = Hˆ1 + Hˆ2 + Hˆ3, and Hˆifj (i) = Ejfj (i), then .prod = f1(1)f2(2)f3(3) and .det = |f1(1)f2(2)f3(3)| are both eigenfunctions of Hˆ and have the same eigenvalue. Show also that . =(1/v2)[f1(1)f2(2)f3(3)+ f1(1)f2(2)f4(3)] is an eigenfunction of Hˆ if and only if E3 =E4. 8-2. Set up the HMO determinant for each of the following molecules: (a) (b) (c) (d) (e) 8-3. Set up and solve the H¨uckel determinantal equation for 2-allylmethyl (also called trimethylenemethane) (XXV). Display the orbital energy levels and indicate the Section 8-20 Conclusions 297 electron configuration for the neutral ground state. Calculate En. Find the coefficients for all MOs. (Be sure that degenerate MOs are orthogonal.) Calculate the charge densities and bond orders. 8-4. Suppose that two MOs of a molecule are given by the formulas f1 = (1/v3).1 +(1/v3).2 +(1/v3).3, f2 = (1/v3).3 +(1/v3).4 +(1/v3).5, where the .’s areAOs which are assumed to be orthonormal. By inspection, what is the overlap between these MOs? 8-5. Given the following two degenerate MOs for cyclobutadiene (assumed square planar): Use the Schmidt procedure to obtain a normalized MO that has the same energy as f2 but is orthogonal to f1. 8-6. For cyclooctatetraene (in its idealized, but incorrect, planar, octahedral form), see if you can answer the following without reference to tabulations: a) What is the HMO energy of the highest energy p MO? b) Sketch this MO (from above the molecule) showing signs and magnitudes (actual numbers) of the MO coefficients. 8-7. a) What is the H¨uckel orbital energy for the following MO? (Assume that all centers are carbons.) [Hint: Use Eq. (8-56).] 1 2 3 4 5 c 1 c 2 c 3 c 4 c 5 1 0 3 13 13 3 2 b) Calculate the contributions to bond orders due to one electron in this MO. c) Calculate the H¨uckel energy of the following MO. (Figure out the value of a if you need to use it.) 8-8. Without performing an HMO calculation, sketch the MOs for the pentadienyl radical. Use the particle-in-a-box solutions and the pairing theorem as a guide. 298 Chapter 8 The Simple H¨uckel Method and Applications 8-9. Which, if any, of the following systems would you expect to exhibit Jahn–Teller distortion? Indicate your thinking. (You should be able to answer without reference to HMO data tables.) (a) Benzyl radical, C7H7, (b) Cyclopentadienyl radical, C5H5. (c) Cyclobutadienyl radical cation, C4H+4 . (d) Benzene cation, C6H+6 . 8-10. The bond lengths given in Table P8-10 have been reported for ovalene (XXVI). Using a library source or a computer program, obtain HMO bond orders for ovalene and calculate theoretical bond lengths using a relation from Section 8-12. Make a comparison plot for ovalene of the type shown in Fig. 8-16. (See caption of Fig. 8-16 for values of k and s.) TABLE P8-10  Bond Length (Å) Bond Length (Å) 1–2 1.445 1-10 1.401 2–3 1.354 6-7 1.419 3–4 1.432 4-25 1.411 4–5 1.429 24-25 1.366 5–6 1.429 5-22 1.424 6–1 1.425 7-20 1.435 8-11. Horrocks et al. [31] report experimental bond lengths for quinoline complexed to nickel. They display a comparison plot that uses theoretical data from an MO method more refined than the simple HMO method. Using the appropriate heteronuclear parameters, perform on the computer an HMO calculation for quinoline. Calculate theoretical C–C bond distances and compare them with the experimental and theoretical data of Horrocks et al. 8-12. When the molecule CH2=CH–CH=O absorbs light of a certain frequency, a lone-pair electron on oxygen (called “n” for nonbonding) is promoted to the lowest empty p MO of the molecule (called p*; hence, an n-.p* transition). Assuming that the p MOs of this molecule are identical to those in butadiene, which C–C bondwould you expect to become longer and which shorter as a result of this transition? Calculate the expected bond length changes using butadiene data, using either set of k, s values in the caption of Fig. 8-16. (Observed: CH2–CH~= 0.06Å, CH–CH~= -0.04Å.) Section 8-20 Conclusions 299 8-13. ESR coupling constants are shown in Table P8-13 for six hydrocarbon anion radicals. Use HMO tabulations in the literature (or a computer) to obtain p-electron MO coefficients for these systems. Construct a plot of coupling constant aHµ versus c2 µi, where i is the MO containing the unpaired electron (aHµ values are in gauss). The numbered positions in Table P8-13 refer to hydrogen atoms. TABLE P8-13  8-14. Polarographic half-wave potentials for oxidation and reduction of aromatic hydrocarbons are given in Table P8-14. a) Make separate plots of these data against energy (in units of ß) of the highest occupied and lowest empty MO respectively. (Use tabulations or a computer program.) b) Now plot reduction versus oxidation half-wave potential for this series. Explain adherence to or deviation from linearity. TABLE P8-14  Compound Structure Reduction half-wave potential in 2-methyoxyethanol (V) Oxidation half-wave potential in acetonitrile (V) Tetracene 1.135 0.54 1,2-Benzpyrene 1.36 0.76 Anthracene 1.46 0.84 (Continued) 300 Chapter 8 The Simple H¨uckel Method and Applications TABLE P8-14  (Continued) Compound Structure Reduction half-wave potential in 2-methyoxyethanol (V) Oxidation half-wave potential in acetonitrile (V) Pyrene 1.61 0.86 1,2-Benzanthracene 1.53 0.92 1,2,5,6-Dibenzanthracene 1.545 1.00 Phenanthrene 1.935 1.23 Fluoranthene 1.345 1.18 Naphthalene 1.98 1.31 Biphenyl 2.075 1.48 Section 8-20 Conclusions 301 8-15. Use tabulated or computer-generated HMO data for neutral azulene (XXVII) to answer the following questions (Tabulated data may be found in Appendix 6.): a) What values would you expect for oxidation and reduction half-wave potentials for this molecule under conditions described in Problem 8-14? b) If an electron were removed from the highest occupied MO to produce an ion, which bonds would you expect to lengthen, which to shorten? 8-16. Use the effective bond energies of Table 8-2 to calculate the expected p energy for (XXVIII). Compare this with the HMO energy of 18a+21.906ß. 8-17. Obtain the HMO data for naphthalene (XXIX) and perylene (XXX): a) For each molecule, compare E to the energy predicted by use of Table 8-2. Categorize each molecule as aromatic, nonaromatic or antiaromatic. b) Compare the RE for these two molecules. Does the central ring in perylene appear to be contributing? c) Draw formal bond structures for perylene. What can you conclude about the two bonds connecting naphthalene units in perylene? d) Use HMO bond orders to calculate a predicted length for these two bonds. How do they compare with the observed 1.471Å value? Is this observed length consistent with your conclusion of part (c)? 8-18. The third, fourth, and fifth molecules in Fig. 8-24 have dipole moments. Assuming that the individual rings attract or repel charge in accordance with our expectations from the 4n+2 rule, predict the direction of the p-electronic contribution to the dipoles. (Dipoles are defined by chemists as being directed from positive 302 Chapter 8 The Simple H¨uckel Method and Applications toward negative ends of electric dipoles. Physicists use the opposite convention.) Howwould you expect the p-electron densities to vary in these molecules? Compare your expectations with tabulated densities. 8-19. How many p electrons are there in each of the following neutral molecules? Assume that each one is planar. 8-20. Use the parameters in Table 8-3 to construct the HMO determinant for molecule (XXXI). Use the conjugative model for the methyl group. 8-21. Substitution of a nitrogen for a carbon in benzene changes the HMO energy levels, but not drastically. Hence the stability of the six p-electron molecule pyridine can still be rationalized by the 4n+2 rule. Which member of each of the following pairs of molecules would you expect to be stable on the basis of such arguments 8-22. It is observed that many even alternant hydrocarbons tend to undergo nucleophilic substitution, electrophilic substitution, and radical addition at the same site(s). Rationalize this behavior in terms of the following indices (where appropriate): qr , HOMO, LUMO, L·r . Section 8-20 Conclusions 303 8-23. Can prr be a successful index for nucleophilic and electrophilic substitution if these are observed to occur at different sites? 8-24. For the methylene cyclopropene system C4H4 (see Appendix 6 for HMO data and atomic numbering scheme): a) Calculate the free valences for the neutral molecule. b) Decide, using three appropriate reactivity indices, which site is most susceptible to electrophilic attack. c) Decide which protons would lead to hyperfine splitting of the ESR spectrum of the radical anion, according to the simple H¨uckel approach. d) Decide whether the second-lowest MO is net bonding or net antibonding. Why? 8-25. Published data for free radical (CCI·3) addition to hydrocarbons are shown in Table P8-25; see Kooyman and Farenhorst [32]. Assuming planarity in each case, use standardHMOtabulations or computer programs to obtainFr values for each molecule. Choose the largest value for each molecule and plot this against the log of the modified rate constant. In each case, modify the rate constant by dividing k by the number of sites on the molecule having the maximum Fr value. TABLE P8-24  Molecule Structure Rate constant Benzene <10-3 (use this limit) Biphenyl 2.7 × 10-3 Triphenylene <4×10-2 (use this limit) Phenanthrene 1.6 × 10-2 Naphthalene 4 × 10-2 (Continued) 304 Chapter 8 The Simple H¨uckel Method and Applications TABLE P8-24  (Continued) Molecule Structure Rate constant Chrysene 6.7 × 10-2 Pyrene 1.3 Stilbene 1.0 1,2,5,6-Dibenzanthracene 3.7 Styrene 12.5 Anthracene 22 Benzanthracene 30 3,4-Benzopyrene 70 Naphthacene 102 Section 8-20 Conclusions 305 (If a molecule has Fr values differing by 0.002 or less, treat them as equal.) If any data points deviate greatly from the general trend, try to give an explanation. 8-26. Which ring of the fourth molecule in Fig. 8-24would you predict an electrophilic reagent would be more likely to attack, and why? 8-27.* Which of the following neutral unsaturated planar hydrocarbons should experience greater changes in bond order when a p electron is added to form the anion, and why? 8-28.* Consider an ESR experiment on the odd-alternant radical shown. Identify the site(s) that have attached hydrogen atoms that should give a) the largest coupling constant. b) the second-largest coupling constant. c) no coupling constant. 8-29.* Which of the following two seven-p-electron systems should be easier to ionize, and why? Multiple Choice Questions (Try to answer the following questions from sketches that you generate without reference to the text.) 1. According to simple HMO theory, which one of the following statements about butadiene is true? a) It has a nonbonding MO. b) Exciting an electron from the HOMO to the LUMO will not change the p bond orders. *This problem assumes prior study of Appendix 5. 306 Chapter 8 The Simple H¨uckel Method and Applications c) Exciting an electron from the HOMO to the LUMO will not change the p charge densities. d) Ionizing an electron from the HOMO results in all p charge densities becoming equal to 0.75. e) It is most likely to undergo electrophilic substitution at one of the two inner carbons. 2. Which of the following is a prediction that would result from a simple HMO treatment of the butadienyl cation, C4H+6 ? a) The ESR coupling constant is larger for hydrogens attached to the two inner carbons. b) The central C–C bond has a higher p-bond order than it has in the neutral molecule. c) The positive charge resides mostly on the two central carbons. d) There are only three p energy levels. e) The MO coefficients on the central pair of carbons are larger in all of the p MOs than they are in the neutral molecule. References [1] L. Salem, The Molecular Orbital Theory of Conjugated Systems. Benjamin, NewYork, 1966. [2] C. A. Coulson, Proc. Roy. Soc. (London) A164, 383 (1938). [3] A. A. Frost and B. Musulin, J. Chem. Phys., 21, 572 (1953). [4] C. A. Coulson and A. Streitwieser, Jr., Dictionary of p-Electron Calculations. Freeman, San Francisco, 1965. [5] A. Streitwieser, Jr., and J. I. Brauman, Supplemental Tables of Molecular Orbital Calculations. Pergamon, Oxford, 1965. [6] E. Heilbronner and P. A. Straub, HMOs. Springer-Verlag, Berlin and New York, 1966. [7] A. Streitwieser, Jr., Molecular Orbital Theory for Organic Chemists. Wiley, NewYork, 1961. [8] C. A. Coulson, Proc. Roy. Soc. (London) A169, 413 (1939). [9] F. Gerson and J. H. Hammons, in Nonbenzenoid Aromatics (P. J. Snyder, ed.), Vol. II. Academic Press, NewYork, 1971. [10] H. Lund, Acta Chem. Scand. 11, 1323 (1957). [11] I. Bergman, Trans. Faraday Soc. 50, 829 (1954). [12] D. W. Turner, Molecular Photoelectron Spectroscopy. Wiley (Interscience), NewYork, 1970. [13] F. Brogli and E. Heilbronner, Theoret. Chim. Acta 26, 289 (1972). Section 8-20 Conclusions 307 [14] M. J. S. Dewar, The Molecular Orbital Theory of Organic Chemistry. McGraw- Hill, NewYork, 1969. [15] M. J. S. Dewar and C. de Llano, J. Amer. Chem. Soc. 91, 789 (1969). [16] M. J. S. Dewar, A. J. Harget, and N. Trinajstic, J. Amer. Chem. Soc. 91, 6321 (1969) [17] B. A. Hess, Jr., and L. J. Schaad, J. Amer. Chem. Soc. 93, 305, 2413 (1971). [18] L. J. Schaad and B. A. Hess, Jr., J. Amer. Chem. Soc. 94, 3068 (1972). [19] V. I. Minkin, M. N. Glukhovtsev, and B. Y. Simkin, Aromaticity and Antiaromaticity. Wiley (Interscience), NewYork, 1994. [20] P. v. R. Shleyer, C. Maerker, A. Dransfeld, H. Jiao, and N. J. R. van Eikema Hommes, J. Amer. Chem. Soc. 118, 6317, (1996). [21] R. Herges, and D. Ajami, Nature, 426, 819 (2003). [22] S. K. Ritter, Chem. Eng. News, 82, vol. 9, p. 28 (2004). [23] Z. Chen, C. Corminboeuf, T. Heine, J. Bohmann, and P v. R. Schleyer, J. Amer. Chem. Soc., 125, 13, 930, (2003). [24] S. P. McGlynn, L. G. Vanquickenborne, M. Kinoshita, and D. G. Carroll, Introduction To Applied Quantum Chemistry. Holt, NewYork, 1972. [25] H. Fujimoto and K. Fukui, in Chemical Reactivity and Reaction Paths (G. Klopman, ed.).Wiley (Interscience), NewYork, 1974. [26] C. A. Coulson, Trans. Faraday Soc. 42, 265 (1946). [27] M. Levy and M. Szwarc, J. Chem. Phys. 22, 1621 (1954). [28] G. Klopman, ed., Chemical Reactivity and Reaction Paths.Wiley (Interscience), NewYork, 1974. [29] J. P. Lowe and B. D. Silverman, Accounts Chem Res., 17, 332, 1984. [30] J. P. Lowe and B. D. Silverman, J. Mol. Structure (Theochem.) 179, 47 (1988). [31] W. deW. Horrocks, D. H. Templeton, and A. Zalkin, Inorg. Chem. 7, 2303 (1968). [32] E. C. Kooyman and E. Farenhorst, Trans. Faraday Soc. 49, 58 (1953). Chapter 9 Matrix Formulation of the Linear Variation Method 9-1 Introduction In Chapter 7 we developed a method for performing linear variational calculations. The method requires solving a determinantal equation for its roots, and then solving a set of simultaneous homogeneous equations for coefficients. This procedure is not the most efficient for programmed solution by computer. In this chapter we describe the matrix formulation for the linear variation procedure. Not only is this the basis for many quantum-chemical computer programs, but it also provides a convenient framework for formulating the various quantum-chemical methods we shall encounter in future chapters. Vectors and matrices may be defined in a formal, algebraic way, but they also may be given geometric interpretations. The formal definitions and rules suffice for quantum-chemical purposes. However, the terminology of matrix algebra is closely connected with the geometric ideas that influenced early development. Furthermore, most chemists are more comfortable if they have a physical or geometric model to carry along with mathematical discussion. Therefore, we append some discussion of geometrical interpretation to the algebraic treatment.1 9-2 Matrices and Vectors 9-2.A Definitions A matrix is an ordered array of elements satisfying certain algebraic rules. We write our matrices with parentheses on the left and right of the array.2 Unless otherwise stated, we restrict the elements to be numbers (which need not be real). In general, however, as long as the rules of matrix algebra can be observed, there is no restriction on what the elements may be. In expression (9-1), we have written a matrix in three ways: . .... 1.2 3.8 -4.0 5.0 1.0 0.0 9.1 0.0 -3+4i 6.0 -1.0 -1.0 . .... = . .... c11 c12 c13 c21 c22 c23 c31 c32 c33 c41 c42 c43 . .... =C (9-1) 1More thorough discussions of matrix algebra at a level suitable for the nonspecialist are given by Aitken [1] and Birkhoff and MacLane [2]. 2Some authors use brackets. 308 Section 9-2 Matrices and Vectors 309 On the left, the numerical elements are written explicitly. In the center they are symbolized by a subscripted letter. On the right the entire matrix is indicated by a single symbol. We will use sans serif, upper-case symbols to represent matrices. Individual matrix elements will be symbolized either by a subscripted lower-case symbol (e.g., c12) or by a subscripted symbol for the matrix in parentheses (e.g., (C)12). It is useful to recognize rows and columns in a matrix. The sample matrix given above has four rows and three columns, so it is said to have dimensions 4×3. When subscripts are used to denote position in a matrix, the convention is that the first subscript indicates the row, the second indicates the column. (The order “row-column” is important to remember. The mnemonic “RC,” or “Roman Catholic” is helpful.) Rows are numbered from top to bottom, columns from left to right. There is no limit on the dimensions for matrices, but we usually will be concerned in a practical way with finite-dimensional matrices in this book. If a matrix has only one column or row, it is called a column vector or row vector, respectively. We will use sans serif, lower-case symbols to denote column vectors. Additional symbols, described shortly, will be used to denote row vectors. These two kinds of vector behave differently under the rules of matrix algebra, so it is important to avoid confusing them. If a matrix has only one row and one column, its behavior under the rules of matrix algebra becomes identical to the familiar behavior of ordinary scalars (i.e., numbers), and so a 1×1 matrix is simply a number. The similarity in appearance between a matrix and a determinant may be deceptive. A determinant is denoted by bounding with vertical straight lines, and is equal to a number that can be found by reducing the determinant according to a prescribed procedure (see Appendix 2). For this to be possible, the determinant must be square (i.e., have the same number of rows as columns). A matrix is not equal to a number and need not be square. (However, one can take the determinant of a square matrix A. This number is symbolized |A| and is not the same as A without the vertical bars.) Two matrices are equal if all elements in corresponding positions are equal. Thus, A=B means aij =bij for all i and j . 9-2.B Complex Conjugate,Transpose, and Hermitian Adjoint of a Matrix We define the complex conjugate of a matrix A to be the matrix A*, formed by replacing every element of A by its complex conjugate. If A = A*,A is a real matrix. (Every element is real.) We define the transpose of a matrix A to be the matrix A˜ , formed by interchanging row 1 and column 1, row 2 and column 2, etc. The transpose of the 4×3 matrix in expression (9-1) is the 3×4 matrix given in C˜ = . .. 1.2 5.0 9.1 6.0 3.8 1.0 0.0 -1.0 -4.0 0.0 -3+4i -1.0 . .. (9-2) If we denote some column vector as p, we can symbolize the corresponding row vector as ˜p. Thus, the tilde symbol is one device we can use to indicate a row vector. 310 Chapter 9 Matrix Formulation of the Linear Variation Method Transposing a square matrix corresponds to “reflecting” it through its principal diagonal (which runs from upper left to lower right) as indicated in A= . .. 1 2 3 4 5 6 7 8 9 . .. , A˜ = . .. 1 4 7 2 5 8 3 6 9 . .. (9-3) If A=A˜ ,A is a symmetric matrix. We define the hermitian adjoint of A to be the matrix A†, formed by taking the transpose of the complex conjugate of A (or the complex conjugate of the transpose. The order of these operations is immaterial.) Hence, A† = (A˜ )* = (A˜ *). If A = A†, A is a hermitian matrix. 9-2.C Addition and Multiplication of Matrices and Vectors Multiplication of a matrix by a scalar is equivalent to multiplying every element in the matrix by the scalar. Addition of two matrices is accomplished by adding elements in corresponding positions in the matrices. Thus, for example, 1 2 3 4  +25 6 7 8  = 1+10 2+12 3+14 4+16  = 11 14 17 20  and it is evident that the operation of matrix addition is possible only within sets of matrices of identical dimensions. Matrix multiplication is a bit more involved. We start by considering multiplication of vectors. Two types of vector multiplication are possible. If we multiply a row vector on the left times a column vector on the right, we take the product of the leading element of each plus the product of the second element of each, plus . . . , etc., thereby obtaining a scalar as a result. Hence, this is called scalar multiplication of vectors. For example, 1 2 3 . .. 4 5 6 . .. =1×4+2×5+3×6=32 This kind of multiplication requires an equal number of elements in the two vectors. It is important to retain in mind the basic operation described here: summation of products taken by sweeping across a row on the left and down a column on the right. Since there exists but one row on the left and one column on the right, we obtain one number as the result. The other possibility is to multiply a column vector on the left times a row vector on the right. Employing the same basic operation as above, we sweep across row 1 on the left and down column 1 on the right, obtaining a product that we will store in position (1, 1) of a matrix to keep track of its origin. The product of row 1 times column 2 gives us element (1, 2) and so on. In this way, we generate a whole matrix of numbers. For example, 1 2  3 4 5 = 1×3 1×4 1×5 2×3 2×4 2×5  = 3 4 5 6 8 10  Section 9-2 Matrices and Vectors 311 This is an example of matrix multiplication of vectors. Just as before, the number of columns on the left (one) equals the number of rows on the right. Now, however, the number of elements in the vectors may differ, and the dimensions of the matrix reflect the dimensions of the original vectors. The two types of vector multiplication may be symbolized as follows, using our notation for scalars, row vectors, column vectors, and matrices: ˜ab = c (9-4) a˜b = C (9-5) Multiplying two matrices together is most simply viewed as scalar multiplying all the rows in the left matrix by all the columns of the right matrix. Thus, in AB=C, the element cij is the (scalar) product of row i in A times column j in B. This process is possible only when the number of columns in A equals the number of rows in B. Thus, AB may exist as a matrix C, while BA may not exist due to a disagreement in number of rows and columns. If A and B are both square matrices and have equal dimension, then AB and BA both exist, but they still need not be equal. That is, matrix multiplication is not commutative. In a triple product of matrices, ABC, one can multiply AB first (call the result D) and then multiply DC to get the final result (call it F). Or, if one takes BC=E, then one always finds AE=F. Thus, the result is invariant to the choice between (AB)C or A(BC), and so matrix multiplication is associative. 9-2.D Diagonal Matrices, Unit Matrices, and Inverse Matrices A diagonal matrix is a square matrix having zeros everywhere except on the principal diagonal. Diagonal matrices of equal dimension commute with each other, but a diagonal matrix does not, in general, commute with a nondiagonal matrix. A unit matrix 1 is a special diagonal matrix. Every diagonal element has a value of unity. A unit matrix times any matrix (of appropriate dimension) gives that same matrix as product. That is, 1A =A1=A. It follows immediately that the unit matrix commutes with any square matrix of the same dimension. We define the left inverse of a matrix A to be A-1, satisfying the matrix equation A-1A=1. The right inverse is defined to satisfy AA-1 =1. In most of our quantum-chemical applications of matrix algebra, we will be concerned only with vectors and square matrices. For square matrices, the left and right inverses are identical, and so we refer simply to the inverse of the matrix. 9-2.E Complex Conjugate, Inverse, and Transpose of a Product of Matrices If AB=C, then C* =(AB)* =A*B*. In words, the complex conjugate of a product of matrices is equal to the product of the complex conjugate matrices. This is demonstrable from the observation that (C)ij =(A)i1(B)1j +(A)i2(B)2j +··· and (C)*ij = (A)*i 1(B)*1 j + (A)*i 2(B)*2 j +··· and so the complex conjugate is produced by taking 312 Chapter 9 Matrix Formulation of the Linear Variation Method the complex conjugate of every element in A and B but not changing their order of combination. If AB = C, then C-1 = (AB)-1 = B-1A-1. In words, the inverse of a product of matrices is equal to the product of inverses, but with the order reversed. We can easily show that this satisfies the rules of matrix algebra. C-1C = (AB)-1AB = B-1A-1AB=B-11B=B-1B=1. If we failed to reverse the order, we would instead have A-1B-1AB and, because the matrices do not commute, we would be prevented from carrying through the reduction to 1. If AB = C, then C˜ = ( AB) = B˜ A˜ . The transpose of a product is the product of transposes, again in reverse order. Since C˜ has cij and cj i interchanged, it follows that, where we had row i of A times column j of B, we must now have row j of A times column i of B. But this is the same as column j of A˜ times row i of B˜ . To obtain row on left and column on right for proper multiplication, we must have B˜ A˜ . 9-2.F A Geometric Model Consider a vector in two-dimensional space emanating from the origin of a Cartesian system as indicated in Fig. 9-1a. We can summarize the information contained in this vector (magnitude and direction) by writing down the x and y components of the vector terminus, (3 2) in this case. It must be understood that the first number corresponds to the x component and not the y, and so the vector (3 2) carries its information through number position as well as number value. If we multiply both numbers in the vector by 2, the result, (6 4), corresponds to a vector collinear to the original but twice as long. Therefore, multiplying a vector by a number results in a change of scale but no change in direction. Hence, the term “scalar” is often used in place of “number” in vector terminology. Suppose that we rotated the Cartesian axes counterclockwise through an angle ., maintaining them orthogonal to each other and not varying the distance scales. We imagine our original vector to remain unrotated during this coordinate transformation. (Equivalently, we can imagine rotating the vector clockwise by ., keeping the axes fixed.) We wish to know how to express our vector in the new coordinate system. The situation is depicted in Fig. 9-1b. Inspection reveals that the new coordinates (xy) are related to the old (x y) as follows: x =x cos . +y sin ., y=-x sin . +y cos . (9-6) If we make use of matrix algebra, we can express Eqs. (9-6) as a matrix equation: x y  =  cos . sin . -sin . cos . x y  (9-7) Multiplying the two-dimensional vector (call it v) by the 2×2 matrix (call it R) generates a new vector v, that gives the coordinates of our vector in the new coordinate system: v =Rv (9-8) Section 9-2 Matrices and Vectors 313 Figure 9-1  (a) The vector (3 2). (b) The same vector and its relationship to two Cartesian axis systems. We have, then, a parallel between the vectors v and v and matrix R on the one hand, and the two-dimensional “geometrical” vector and rotating coordinate system on the other. R represents the rotation and is often referred to as a rotation matrix. If we were to perform the rotation in the opposite direction, the rotation matrixwould be the same except for the sin . terms, which would reverse sign: rotation of coordinates clockwise by . -. cos . -sin . sin . cos .  (9-9) Note that this is just R˜ , the transpose of R. 314 Chapter 9 Matrix Formulation of the Linear Variation Method If we were to rotate counterclockwise by . and then clockwise by ., we should end up with our original coordinates for the vector. Thus, we should expect R˜ Rv=v (9-10) or R˜ R=1 (9-11) (Note that the order of operations is consistent with reading from right to left, just as with differential operators. Thus, R˜ Rmeans that firstRis performed, thenR˜ .) Relation (9-11) is easily verified by explicit multiplication. Thus we see that, if we think of a matrix as representing some coordinate transformation in geometrical space, the inverse of the matrix represents the reverse transformation. In this particular example, the transpose of the transformation matrix turns out to be the inverse transformation matrix. When this is so, the matrix is said to be orthogonal. (Orthogonal transformations do not change the angles between coordinate axes; orthogonal axes remain orthogonal—hence the name “orthogonal.”) The analogous transformation for matrices having complex or imaginary coefficients is called a unitary transformation. A unitary matrix has its hermitian adjoint as inverse: A†A=1. While it is easy to visualize a coordinate transformation in two or three dimensions, it is more difficult in higher-dimensional situations. Nevertheless, the mathematics and terminology carry forward to any desired dimension and are very useful. One may, if one wishes, talk of vectors and coordinate transformations in hyperspace, or one can eschew such mental constructs and simply follow the mathematical rules without a mental model. One must be a bit cautious about inverses of matrices. In the rotation described above, we have a unique way of relating each x, y point in one coordinate system to a point at x, y in the other. The transformation does not entail any loss of information and can therefore be “undone.” Such transformations (and their matrices) are called nonsingular. A nonsingular matrix is recognizable through the fact that its determinant must be nonzero. If we had a transformation which, for example, caused all or some points in one coordinate system to coalesce into a single point in the transformed system, we would lose our ability to back-transform in a unique way. Such a singular transformation has no inverse, and the determinant of a singular matrix equals zero. 9-2.G Similarity Transformations A matrix product of the form A-1HA is called a similarity transformation on H. If A is orthogonal, then A˜HA is a special kind of similarity transformation, called an orthogonal transformation. If A is unitary, then A†HA is a unitary transformation onH. There is a physical interpretation for a similarity transformation, which will be discussed in a later chapter. For the present, we are concerned only with the mathematical definition of such a transformation. The important feature is that the eigenvalues, or “latent roots,” of H are preserved in such a transformation (see Problem 9-5). In this section we have quickly presented the salient rules of matrix algebra and hinted at their connection with geometric operations. The results are summarized in Table 9-1 for ease of reference. Section 9-3 Matrix Formulation of the Linear Variation Method 315 TABLE 9-1  Some Matrix Rules and Definitions for a Square Matrix A of Dimension n A=B Matrix equality; means aij =bij, i,j=1,n A+B=C Matrix addition; cij =aij +bij, i,j=1,n cA=B Multiplication of A by scalar; bij =c ·aij, i,j=1,n AB=C Matrix multiplication; cij = n k=1 aikbkj, i,j=1,n |A| The determinant of the matrix A (see Appendix 2) A-1 The inverse of A; A-1A=AA-1 =1 If A-1 exists, A is nonsingular and |A| =0. A* The complex conjugate of A; aij .a*ij, i,j=1,n If A* =A,A is real. A˜ The transpose of A; (A˜ )ij =aj i (rows and columns inter changed) If A˜ =A, symmetric; if A˜ =-A, antisymmetric; if A˜ =A-1, orthogonal. A† The hermitian adjoint of A; (A†)ij =a*ji(A† =A˜ *) If A† =A, hermitian. If A† =A-1, unitary. (ABC)* =A*B*C* Complex conjugate of product (ABC)=C˜ B˜ A˜ Transpose of product (ABC)† =C†B†A† Hermitian adjoint of product (ABC)-1 =C-1B-1A-1 Inverse of product |ABC| = |A| · |B| · |C| Determinant of product (any order) T-1AT A similarity transformation If T-1 =T†, this is a unitary transformation. If T-1 = ˜T, this is an orthogonal transformation. 9-3 Matrix Formulation of the Linear Variation Method We have seen that the independent-electron approximation leads to a series of MOs for a molecular system. If the MOs are expressed as a linear combination of n basis functions (which are often approximations to AOs, although this is not necessary), the variation method leads to a set of simultaneous equations: (H11 -ES11)c1 +(H12 -ES12)c2+···+(H1n -ES1n)cn = 0 ... (9-12) (Hn1 -ESn1)c1 +···+(Hnn -ESnn)cn = 0 All terms have been defined in Chapter 7. Given a value forE that satisfies the associated determinantal equations, we can solve this set of simultaneous equations for ratios between the ci ’s. Requiring MO normality establishes convenient numerical values for the ci ’s. 316 Chapter 9 Matrix Formulation of the Linear Variation Method A matrix equation equivalent to Eq. (9-12) is3 . .. H11 -ES11 H12 -ES12 ··· H1n -ES1n ... Hn1 -ESn1 ··· Hnn -ESnn . .. . ..... c1 c2 ... cn . ..... = . ..... 0 0 ... 0 . ..... (9-13) The matrix in Eq. (9-13) is clearly the difference between two matrices. This enables us to rewrite the equation in the form . .. H11 H12 ··· H1n ... ... Hn1 ··· Hnn . .. . ..... c1 c2 ... cn . ..... =E . .. S11 S12 ··· S1n ... ... Sn1 ··· Snn . .. . ..... c1 c2 ... cn . ..... (9-14) or Hci =EiSci, i=1, 2, . . . ,n (9-15) where we have introduced the subscript i to account for the fact that there are many possible values forE and that each one has its owncharacteristic set of coefficients. Note that the “eigenvector” ci is a column vector and that each element in ci is (effectively) multiplied by the scalar Ei according to Eq. (9-15). In general, there are as many MOs as there are basis functions, and so Eq. (9-15) represents n separate matrix equations. We can continue to use matrix notation to reduce these to a single matrix equation. We do this by stacking the n c vectors together, side by side, to produce an n×n matrix C. The numbers E must also be combined into an appropriate matrix form. We must be careful to do this in such a way that the scalar E1 still multiplies only c1 (now column 1 of C) E2 multiplies only c2, and so forth. This is accomplished in the following equation . .. H11 ··· H1n ... ... Hn1 ··· Hnn . .. . .. c11 ··· c1n ... ... cn1 ··· cnn . .. = . .. S11 ··· S1n ... ... Sn1 ··· Snn . .. . .. c11 ··· c1n ... ... cn1 ··· cnn . .. . ..... E1 0 0 ··· 0 0 E2 0 ··· 0 ... ... ... ... 0 0 0 ··· En . ..... (9-16) or HC=SCE (9-17) The matrix E is a diagonal matrix of orbital energies (often referred to as the matrix of eigenvalues). C is the matrix of coefficients (or matrix of eigenvectors), and each 3Quantum-chemical convention is to use upper case letters for individual elements of the matrices H, S, and E. This differs from the usual convention. Section 9-4 Solving the Matrix Equation 317 column in C refers to a different MO. The first column refers to the MO having energy E1. In multiplying E by C from the left, each coefficient in column 1 becomes multiplied by E1. This would not occur if we multiplied E by C1 from the right. Therefore, HC=SCE is correct, whereas HC=ESC is incorrect. 9-4 Solving the Matrix Equation Since we know the basis functions and the effective hamiltonian (in principle, at least), we are in a position to evaluate the elements in H and S. How do we then find C and E? Let us first treat the simplified situation where our basis set of functions is orthonormal, either by assumption or design. Then all the off-diagonal elements of S (which correspond to overlap between different basis functions) are zero, and all the diagonal elements are unity because of normality. In short, S = 1. Therefore, Eq. (9-17) becomes HC=CE (9-18) and our problem is, given H, find C and E. Now, we want a set of coefficients that correspond to normalized MOs. We have seen earlier that, for an orthonormal basis set, this requires eachMOto have coefficients satisfying the equation (assuming real coefficients) c2 1i +c2 2i +···+c2 ni =1 (9-19) We can write this as a vector equation ˜cici = ci1 ci2 ··· cin . ..... c1i c2i ... cni . .....=1 (9-20) Furthermore, we know that any two different MOs must be orthogonal to each other. That is ˜cicj =0, i =j . All this may be summarized in the matrix equation C˜ C=1 (9-21) Hence, the coefficient matrix is orthogonal. In the more general case in which coefficients may be complex, C is unitary; i.e., C†C=1. Our problem then is, given H, find a unitary matrix C such that HC=CE with E diagonal. We can multiply both sides of a matrix equation by the same matrix and preserve the equality. However, because matrices do not necessarily commute, we must be careful to carry out the multiplication from the left on both sides, or from the right on both sides. Thus, multiplying Eq. (9-18) from the left by C†, we obtain C†HC=C†CE=1E=E (9-22) where we have used the fact that C is unitary. Now our problem may be stated as, given H, find a unitary matrix C such that C†HC is diagonal.4 Several techniques 4Not every matrix (not even every square matrix) can be diagonalized by a unitary transformation, but every hermitian matrix can be so diagonalized. 318 Chapter 9 Matrix Formulation of the Linear Variation Method exist for finding such a matrix C. These are generally much more suitable for machine computation than are determinantal manipulations. We can illustrate that the allyl radical energies and coefficients already found by the HMO method do in fact satisfy the relations C†C=1 and C†HC=E. The matrix C can be constructed from the HMO coefficients and is C= . ... 1 2 1 v2 1 2 1 v2 0 - 1 v2 1 2 - 1 v2 1 2 . ... (9-23) Therefore C†C = . ... 1 2 1 v2 1 2 1 v2 0 - 1 v2 1 2 - 1 v2 1 2 . ... . ... 1 2 1 v2 1 2 1 v2 0 - 1 v2 1 2 - 1 v2 1 2 . ... = . .. 1 0 0 0 1 0 0 0 1 . .. =1 (9-24) The matrix H for the allyl radical is, in HMO theory, H= . .. a ß 0 ß a ß 0 ß a . .. (9-25) The reader should verify that C†HC= . .. a +v2ß 0 0 0 a 0 0 0 a -v2ß . .. =E (9-26) The diagonal elements can be seen to correspond to the HMO energies. Note that the energy in the (1 1) position of E corresponds to the MO with coefficients appearing in column 1 of C, illustrating the positional correlation of eigenvalues and eigenvectors referred to earlier. If the basis functions are not orthogonal, S =1 and the procedure is slightly more complicated. Basically, one first transforms to an orthogonal basis to obtain an equation of the form HC = CE. One diagonalizes H as indicated above to find E and C, where C is the matrix of coefficients in the orthogonalized basis. Then one back transforms C into the original basis set to obtain C. There are many choices available for the orthogonalizing transformation. The Schmidt transformation, based on the Schmidt orthogonalization procedure described in Chapter 6, is popular because it is very rapidly performed by a computer. Here we will simply indicate the matrix algebra involved. Let the matrix that transforms a nonorthonormal basis to an orthonormal one be symbolized A. This matrix satisfies the relation A†SA=1 (9-27) Section 9-4 Solving the Matrix Equation 319 Furthermore, |A| = 0 and so A-1 exists. We can insert the unit matrix (in the form AA-1) wherever we please in the matrix equation HC = SCE without affecting the equality. Thus, HAA-1C=SAA-1CE (9-28) Multiplying from the left by A† gives A†HAA-1C=A†SAA-1CE (9-29) By Eq. (9-27), this reduces to (A†HA)(A-1C)=(A-1C)E (9-30) where the parentheses serve only to make the following discussion clearer. If we define A†HA to be H, and A-1C to be C, Eq. (9-30) becomes HC =CE (9-31) Since we know the matrix H and can compute A from knowledge of S, it is possible to write down an explicit H matrix for a given problem. Then, knowing H (which is just the hamiltonian matrix for the problem in the orthonormal basis), we can seek the unitary matrix C such that C†HC is diagonal. These diagonal elements are our orbital energies. (Note that E in Eq. (9-31) is the same as E in HC=SCE.) To find the coefficients for the MOs in terms of the original basis (i.e., to find C), we use the relation AC =A(A-1C)=1C=C (9-32) One nice feature of this procedure is that, even though we use the inverse matrix A-1 in our formal development, we never need to actually compute it. (A and A† are used to find H, and A is used to find C.) This is fortunate because calculating inverse matrices is a relatively slow process. A few more words should be said about the process of diagonalizing a hermitian matrix H with a unitary transformation. Two methods are currently in wide use. The older, slower method, known as the Jacobi method, requires a series of steps on the starting matrix. In the first step, a matrix O1 is constructed that causes the largest off-diagonal pair of elements of H to vanish in the transformation H1 =O˜ 1HO1. Now a second transformation matrix O2 is constructed to force the largest off-diagonal pair of elements in H1 to vanish in the transformation O˜ 2H1O2 = O˜ 2O˜ 1HO1O2. This procedure is continued. However, since each transformation affects more elements in the matrix than just the biggest pair, we eventually “unzero” the pair that was zeroed in forming H1 or H2, etc. This means that many more transformations are required than there are off-diagonal pairs. Eventually, however, the off-diagonal elements will have been nibbled away (while the diagonal elements have been building up) until they are all smaller in magnitude than some preselected value, and so we stop the process. The transformation matrix C corresponds to the accumulated product O1O2O3 . . . . 320 Chapter 9 Matrix Formulation of the Linear Variation Method Amore recently discovered, faster procedure is the Givens– Householder–Wilkinson method. Here, H is first tridiagonalized, which means that all elements are made to vanish except those on the main diagonal as well as on the codiagonals above and below the main diagonal. This similarity transformation can be done in a few steps, each step zeroing all the necessary elements in an entire row and column. The eigenvalues for the tridiagonal matrix (and hence for the original matrix) may be found one at a time as desired. If only the third lowest eigenvalue is of interest, that one alone can be computed. This is a useful degree of freedom which results in substantial savings of time. Once an eigenvalue is found, its corresponding eigenvector may be computed. 9-5 Summary The steps to be performed in a matrix solution for a linear variation calculation are: 1. From the basis set, calculate the overlap matrix S. 2. From the basis set and hamiltonian operator, calculate the hamiltonian matrix H. 3. If S = 1, find an orthogonalization procedure. In the Schmidt method, A is such that A†SA=1. The matrix equation may now be written in the form HC =CE. 4. Find C such that C†HC is a diagonal matrix. The diagonal elements are the roots E. 5. If necessary, back transform: AC =C. The columns of C contain the MO coefficients appropriate for the original basis set. 9-5.A Problems 9-1. Evaluate the following according to the rules of matrix algebra: a) 6 7 8 . .. 9 10 11 . .. b) 6 7  a b c c) 4 6 i -3  +7 3 1 -1 3  d) cos . -sin . sin . cos .  cos . sin . -sin . cos .  e) . .. i 4 1 7 0 -3 . .. 3 2 4 7  Section 9-5 Summary 321 f) 3 2 4 7 . .. i 4 1 7 0 -3 . .. g)  cos . -sin . sin . cos .  9-2. If Hij = .i*Hˆ .j dt and Hˆ is hermitian, show that H is a hermitian matrix. 9-3. Let A= a11 a12 a21 a22  , B= b11 b12 b21 b22  Show that, in general, AB=BA. 9-4. Let A= . .. 1 0 0 0 2 0 0 0 3 . .. , B= . .. 4 0 0 0 5 0 0 0 6 . .. , and C= . .. 1 0 1 0 1 0 1 0 1 . .. Show that AB=BA, but AC =CA. Compare the matrix AC with CA. Do these matrices show any simple relationship? Can you relate this to properties of A and C mathematically? 9-5. The “latent roots” .i of A are solutions to the equation |A-.i1| = 0, i =1, 2, . . . ,n, where n is the dimension of A. a) Show that, under a similarity transformation B=T-1AT, the latent roots are preserved. b) Demonstrate that diagonalization of A via a similarity transformation produces the latent roots as the diagonal elements. 9-6. Show that, if a matrix has any latent roots equal to zero, it has no inverse. 9-7. The trace (or spur) of a matrix is the sum of the elements on the principal diagonal. Thus, tr A=n i=1aii . a) Show that the trace of a triple product of matrices is invariant under cyclic permutation. That is, tr(ABC)=tr(CAB)=tr(BCA) but not tr(CBA). b) Show that the trace of a matrix is invariant under a similarity transformation. 9-8. The norm of a matrix is the positive square root of the sum of the absolute squares of all the elements. For a real matrix A, norm A= . . n  i,j=1 a2 ij . . 1/2 = . .  i  j A˜ i,j (A)j,i . . 1/2 322 Chapter 9 Matrix Formulation of the Linear Variation Method Prove that the norm of a real matrix is preserved in an orthogonal transformation (or, you may prefer to prove that the norm of any matrix is preserved in a unitary transformation). 9-9. Use the facts that the trace, the determinant, and the norm of a matrix are invariant under an orthogonal transformation to find the eigenvalues of the following matrices: a) . .. 0 1 1 1 0 1 1 1 0 . .. b) . ... 1 2 1 v2 1 2 1 v2 0 - 1 v2 1 2 - 1 v2 1 2 . ... c) . .. 0 1 0 1 2 1 0 1 0 . .. 9-10. Consider the matrix  cos . 0 -sin . 0  What is the effect of this transformation on 3 2 ? On 3 3 ? Can the transformation be uniquely reversed? (That is, for, say, . =0, and given a transformed vector 3 0 , can one uniquely determine the vector this was transformed from?) Does the matrix have an inverse? Evaluate its determinant. 9-11. What are the eigenvectors for the matrix H= . .. 1 0 0 0 -3 0 0 0 -2 . .. 9-12. Show that, if A and B have “simultaneous eigenvectors” (i.e., both diagonalized by the same similarity transformation), then A and B commute. 9-13. IfHC=CE, andC†C=1, thenC†HC=E, and we seek a unitary transformation that diagonalizes H. If HC = SCE, and C†SC = 1, then C†HC = C†SCE = 1E, and C†HC = E. Since this is the same working equation as the one we found above, why do we not proceed in the same way? Why do we bother orthogonalizing our basis first? 9-14. We have mentioned that a matrix may be used to represent the rotation of coordinates by some angle .. Such a rotation is a geometric operation, so we have, Section 9-5 Summary 323 in effect, represented an operator with a matrix. It is possible to represent other operators in a similar way. Indeed, an alternative approach to quantum mechanics exists in which the whole formalism is based on matrices and their properties (matrix mechanics, as opposed to wave mechanics). A particularly interesting example is provided by the matrices constructed by Pauli to represent spin operators and functions. It was mentioned in Chapter 5 that spin functions a and ß satisfy rules similar to those for orbital angular momentum. Two of these are Sˆza = 1 2 a, Sˆzß =- 1 2 ß But it was pointed out that a and ß could not be expressed in terms of spherical harmonics. Pauli represented this operator and functions by a = 1 0  , ß= 0 1  , Sˆz = 1 2 1 0 0 -1  Using these definitions, show that  a†ß d. =  ߆a d.=0,  a†a d.= ß†ß d.=1, Sˆza = 1 2 a, Sˆzß =- 1 2 ß [Note: since a and ß are essentially the Dirac delta functions in the spin coordinate., the process of integration reduces here to scalar multiplication of vectors.] References [1] A. C. Aitken, Determinants and Matrices, 9th ed.Wiley (Interscience), NewYork, 1958. [2] G. Birkhoff and S. MacLane, A Survey of Modern Algebra, 5th ed. Macmillan, NewYork, 1995. Chapter 10 The Extended H¨uckel Method 10-1 The Extended H¨uckel Method The extended H¨uckel (EH) method is much like the simple H¨uckel method in many of its assumptions and limitations. However, it is of more general applicability since it takes account of all valence electrons, s and p, and it is of more recent vintage because it can only be carried out on a practical basis with the aid of a computer. The basic methods of extended H¨uckel calculations have been proposed at several times by various people. We will describe the method of Hoffmann [1], which, because of its systematic development and application, is the EHMO method in common use. The method is described most easily by reference to an example. We will use methane (CH4) for this purpose. 10-1.A Selecting Nuclear Coordinates The first choice we must make is the molecular geometry to be used. For methane, we will take the H–C–H angles to be tetrahedral and C–H bond distances of 1.1 Å.We can try altering these dimensions later. Cartesian coordinates for the five atoms are listed in Table 10-1, and the orientation of the nuclei in Cartesian space is indicated in Fig. 10-1. (Even though the eigenvalues and MOs one finally obtains are independent of how CH4 is oriented in Cartesian space,1 it is generally a good idea to choose an orientation that causes some Cartesian and symmetry axes to coincide. The resulting expressions for MOs in terms of AOs are generally much simpler to sketch and interpret.) 10-1.B The Basis Set Next we must select the basis set of functions with which to express the MOs. The extended H¨uckel method uses the normalized valence AOs for this purpose. For CH4, this means a 1s AO on each hydrogen and a 2s, 2px, 2py, and 2pz AO on carbon. The inner-shell 1s AO on carbon is not included. The AOs are represented by Slater-type orbitals (STOs). Except for the 1sAOs on hydrogen, the exponential parameters of the STOs are determined from Slater’s rules (Chapter 5). Various values for the hydrogen 1sAO exponent have been suggested. These have ranged from the 1.0 given by Slater’s 1It sometimes happens that an approximation is made that causes the solution to depend on orientation. This is called “loss of rotational invariance.” 324 Section 10-1 The Extended H¨uckel Method 325 TABLE 10-1  Cartesian Coordinates (in Angstroms) for Atoms of Methane Oriented as Shown in Fig. 10-1 Atom x y z C 0.0 0.0 0.0 Ha 0.0 0.0 1.1 Hb 1.03709 0.0 -0.366667 Hc -0.518545 0.898146 -0.366667 Hd -0.518545 -0.898146 -0.366667 Figure 10-1  Orientation of methane in a Cartesian axis system. rules to a value of v2. We will use a value of 1.2, which is near the optimal value for H2 (see Chapter 7). The STOs for methane are listed in Table 10-2. 10-1.C The Overlap Matrix Knowing the AO functions and their relative positions enables us to calculate all of their overlaps. This would be a tedious process with pencil and paper. However, the formulas have been programmed for automatic computation, so this step is included in any EH computer program.2 The computed overlap matrix for the methane molecule is shown in Table 10-3. This matrix is symmetric (since the overlap between twoAOs is independent of their numbering order) and has diagonal elements of unity since the AOs are normalized. The zero values in the first four rows and columns reflect the orthogonality between the s and all pAOs on carbon. Other zero values result when hydrogen 1sAOs are centered in nodal planes of carbon pAOs. The geometry of the system is clearly reflected in the overlap matrix. For instance, the overlap of the 2pz AO of carbon with the hydrogen 1s AO at Ha is large and positive, while its overlaps with AOs on Hb,Hc, and Hd are negative, equal, and of smaller magnitude. The 2s AO of carbon, on the other hand, overlaps all 1s AOs equally. Also, the overlap between every pair of hydrogen 1s AOs 2Several such programs are available from Quantum Chemistry Program Exchange, Chemistry Dept., Room204, Indiana University, Bloomington, Indiana 47401. http://www.QCPE.Indiana.edu 326 Chapter 10 The Extended H¨uckel Method TABLE 10-2  Basis AOs for Methane AO no. Atom Type na la ma exp 1 C 2s 2 0 0 1.625 2 C 2pz 2 1 0 1.625 3 C 2px 2 1 (1)b 1.625 4 C 2py 2 1 (1)b 1.625 5 Ha 1s 1 0 0 1.200 6 Hb 1s 1 0 0 1.200 7 Hc 1s 1 0 0 1.200 8 Hd 1s 1 0 0 1.200 an, l,m are the quantum numbers described in Chapter 4. b2px and 2py are formed from linear combinations of m=+1 and m=-1 STOs, and neither of these AOs can be associated with a particular value of m. TABLE 10-3  Overlap Matrix for STOs of Table 10-2 1 2 3 4 5 6 7 8 1 1.0000 0.0 0.0 0.0 0.5133 0.5133 0.5133 0.5133 2 0.0 1.0000 0.0 0.0 0.4855 -0.1618 -0.1618 -0.1618 3 0.0 0.0 1.0000 0.0 0.0 0.4577 -0.2289 -0.2289 4 0.0 0.0 0.0 1.0000 0.0 0.0 0.3964 -0.3964 5 0.5133 0.4855 0.0 0.0 1.0000 0.1805 0.1805 0.1805 6 0.5133 -0.1618 0.4577 0.0 0.1805 1.0000 0.1805 0.1805 7 0.5133 -0.1618 -0.2289 0.3964 0.1805 0.1805 1.0000 0.1805 8 0.5133 -0.1618 -0.2289 -0.3964 0.1805 0.1805 0.1805 1.000 is the same. Features such as these provide a useful check on the correctness of our initial Cartesian coordinates. 10-1.D The Hamiltonian Matrix We have the overlap matrix S. Next we must find the hamiltonian matrix H. Then we will be in a position to solve the equation HC=SCE for C and E. (See Chapter 9.) The matrix H is calculated from a very approximate but simple recipe. The basic ideas are similar in spirit to those described in connection with the interpretations of a and ß in the simple H¨uckel method. The energy integral Hii in the EH method is taken to be equal to the energy of an electron in the ith AO of the isolated atom in the appropriate state. The various ionization energies of atoms are known,3 so this presents no great difficulty. However, one special problem must be dealt with, namely, finding the 3See Moore [2]. Section 10-1 The Extended H¨uckel Method 327 appropriate state. In the isolated carbon atom, the lowest-energy states are associated with the configuration 1s22s22p2. In a saturated molecule such as methane, however, carbon shares electrons with four hydrogens, and calculations indicate that the 2s and all three 2p AOs are about equally involved in forming occupied MOs. That is, in the molecule, carbon behaves as though it were in the 2s2p3 configuration. This configuration (shortened to sp3) is referred to as the valence state of carbon in this molecule. Since this is an “open-shell” configuration (i.e., not all the electrons are spin-paired in filled orbitals), there are several actual physical states (corresponding to different spin and orbital angular momenta of an isolated carbon atom) that are associated with this configuration. Thus, there are several real physical states, with different ionization energies, associated with our mentally constructed sp3 valence state for the atom in a molecule. The question is, what real ionization energies should we use to evaluate our valence state ionization energy (VSIE)? The approach that is used is simply to average the real IEs for loss of a 2p or a 2s electron, the average being taken over all states associated with the sp3 configuration. Various authors recommend slightly different sets of VSIEs.4 We use here the values tabulated by Pople and Segal [7]. Because of the rather crude nature of the EH method, the slight variations in VSIE resulting from different choices are of little consequence.5 For methane, we have (C2s):H11 = -19.44eV=-0.7144 a.u. (10-1) (C2p):H22 = H33 =H44=-10.67eV=-0.3921 a.u. (10-2) (H1s):H55 = H66 =H77 =H88=-13.60eV=-0.50000 a.u. (10-3) The off-diagonal elements of H are evaluated according to6 Hij =KSij Hii +Hjj 2  (10-4) where K is an adjustable parameter. The rationalization for such an expression is that the energy of interaction should be greater when the overlap between AOs is greater, and that an overlap interaction energy between low-energy AOs should be lower than that produced by an equal amount of overlap between higher-energy AOs. We will discuss the energy versus overlap relation in more detail in a later section. The value of K suggested by Hoffmann [1] is 1.75. The reasons for choosing this value will be discussed shortly. For now, we accept this value and arrive at the hamiltonian matrix given in Table 10-4. By examiningHwe can guess in advance some of the qualitative features of the MOs that will be produced. For instance, the value of H25 (-0.3790 a.u.) indicates a strong energy-lowering interaction between the 2pz AO and the 1sAO on Ha. This interaction refers to AOs with positive and negative lobes as they are assigned in the basis set. Therefore, we expect a low-energy (bonding) MO to occur where theseAOs are mixed with coefficients of the same sign so as not to affect thisAOsign relation. Ahigh-energy MO should also exist where the mixing occurs through coefficients of opposite sign, producing an antibonding interaction. The values of H26,H27, and H28 are positive, 4See Skinner and Pritchard [3], Hinze and Jaff´e [4], Basch et al. [5], Anno [6], and Pople and Segal [7]. 5The proper valence state for carbon in methane differs from that in ethylene, which in turn differs from that in acetylene. Generally, this is ignored in EHMO calculations and a compromise set of VSIEs is selected for use over the whole range of molecules. 6This formula is often called theWolfsberg–Helmholtz relation. 328 Chapter 10 The Extended H¨uckel Method TABLE 10-4  The Extended H¨uckel Hamiltonian Matrix for CH4a 1 2 3 4 5 6 7 8 1 -0.7144 0.0 0.0 0.0 -0.5454 -0.5454 -0.5454 -0.5454 2 0.0 -0.3921 0.0 0.0 -0.3790 0.1263 0.1263 0.1263 3 0.0 0.0 -0.3921 0.0 0.0 -0.3573 0.1787 0.1787 4 0.0 0.0 0.0 -0.3921 0.0 0.0 -0.3094 0.3094 5 -0.5454 -0.3790 0.0 0.0 -0.5000 -0.1579 -0.1579 -0.1579 6 -0.5454 0.1263 -0.3573 0.0 -0.1579 -0.5000 -0.1579 -0.1579 7 -0.5454 0.1263 0.1787 -0.3094 -0.1579 -0.1579 -0.5000 -0.1579 8 -0.5454 0.1263 0.1787 0.3094 -0.1579 -0.1579 -0.1579 -0.5000 aAll energies in a.u. due to negative overlap in corresponding positions of S. In this case, energy lowering will be associated with mixing 2pz with 1s AOs on hydrogens b, c, and d, but now the mixing coefficients will have signs that reverse the AO sign relations from those pertaining in the original basis. TheAOs that are orthogonal have zero interaction, and so mixing between such AOs will not affect MO energies. (When such AOs are mixed in the same MO, it is often the result of arbitrary mixing between degenerate MOs. In such cases, one can find an orthogonal pair of MOs such that two noninteracting AOs do not appear in the same MO. Methane will be seen to provide an example of this.) 10-1.E The Eigenvalues and Eigenvectors Having H and S, we now can use the appropriate matrix-handling programs to solve HC=SCE for the matrix eigenvalues on the diagonal of E and the coefficients for the MOs, which are given by the columns of C. The eigenvalues for methane, together with their occupation numbers, are given in Table 10-5. The corresponding coefficients are given in Table 10-6. TABLE 10-5  Energies for Methane by the Extended H¨uckel Method MO no. Energy (a.u.) Occ. no. 8 1.1904 0 7 0.2068 0 6 0.2068 0 5 0.2068 0 4 -0.5487 2 3 -0.5487 2 2 -0.5487 2 1 -0.8519 2 Section 10-1 The Extended H¨uckel Method 329 TABLE 10-6  Coefficients Defining MOs for Methane MO number 1 2 3 4 5 6 7 8 1(2s) 0.5842 0.0 0.0 0.0 0.0 0.0 0.0 1.6795 2(2pz) 0.0 0.5313 -0.0021 -0.0007 -0.0112 -0.0137 1.1573 0.0 3(2px) 0.0 0.0021 0.5313 -0.0021 1.1573 -0.0178 0.0110 0.0 4(2py) 0.0 0.0007 0.0021 0.5313 0.0176 1.1572 0.0139 0.0 5(1sa) 0.1858 0.5547 -0.0022 -0.0007 0.0105 0.0128 -1.0846 -0.6916 6(1sb) 0.1858 -0.1828 0.5237 -0.0019 -1.0260 0.0114 0.3518 -0.6916 7(1sc) 0.1858 -0.1853 -0.2589 0.4542 0.4943 -0.8977 0.3558 -0.6916 8(1sd) 0.1858 -0.1865 -0.2626 -0.4516 0.5213 0.8734 0.3770 -0.6916 Only two of the eight MOs are nondegenerate. These two MOs must be symmetric or antisymmetric for every symmetry operation of the molecule. This is easily checked by sketching the MOs, referring to the coefficients in the appropriate columns of Table 10-6. The lowest nondegenerate energy occurs in position 1 of our eigenvalue list (Table 10-5), and so the coefficients for this MO are to be found in column 1 of Table 10-6. This column indicates that theMO f1 is equal to 0.5842 2s+0.1858 1sa + 0.1858 1sb +0.1858 1sc +0.1858 1sd , where the symbols 2s, 1sa, etc., stand for AOs on carbon, Ha, etc. A sketch of thisMOappears in Fig. 10-2. It is obviously symmetric for all rotations and reflections of a tetrahedron. Notice that this MO is bonding in all four C–H bond regions since the 2s STO on carbon is in phase agreement with all the hydrogen 1s AOs. The higher-energy, nondegenerate MO f8 is qualitatively similar to f1 except that the signs are reversed on the 1s AOs (see Table 10-6). Hence, this MO has the same symmetry properties as f1, but is antibonding in the C–H regions. Figure 10-2  A drawing of the lowest-energy nondegenerate EHMO for methane. The AOs are drawn as though they do not overlap. This is done only to make the drawing simpler. Actually, the AOs overlap strongly. 330 Chapter 10 The Extended H¨uckel Method Since these two MOs resemble the 2s STO in being symmetric for all the symmetry operations of a tetrahedron, we will refer to them as s-type MOs. The remaining six MOs are grouped into two energy levels, each level being triply degenerate. (It is possible to predict from symmetry considerations alone that the energy levels resulting from this calculation will be nondegenerate and triply degenerate. This is discussed in a later chapter.) Because they are degenerate, the MOs cannot be expected to show all the symmetry of the molecule, but it should be possible for them to show some symmetry. Consider f2, as given by column 2 of Table 10-6. This is mainly constructed from the 2pz AO on carbon and 1s AOs on the four hydrogens. Small contributions from 2px and 2py are also present, however. It would be nice to remove these small contributions and “clean up” the MO.We can do this, as mentioned earlier, by mixing f2 with appropriate amounts of f3 and f4 since these are all degenerate. The equations that our cleaned-up MO, f2 , must satisfy are f2 =d2f2 +d3f3 +d4f4 (10-5) where (in order to cause all 2px and 2py contributions to vanish) 0.0021d2 +0.5313d3 +-0.0021d4=0.0 0.0007d2 +0.0021d3 + 0.5313d4=0.0 (10-6) d2 2 + d2 3 + d2 4=1 As a result, f2 =0.9999f2 -0.0040f3 -0.0013f4 (10-7) A similar procedure to produce an orbital with no pz or py contribution (f3 ), and one with no pz or px contribution (f4 ) gives f3 = 0.9999f3 +0.0040f2 -0.0040f4 (10-8) f4 = 0.9999f4 +0.0013f2 +0.0039f3 (10-9) The coefficients for these MOs appear in Table 10-7. TABLE 10-7  Coefficients for MOs f2 ,f3 ,f4 f2 f3f4 2s 0.0 0.0 0.0 2pz 0.5313 0.0 0.0 2px 0.0 0.5313 0.0 2py 0.0 0.0 0.5313 1sa 0.5547 0.0 0.0 1sb -0.1849 0.5228 0.0 1sc -0.1849 -0.2614 0.4529 1sd -0.1849 -0.2614 -0.4529 Section 10-1 The Extended H¨uckel Method 331 Figure 10-3  The three lowest-energy degenerate MOs of methane. There is no fundamental change produced by intermixing degenerate MOs in this way. The total electronic density and the orbital energies are uninfluenced. The only advantage is that the cleaned-up MOs are easier to sketch and visualize. The MOs f2 , f3 , and f4 are sketched in Fig. 10-3. Each of the MOs in Fig. 10-3 is symmetric or antisymmetric for some of the operations that apply to a tetrahedron. f2 is symmetric for rotations about the z axis by 2p/3, and also for reflection through the xz plane. This same reflection plane is a symmetry plane for f3 and f4 , but neither of these MOs shows symmetry or antisymmetry for rotation about the z axis. Each MO contains one pAO and, perforce, has the symmetry of that AO.We shall refer to these as p-type MOs. Note that hydrogen 1s AOs lying in the nodal plane of a pAO do not mix with that pAO in formation of MOs. This results from zero interaction elements in H, which, in turn, results from zero overlap elements in S. Note also that the MO f2 is the MO that we anticipated earlier on the basis of inspection of the matrix H. Because of phase agreements between the hydrogen 1s AOs and the adjacent lobes of the p AOs, f1 , f2 , and f3 are C–H bonding MOs. A similar “cleaning up” procedure can be performed on f5, f6, and f7. These turn out to be the C–H antibonding mates to the MOs in Fig. 10-3. The broad results of this calculation are that there are four occupied C–H bonding MOs, one of s type and three of p type. At higher energies are four unoccupied C–H antibonding MOs, again one of s type and three of p type. Note that the s- and the three p-type MOs fall into the same energy pattern as the s and pAOs of isolated carbon. Because of their highly symmetric tetrahedral geometry, the hydrogen atoms do not lift the degeneracy of the pAOs. There are many molecules and complexes in which a cluster of atoms or molecules surrounds a central atom in such a highly symmetric way that the degeneracies among certain AOs on the central atom are retained. 10-1.F The Total Energy The total EH energy is taken as the sum of the one-electron energies. For methane, this is 2×(-0.8519)+6×(-0.5487), or-4.9963 a.u. There is some ambiguity as to how this energy is to be interpreted. For instance, does it include any of the internuclear repulsion energy? Also, what problems will arise from our neglect of inner-shell 332 Chapter 10 The Extended H¨uckel Method electrons? By comparing EH total energy changes with experimental energy changes, it has been decided7 that the change in EH total energy upon change of geometry is approximately the same as the actual change in total electronic plus nuclear repulsion energy for the system. Thus, our value of -4.9963 a.u. is not a realistic value for the total (nonrelativistic) energy of methane (the actual value is -40.52 a.u.), but it is meaningful when compared to EH energies for methane at other geometries. For example, if we uniformly lengthen or shorten all the C–H bonds in methane and repeat our EH calculation several times, we can generate an energy curve versus RC–H for the symmetrical stretch vibrational mode of methane. The resultant plot is given in Fig. 10-4. The EH total energy is minimized at about RC–H =1 Å, reasonably close to the experimentally observed 1.1- Å distance for the minimum total energy of methane. The appearance of the curve in Fig. 10-4 does encourage us to equate EH total energy changes to changes in actual electronic-plus-nuclear-repulsion energies. As we shall see later, this procedure fails for some molecules (notably H2) and for methane may be regarded as fortuitous. Figure 10-4  Total extended H¨uckel energy for CH4 as a function of C–H bond length. 7See Hoffmann [1]. Section 10-1 The Extended H¨uckel Method 333 Figure 10-5  Staggered ethane MO energies versus K. C–H distances are 1.1 Å; C–C distance is 1.54 Å; all angles are tetrahedral. 10-1.G Fixing the Parameter K We mentioned earlier that Hoffmann suggested a value of 1.75 for K. We will now indicate the considerations behind this suggestion. Hoffmann used the ethane molecule C2H6 to evaluate K. A plot of the orbital energies of staggered ethane as a function of K is shown in Fig. 10-5. The energies are linearly dependent on K at values of K greater than about 1.5. At lower values, the lines curve and some crossing occurs. Hence, in order that the MO energy order not be highly sensitive to K, its value should exceed 1.5. A plot of the amount of electronic charge in a bond or at an atom in ethane, as calculated by the EHMO method, versus K is shown in Fig. 10-6. We will describe the details of such calculations shortly, but for now, we merely note that the disposition of charge in ethane becomes rather insensitive to K at values of K greater than about 1.5. In Fig. 10-7 a plot of the EH total energy difference between staggered and eclipsed ethanes versus K is given. This energy difference has an experimentally determined value of 2.875 ± 0.025 kcal/mole, the staggered form being more stable. To give reasonable agreement with this experimental value, K should be about 1.75. Thus, the value K =1.75 is selected because the MO energy order and charge distribution are not sensitive to K in this region and because this value of K gives the correct total energy change for a known physical process. We have also seen earlier that this same value of K leads to a reasonable prediction for the equilibrium bond length inCH4. Notice that the evaluation ofK comes by matching the 334 Chapter 10 The Extended H¨uckel Method Figure 10-6  Mulliken gross population onH[N(H)] and Mulliken overlap populations inC–Cand C–H bonds [n(C–C) and n(C–H)] of ethane as calculated by the EH method with various values of K. Figure 10-7  Extended H¨uckel energy difference between staggered and eclipsed ethanes as a function of K. Section 10-2 Mulliken Populations 335 EH total energy change to the total (nuclear repulsion plus electronic) observed energy change for internal rotation in ethane. This is consistent with our earlier interpretation of EH total energy. 10-2 Mulliken Populations We found that the electron densities and bond orders calculated in the simple H¨uckel method were extremely useful for relating theory to observable molecular properties such as electron spin-resonance splittings or bond lengths. Hence, it is desirable that we find analogous quantities to describe the distribution of electrons in an all-valenceelectron method like the EH method. A number of suggestions have been made. The one we use is due to Mulliken [8]. It is the most widely used, and, as we shall see, it has an especially direct and useful connection with the EH method. Consider a real, normalized MO, fi , made up from two normalizedAOs, .j and .k: fi =cji.j +cki.k (10-10) We square this MO to obtain information about electronic distribution: f2 i =c2 ji.2 j +c2 ki.2 k +2cjicki.j.k (10-11) If we integrate Eq. (10-11) over the electronic coordinates, we obtain (since fi , .j , and .k are normalized) 1=c2 ji +c2 ki +2cjickiSjk (10-12) where Sjk is the overlap integral between .j and .k. Mulliken suggested that one electron in fi should be considered to contribute c2 ji to the electron net AO population of .j , c2 ki to the population of .k, and 2cjickiSjk to the overlap population between .j and .k. If there are two electrons in fi , then these populations should be doubled. Let qi j symbolize the net AO population of .j due to one electron in MOfi, and pi jk symbolize the overlap population between .j and .k due to this same electron. The above example leads to the following general definitions: qi j = c2 ji (10-13) pi jk = 2cjickiSjk (10-14) We can now sum the contributions due to all the electrons present in the model system, obtaining a Mulliken net AO population qj for each AO .j , and a Mulliken overlap population pjk, for each distinct AO pair .j and .k: qj = MOs  i nic2 ji = MOs  i niqi j (10-15) Pjk = 2 MOs  i nicjickiSjk = MOs  i nipi jk (10-16) Notice that the sum of all the netAO and overlap populations must be equal to the total number of electrons in the model system. (In the EH method, this is the total number 336 Chapter 10 The Extended H¨uckel Method of valence electrons.) This contrasts with the situation in the simple H¨uckel method where the sum of electron densities alone, exclusive of bond orders, equals the total number of p electrons. The Mulliken populations are useful indices of the location of electronic charge in the molecule and its bonding or antibonding nature. The contributions to such populations from one electron in MO f4 of methane are given below (data taken from Tables 10-3 and 10-7): q4 2py = (0.5313)2=0.2823 (10-17) q4 1sc = q4 1sd=(0.4529)2=0.2051 (10-18) p4 2py-1sc = p4 2py-1sd=2(0.5313)(0.4529)(0.3964)=0.1908 (10-19) p4 1sc-1sd = 2(0.4529)(-0.4529)(0.1805)=-0.0740 (10-20) All the other populations for f4 are zero. These numbers show that an electron in this MOcontributes about 28%, of an electron to the carbon 2py netAOpopulation, 20–21% to each of two hydrogen AOs, 19% to each of two C–H bonds, and a negative 7% to the region between Hc and Hd . The last corresponds to an antibonding interaction, as is obvious from an examination of the sketch of f4 in Fig. 10-3. (A negative overlap population is interpreted to mean that the amount of charge in the overlap region is less than what would exist if one squared the two AOs and then combined them.) By summing over the contributions due to all eight valence electrons, we obtain the Mulliken populations shown in Table 10-8. These data are part of the normal output of an EH computer program. The matrix is symmetric, and only the unique elements are tabulated. Such a matrix is named a Mulliken overlap population matrix. The data in Table 10-8 indicate that the hydrogens have positive overlap populations with the 2s and 2pAOs of carbon but have small negative overlap populations with each other. This corresponds to saying that the hydrogens interact with the carbon AOs in a bonding way and in a weakly antibonding way with each other. The symmetry equivalence of the four hydrogen atoms results in their having identical net AO populations. Often we are interested in knowing how the hydrogen atoms interact with the carbon in toto, rather than with the 2s and 2p AOs separately. This can be obtained simply TABLE 10-8  Mulliken Net AO and Overlap Populations for Methane as Computed by the Extended H¨uckel Method 2s 2pz 2px 2py 1sa 1sb 1sc 1sd 2s 0.6827 0.0 0.0 0.0 0.2229 0.2229 0.2229 0.2229 2pz 0.5645 0.0 0.0 0.5723 0.0636 0.0636 0.0636 2px 0.5645 0.0 0.0 0.5087 0.1272 0.1272 2py 0.5645 0.0 0.0 0.3815 0.3815 1sa 0.6844 -0.0491 -0.0491 -0.0491 1sb 0.6844 -0.0491 -0.0491 1sc 0.6844 -0.0491 1sd 0.6844 Section 10-2 Mulliken Populations 337 TABLE 10-9  Reduced Net AO and Overlap Population Matrix for Methane C Ha Hb Hc Hd C 2.3762 0.7952 0.7952 0.7952 0.7952 Ha 0.6844 -0.0491 -0.0491 -0.0491 Hb 0.6844 -0.0491 -0.0491 Hc 0.6844 -0.0491 Hd 0.6844 by summing all the carbon atom AO contributions together to give a reduced overlap population matrix, shown in Table 10-9. The reduced population matrix makes evident the equivalence of the four C–H bonds, all of which have a total overlap population of 0.7952 electrons. The Mulliken population scheme described above assigns some electronic charge to AOs, the rest to overlap regions. An alternative scheme, which assigns all the charge to AOs, was also proposed by Mulliken. One simply divides each overlap population in half, assigning half of the charge to each of the two participating AOs. When all the overlap populations have been reassigned in this way, the electronic charge is all in the AOs, and the sum of these AO charges still equals the total number of electrons. Mulliken called theAOpopulations resulting from this procedure grossAOpopulations. We will use the symbol N(X) for the gross population in X, where X can be anAO or an atom (i.e., the sum of all gross AO populations on one atom). Clearly N(X)=qx + 1 2 j =x pxj (10-21) The grossAOpopulations, gross atomic populations, and the resultant atomic charges (obtained by combining electronic and nuclear charges) for methane are listed in Table 10-10. These data suggest that the carbon has lost a very small amount of charge to hydrogen upon formation of the molecule. It would be very risky, however, to place much faith in such an interpretation. It turns out that populations are rather sensitive to choice of VSIEs. For example, Hoffmann’s original choice of VSIEs differs from that TABLE 10-10  Gross AO Populations, Gross Atomic Populations, and Net Atomic Charges for Methane Gross AO Gross atom Net atomic population population charge C2s 1.128 3.966 +0.0334 C2p a 0.946 Ha 1.008 1.008 -0.0083 aAll 2pAOs and all HAOs have identical values because they are equivalent through symmetry. 338 Chapter 10 The Extended H¨uckel Method used here, and he obtained gross populations for hydrogen atoms of around 0.9, giving net positive charges of 0.1. Thus, absolute values are not very useful, but changes in gross population as we go from one hydrogen to another in the same hydrocarbon molecule or in closely related molecules do appear to be rather insensitive to VSIE choice and are often in accord with results of more accurate calculations. 10-3 Extended H¨uckel Energies and Mulliken Populations In Chapter 8 it was shown that a simple quantitative relation exists between the energy of a simple H¨uckel MO and the contributions of the MO to bond orders [Eq. (8-56)]. The more bonding such an MO is, the lower is its energy. A similar relationship will now be shown to hold for extended H¨uckel energies and Mulliken populations. The orbital energy for the real MO fi is (ignoring the spin variable) Ei =  fiHˆ fi dv  f2 i dv (10-22) = AOs j,k cjickiHjk AOs j,k cjickiSjk (10-23) Assume that fi is normalized: The denominator is unity. The numerator of Eq. (10-23) contains diagonal and off-diagonal terms. If we separate these, and substitute the relation (10-4) for the off-diagonal terms, we obtain Ei = AOs  j,k c2 jiHjj +2 AOs  j1,Esg is negative (since Haa is negative). This analysis shows that the choice of 1.75 as the value for K has the effect of making the increase in overlap population dominate the energy change. If K were less than unity, the netAO population changes would dominate. The above example suggests that the EHMO method lowers the energy by maximizing weighted overlap populations at the expense of net AO populations. It also suggests that the EH energy should be lowered whenever a molecule is distorted in a way that enables overlap population to increase. These are useful rules of thumb, but some caution must be exercised since the existence of several different kinds of atom in a molecule leads to a more complicated relation than that in the above example. Our methane example illustrates the above ideas. We have already seen that the total EH energy for methane goes through a minimum around RC–H =1 Å. Let us see how the individual MO energies change as a function of RC–H and try to rationalize their behaviors in terms of the overlap population changes. Aplot of theMOenergies is given in Fig. 10-8. The lowest-energyMOis s type and C–H bonding. The overlap betweenH 1sAOs and theC2s STOincreases asRC–H decreases. As a result, the energy of thisMO decreases asRC–H decreases, favoring formation of the united atom. The second-lowest energy level is triply degenerate and belongs to p-type C–H bonding MOs. A glance at Fig. 10-3 indicates that overlap between AOs will first increase in magnitude as RC–H decreases. But ultimately, at small RC–H this behavior must be reversed because the 1s and 2pAOs are orthogonal when they are isocentric. Therefore, as RC–H decreases, the overlap population first increases, then decreases toward zero. Consequently, the EH energy of this level first decreases, then increases. Since there are six electrons in this 340 Chapter 10 The Extended H¨uckel Method Figure 10-8  Extended H¨uckel MO energies for methane as a function of RC–H. level and only two in the lowest-energy MO, this one dominates the total energy and is responsible for the eventual rise in total energy at small R that creates the minimum in Fig. 10-4. The two remaining (unoccupied) levels belong to antibonding MOs, and the overlap population becomes more negative as RC–H decreases. The p-type level rises less rapidly and eventually passes through a maximum (not shown), again because the overlap population must ultimately approach zero as RC–H approaches zero. It is important to remember that all these remarks apply to the EH method only. The relationships between the EH method and other methods or with experimental energies is yet to be discussed. 10-4 Extended H¨uckel Energies and Experimental Energies We have seen that a change in EH energy reflects certain changes in calculated Mulliken populations. We now consider the circumstances that should exist in order that such EH energy changes agree roughly with actual total energy changes for various systems. Section 10-4 Extended H¨uckel Energies and Experimental Energies 341 In essence, there are two requirements. First, the population changes calculated by the EH method ought to be in qualitative agreement with the charge shifts that actually occur in the real system. This condition is not always met. Some systems, especially those having unpaired electrons, are not accurately representable by a singleconfiguration wavefunction (that is, by a single product or a single Slater determinant). An example of this was seen for some 1s2s states of helium (see Chapter 5). Since the EHMO method is based on a single configuration, it is much less reliable for treating such systems. Special methods exist for handling such systems, but they normally are not applied at the H¨uckel level of approximation. The second requirement is that, as the charge shifts in the real system, the total energy should change in the way postulated by the EH method. For instance, if the charge increases in bond regions, this should tend to lower the total energy. In actuality, this does not always happen. We can see this by returning to our example of two identical 1sAOs separated byR. The associated sg MOincreases its overlap population monotonically asR decreases. Therefore, theEHpostulate predicts that the total energy should decrease monotonically. It is clear, however, that this does not happen in any real system. If our system is H2, we know that the experimental energy decreases with decreasing R until R =Re, and increases thereafter (Fig. 7-18). If our system is He2+ 2 , the experimental total energy increases as R decreases, due to the dominance of internuclear repulsion.8 Consideration of many examples such as these indicates that the postulate is usually qualitatively correct when we are dealing with interactions between neutral, fairly nonpolar systems that are separated by a distance greater than a bond length typical for the atoms involved. These, then, are circumstances under which the EHMO method might be expected to be qualitatively correct. In brief, we find two kinds of condition limiting the applicability of the EH method. The first is that the system must be reasonably well represented by a single configuration. Hence, closed-shell systems are safest. The second condition is that we apply the method to uncharged nonpolar systems where the nuclei that are undergoing relative motion are not too close to each other. These conditions are often satisfied by molecules undergoing internal rotation about single bonds. Thus, it is indeed appropriate that the internal rotation barrier in ethane was used to help calibrate the method. A comparison of other EH calculated barriers with experimental values (Table 10-11) gives us some idea of the capabilities and limitations of the method. For all the molecules listed, the most stable conformation predicted by the EH method agrees with experiment. This suggests that one can place a fair amount of reliance on the conformational predictions of the method, at least for threefold symmetric rotors. Also, certain gross quantitative trends, such as the significant reduction as we proceed down the series ethane, methylamine, methanol, are displayed in the EH results, but it is evident that the quantitative predictions of barrier height are not very good. This is not too surprising since the method was calibrated on hydrocarbons. Introduction of halogens, nitrogen, or oxygen produces greater polarity and might be expected to require a different parametrization. For reasons outlined earlier, we expect even less accuracy in EH calculations of molecular deformations involving bond-angle or bond-length changes. This expectation is generally reinforced by calculation. Molecular shape predictions become poorer 8Actually, He2+ 2 has a minimum at short R in its energy curve due to an avoidance of curve crossing (discussed in Chapter 14), but this minimum is unstable with respect to the separated ions. See Pauling [9]. 342 Chapter 10 The Extended H¨uckel Method TABLE 10-11  Energy Barriers for Internal Rotation about Single Bondsa Barrier (kcal/mole)b Molecule Calculated Experiment CH3–CH3 3.04 2.88 CH3–NH2 1.66 1.98 CH3–OH 0.45 1.07 CH3–CH2F 2.76 3.33 CH3–CHF2 2.39 3.18 CH3–CF3 2.17 3.25 CH3–CH2Cl 4.58 3.68 CH3–CHCH2 1.20 1.99 cis–CH3–CHCHCl 0.11 0.62 CH3–CHO 0.32 1.16 CH3–NCH2 0.44 1.97 aCalculated barriers are for rigid rotation, where no bond length or angle changes occur except for the torsional angle change about the internal axis. bThe stable form for the first seven molecules has the methyl C–H bonds staggered with respect to bonds across the rotor axis. For the last four molecules, the stable form has a C–H methyl bond eclipsing the double bond. for more polar molecules (water is calculated to be most stable when it is linear), and bond-length predictions are quite poor too. Results such as these have tended to restrict use of the EH method to qualitative predictions of conformation in molecules too large to be conveniently treated by more accurate methods. However, just as the simple H¨uckel method underwent various refinements (such as the . technique) to patch up certain inadequacies, so has the EH method been refined. Such refinements9 have been shown to give marked improvement in numerical predictions of various properties. The EH method has been overtaken in popularity by a host of more sophisticated computational methods. (See Chapter 11.) However, it is still sometimes used as a first step in such methods as a way to produce a starting set of approximate MOs. The EHMO method also continues to be important as the computational equivalent of qualitative MO theory (Chapter 14), which continues to play an important role in theoretical treatments of inorganic and organic chemistries (as, for example, inWalsh’s Rules and inWoodward-Hoffmann Rules). 10-4.A Problems The following output is produced by an EH calculation on the formaldehyde molecule, and is referred to in Problems 10-1 to 10-10. 9See Kalman [10], Boyd [11], and the references cited in these papers. Also, see Anderson and Hoffmann [12] and Anderson [13]. Section 10-4 Extended H¨uckel Energies and Experimental Energies 343 Formaldehyde (Ground State) Orbital Numbering Orbital Atom n l m x y z exp Hii 1 H-1 1 0 0 -0.550000 0.952600 0.0 1.200–13.60 2 H-2 1 0 0 -0.550000 -0.952600 0.0 1.200–13.60 3 C-3 2 0 0 0.0 0.0 0.0 1.625–19.44 4 C-3 2 1 0 0.0 0.0 0.0 1.625–10.67 5 C-3 2 1 1 0.0 0.0 0.0 1.625–10.67 6 C-3 2 1 1 0.0 0.0 0.0 1.625–10.67 7 O-4 2 0 0 1.220000 0.0 0.0 2.275–32.38 8 O-4 2 1 0 1.220000 0.0 0.0 2.275–15.85 9 O-4 2 1 1 1.220000 0.0 0.0 2.275–15.85 10 O-4 2 1 1 1.220000 0.0 0.0 2.275–15.85 Hij =KSij (Hii +Hjj )/2, with K =1.75. Distance Matrix (a.u.) 1 2 3 4 1 0.0 3.6004 2.0787 3.7985 2 3.6004 0.0 2.0787 3.7985 3 2.0787 2.0787 0.0 2.3055 4 3.7985 3.7985 2.3055 0.0 Total effective nuclear repulsion =17.69537317 a.u. Formaldehyde Eigenvalues Eigenvalues (a.u.) Occ. no. Eigenvalues (a.u.) Occ. no. E(1)= 1.039011 0 E(6) =-0.587488 2 E(2)= 0.472053 0 E(7) =-0.597185 2 E(3)= 0.314551 0 E(8) =-0.611577 2 E(4)=-0.342162 0 E(9) =-0.755816 2 E(5)=-0.517925 2 E(10)=-1.242836 2 Sum=-8.625654 a.u. Total Overlap Matrix 1 2 3 4 5 6 7 8 9 10 1 1.0000 0.1534 0.5133 0.0 -0.2428 0.4204 0.0813 0.0 -0.0729 0.0392 2 0.1534 1.0000 0.5133 0.0 -0.2428 -0.4204 0.0813 0.0 -0.0729 -0.0392 3 0.5133 0.5133 1.0000 0.0 0.0 0.0 0.3734 0.0 -0.3070 0.0 4 0.0 0.0 0.0 1.0000 0.0 0.0 0.0 0.2146 0.0 0.0 5 -0.2428 -0.2428 0.0 0.0 1.0000 0.0 0.4580 0.0 -0.3056 0.0 6 0.4204 -0.4204 0.0 0.0 0.0 1.0000 0.0 0.0 0.0 0.2146 7 0.0813 0.0813 0.3734 0.0 0.4580 0.0 1.0000 0.0 0.0 0.0 8 0.0 0.0 0.0 0.2146 0.0 0.0 0.0 1.0000 0.0 0.0 9 -0.0729 -0.0729 -0.3070 0.0 -0.3056 0.0 0.0 0.0 1.0000 0.0 10 0.0392 -0.0392 0.0 0.0 0.0 0.2146 0.0 0.0 0.0 1.0000 Eigenvectors 1 2 3 4 5 6 7 8 9 10 1 0.5279 0.7683 0.8924 0.0 -0.4281 -0.2016 0.0 -0.2141 -0.2721 0.0011 2 0.5279 0.7683 -0.8924 0.0 0.4281 -0.2016 0.0 0.2141 -0.2721 0.0011 3 -1.3964 -0.5553 0.0000 0.0 -0.0000 -0.0460 0.0 -0.0000 -0.4875 0.2550 4 0.0 0.0 0.0 0.9940 0.0 0.0 0.2456 0.0 0.0 0.0 5 -0.6043 1.1727 -0.0000 0.0 0.0000 0.2768 0.0 0.0000 0.2245 0.0685 6 0.0000 -0.0000 -1.2519 0.0 -0.3813 0.0000 0.0 -0.3179 0.0000 0.0000 7 0.8367 -0.4799 0.0000 0.0 0.0000 -0.0884 0.0 -0.0000 0.3066 0.8481 8 0.0 0.0 0.0 -0.4532 0.0 0.0 0.9181 0.0 0.0 0.0 9 -0.6960 0.3412 -0.0000 0.0 0.0000 -0.8317 0.0 -0.0000 0.3327 0.0252 10 -0.0000 0.0000 0.2511 0.0 0.6475 0.0000 0.0 -0.7600 -0.0000 -0.0000 344 Mulliken Overlap Populations for 12 Electrons 1 2 3 4 5 6 7 8 9 10 1 0.6876 -0.0702 0.2920 0.0 0.1134 0.3890 -0.0210 0.0 -0.0225 -0.0180 2 -0.0702 0.6876 0.2920 0.0 0.1134 0.3890 -0.0210 0.0 -0.0225 -0.0180 3 0.2920 0.2920 0.6097 0.0 0.0 0.0 0.1058 0.0 0.1443 0.0 4 0.0 0.0 0.0 0.1207 0.0 0.0 0.0 0.1936 0.0 0.0 5 0.1134 0.1134 0.0 0.0 0.2634 0.0 0.1876 0.0 0.1880 0.0 6 0.3890 0.3890 0.0 0.0 0.0 0.4929 0.0 0.0 0.0 -0.0046 7 -0.0210 -0.0210 0.1058 0.0 0.1876 0.0 1.6420 0.0 0.0 0.0 8 0.0 0.0 0.0 0.1936 0.0 0.0 0.0 1.6857 0.0 0.0 9 -0.0225 -0.0225 0.1443 0.0 0.1880 0.0 0.0 0.0 1.6061 0.0 10 -0.0180 -0.0180 0.0 0.0 0.0 -0.0046 0.0 0.0 0.0 1.9939 Charge Matrix for MOs with Two Electrons in Each 1 2 3 4 5 6 7 8 9 10 1 0.1664 0.3881 0.4266 0.0 0.4258 0.1088 0.0 0.1476 0.3363 0.0004 2 0.1664 0.3881 0.4266 0.0 0.4258 0.1088 0.0 0.1476 0.3363 0.0004 3 0.9171 0.0561 0.0000 0.0 0.0000 0.0028 0.0 0.0000 0.7358 0.2882 4 0.0 0.0 0.0 1.7825 0.0 0.0 0.2175 0.0 0.0 0.0 5 0.3198 1.1155 0.0000 0.0 0.0000 0.3257 0.0 0.0000 0.1775 0.0614 6 0.0000 0.0000 1.1204 0.0 0.4593 0.0000 0.0 0.4203 0.0000 0.0000 7 0.2082 0.0241 0.0000 0.0 0.0000 0.0021 0.0 0.0000 0.1123 1.6534 8 0.0 0.0 0.0 0.2175 0.0 0.0 1.7825 0.0 0.0 0.0 9 0.2220 0.0282 0.0000 0.0 0.0000 1.4518 0.0 0.0000, 0.3017 -0.0037 10 0.0000 0.0000 0.0264 0.0 0.6891 0.0000 0.0 1.2845 0.0000 0.0000 345 346 Chapter 10 The Extended H¨uckel Method Reduced Overlap Population Matrix Atom by Atom 1 2 3 4 1 0.6876 -0.0702 0.7945 -0.0615 2 -0.0702 0.6876 0.7945 -0.0615 3 0.7945 0.7945 1.4866 0.8148 4 -0.0615 -0.0615 0.8148 6.9277 Orbital Charges 1 1.018973 6 0.879574 2 1.018973 7 1.767672 3 1.026764 8 1.782529 4 0.217471 9 1.749778 5 0.564637 10 1.973630 Net Charges 1 -0.018973 3 1.311555 2 -0.018973 4 -1.273609 Total charge = 0.000000. 10-1. Use the output to determine the orientation of the molecule with respect to Cartesian coordinates. Sketch the molecule in relation to these axes and number the atoms in accord with their numbering in the output. 10-2. Use the orbital numbering data together with the overlap matrix to figure out which of the labels 1s, 2s, 2px, 2py, 2pz goes with each of the ten AOs (i.e., it is obvious that AO 1 is a 1s AO, but it is not so obvious what AO 5 is). 10-3. Use your conclusions from above, together with coefficients in the eigenvector matrix, to sketch the MOs having energies of -0.756, -0.611, and -0.597 a.u. Which of these are p MOs? Which are s MOs? 10-4. Label each of the ten MOs “p” or “s” by inspecting the coefficient matrix. 10-5. What is the Mulliken overlap population between C and O 2pp AOs in this molecule? Should removal of an electron from MO 7 cause the C=O bond to shorten or to lengthen? 10-6. Demonstrate that MO 7 satisfies Eq. (10-24). (Note that Hii values in the first table are in units of electron volts, while orbital energies are in atomic units.) 10-7. Use the reduced overlap population matrix to verify that the sum of AO and overlap populations is equal to the number of valence electrons. 10-8. Using MO 7 as your example, verify that the charge matrix table is a tabulation of the contributions of each MO to gross atom populations. 10-9. In the list of “orbital charges,” are the “orbitals” MOs orAOs? Demonstrate how these numbers are derived from those in the charge matrix. Section 10-4 Extended H¨uckel Energies and Experimental Energies 347 10-10. What is the physical meaning of the “net charges” in the data? Would you characterize these results as indicative of low polarity? Which end of the molecule should correspond to the negative end of the dipole moment? 10-11. How many MOs will be produced by an EHMO calculation on butadiene? References [1] R. Hoffmann, J. Chem. Phys. 39, 1397 (1963). [2] C. E. Moore, Atomic Energy Levels, Natl. Bur. Std. (U.S.) Circ. 467. Natl. Bur. Std.,Washington D.C., 1949. [3] H. A. Skinner and H. O. Pritchard, Trans. Faraday Soc. 49, 1254 (1953). [4] J. Hinze and H. H. Jaff´e, J. Amer. Chem. Soc. 84, 540 (1962). [5] H. Basch, A. Viste, and H. B. Gray, Theoret. Chim. Acta 3, 458 (1965). [6] T. Anno, Theoret. Chim. Acta 18, 223 (1970). [7] J. A. Pople and G. A. Segal, J. Chem. Phys. 43, S136 (1965). [8] R. S. Mulliken, J. Chem. Phys. 23, 1833, 1841, 2338, 2343 (1955). [9] L. Pauling, J. Chem. Phys. 1, 56 (1933). [10] B. L. Kalman, J. Chem. Phys. 60, 974 (1974). [11] D. B. Boyd, Theoret. Chim. Acta 30, 137 (1973). [12] A. B. Anderson and R. Hoffmann, J. Chem. Phys. 60, 4271 (1974). [13] A. B. Anderson, J. Chem. Phys. 62, 1187 (1975). Chapter 11 The SCF-LCAO-MO Method and Extensions 11-1 Ab Initio Calculations A rigorous variational calculation on a system involves the following steps: 1. Write down the hamiltonian operator Hˆ for the system. 2. Select some mathematical functional form . as the trial wavefunction. This form should have variable parameters. 3. Minimize E¯ =  .*Hˆ .dt  .*.dt (11-1) with respect to variations in the parameters. The simple and extended H¨uckel methods are not rigorous variational calculations. Although they both make use of the secular determinant technique from linear variation theory, no hamiltonian operators are ever written out explicitly and the integrations in Hij are not performed. These are semiempirical methods because they combine the theoretical form with parameters fitted from experimental data. The term ab initio (“from the beginning”) is used to describe calculations in which no use is made of experimental data. In an ab initio variational method, all three steps listed above are explicitly performed. In this chapter we describe a certain kind of ab initio calculation called the self-consistent field (SCF) method. This is one of the most commonly encountered types of ab initio calculation for atoms or molecules. We also describe a few popular methods for proceeding beyond the SCF level of approximation. The SCF method and extensions to it are mathematically and physically considerably more complicated than the one-electron methods already discussed. Thus, one normally does not perform such calculations with pencil and paper, but rather with complicated computer programs. Therefore, in this chapter we are not concerned with how one does such calculations because, in most cases, they are done by acquiring a program written by a group of specialists. Rather we are concerned with a description of the mathematical and physical underpinnings of the method. Because the method is simultaneously complicated and rigorously defined, a special jargon has developed. Terms like “Hartree–Fock,” or “correlation energy” have specific meanings and are pervasive in the literature. Hence, a good deal of emphasis in this chapter is put on defining some of these important terms. 348 Section 11-3 The Form of theWavefunction 349 11-2 The Molecular Hamiltonian In practice, one usually does not use the complete hamiltonian for an isolated molecular system. The complete hamiltonian includes nuclear and electronic kinetic energy operators, electrostatic interactions between all charged particles, and interactions between all magnetic moments due to spin and orbital motions of nuclei and electrons. Also an accounting for the fact that a moving particle experiences a change in mass due to relativistic effects is included in the complete hamiltonian. The resulting hamiltonian is much too complicated to work with. Usually, relativistic mass effects are ignored, the Born–Oppenheimer approximation is made (to remove nuclear kinetic energy operators), and all magnetic interactions are ignored (except in special cases where we are interested in spin coupling). The resulting hamiltonian for the electronic energy is, in atomic units, Hˆ =- 1 2 n  i=1 .2 i - N  µ=1 n  i=1 Zµ/rµi + n-1  i=1 n  j=i+1 1/rij (11-2) where i and j are indices for the n electrons and µ is an index for the N nuclei. The nuclear repulsion energy Vnn is Vnn = N-1  µ=1 N  .=µ+1 ZµZ./rµ. (11-3) In choosing this hamiltonian, we are in effect electing to seek an energy of an idealized nonexistent system—a nonrelativistic system with clamped nuclei and no magnetic moments. If we wish to make a very accurate comparison of our computed results with experimentally measured energies, it is necessary to modify either the experimental or the theoretical numbers to compensate for the omissions in Hˆ . 11-3 The Form of theWavefunction The wavefunction for an SCF calculation is one or more antisymmetrized products of one-electron spin-orbitals. We have already seen (Chapter 5) that a convenient way to produce an antisymmetrized product is to use a Slater determinant. Therefore, we take the trial function . to be made up of Slater determinants containing spin-orbitals f. If we are dealing with an atom, then the f’s are atomic spin-orbitals. For a molecule, they are molecular spin-orbitals. In our discussion of many-electron atoms (Chapter 5), we noted that certain atoms in their ground states are fairly well described by assigning two electrons, one of each spin, to each AO, starting with the lowest-energy AO and working up until all the electrons are assigned. If the last electron completes the filling of all the AOs having a given principal quantum number, n, we have a closed shell atomic system. Examples are He(1s2) and Ne(1s22s22p6). Atoms wherein the last electron completes the filling of all AOs having a given l quantum number are said to have a closed subshell. An example is Be(1s22s2). Both types of system tend to be well approximated by a single determinantal wavefunction if the highest filled level is not too close in energy to the 350 Chapter 11 The SCF-LCAO-MO Method and Extensions lowest empty level. (Beryllium is the least successfully treated of these three at this level of approximation because the 2s level is fairly close in energy to the 2p level.) A similar situation holds for molecules; that is, the wavefunctions of many molecules in their ground states are well represented by single determinantalwavefunctions with electrons of paired spins occupying identical MOs. Such molecules are said to be closed-shell systems. We can represent a trial wavefunction for a 2n-electron closed-shell system as .closed shell = f1 (1) ¯ f1 (2)f2 (3) ¯ f2 (4) ···fn (2n-1) ¯ fn (2n) (11-4) where we have used the shorthand form for a Slater determinant described in Chapter 5. For the present, we restrict our discussion to closed-shell single-determinantal wavefunctions. 11-4 The Nature of the Basis Set Some functional form must be chosen for theMOsf. The usual choice is to approximate f as a linear combination of “atomic orbitals” (LCAO), theseAOs being located on the nuclei. The detailed nature of these AOs, as well as the number to be placed on each nucleus, is still open to choice. We consider these choices later. For now, we simply recognize that we are working within the familiar LCAO-MO level of approximation. If we represent the basis AOs by ., we have, for the ith MO, fi = j cji.j (11-5) where the constants cji are as yet undetermined. 11-5 The LCAO-MO-SCF Equation Having a hamiltonian and a trialwavefunction, we are nowin a position to use the linear variation method. The detailed derivation of the resulting equations is complicated and notationally clumsy, and it has been relegated to Appendix 7. Here we discuss the results of the derivation. For our restricted case of a closed-shell single-determinantal wavefunction, the variation method leads to Fˆfi =ifi (11-6) These equations are sometimes called the Hartree–Fock equations, and Fˆ is often called the Fock operator. The detailed formula for Fˆ is (from Appendix 7) Fˆ(1)=- 1 2.2 1 - µ Zµ/rµ1 + n  j=1 (2Jˆj -Kˆj ) (11-7) The symbols Jˆj and Kˆj stand for operators related to the 1/rij operators in Hˆ . Jˆj is called a coulomb operator because it leads to energy terms corresponding to charge cloud repulsions. It is possible to write Jˆj explicitly: Jˆj = f*j (2) 1/r12)fj (2)dt (2 (11-8) Section 11-6 Interpretation of the LCAO-MO-SCF Eigenvalues 351 Kˆj leads ultimately to the production of exchange integrals, and so it is called an exchange operator. It is written explicitly in conjunction with a function on which it is operating, viz. Kˆjfi(1)= f*j (2) (1/r12)fi(2)dt (2)fj (1) (11-9) Notice that an index exchange has been performed. It is not difficult to see that the expression (see Appendix 9 for bra-ket notation) fi |Fˆ|fi =i (11-10) will lead to integrals such as fi |Jˆj |fi  = fi (1)fj (2) |1/r12|fi (1)fj (2)=Jij (11-11) fi |Kˆj |fi  = fi (1)fj (2) |1/r12|fi (2)fj (1)=Kij (11-12) which are formally the same as the coulomb and exchange terms encountered in Chapter 5 in connection with the helium atom. Notice that, if the spins associated with spin-orbitals fi and fj differ, Kij must vanish. This arises because integrations over space and spin coordinates of electron 1 (or 2) in Eq. (11-12) lead to integration over two different (and orthogonal) spin functions. On the other hand, Jij is not affected by such spin agreement or disagreement. It would appear from Eq. (11-6) that the MOs f are eigenfunctions of the Fock operator and that the Fock operator is, in effect, the hamiltonian operator. There is an important qualitative difference between Fˆ andHˆ , however. The Fock operator is itself a function of the MOs f. Since the summation index j in Eq. (11-7) includes i, the operators Jˆi and Kˆi must be known in order to write down Fˆ, but Jˆi and Kˆi involve fi , and fi is an eigenfunction of Fˆ. Hence, we need Fˆ to find fi , and we need fi to know Fˆ. To circumvent this problem, an iterative approach is used. One makes an initial guess at the MOs f. (One could use a semiempirical method to produce this starting set.) Then these MOs are used to construct an operator Fˆ, which is used to solve for the new MOs f. These are then used to construct a new Fock operator, which is in turn used to find new MOs, which are used for a new Fˆ, etc., until at last no significant change is detected in two successive steps of this procedure. At this point, the f’s produced by Fˆ are the same as the f’s that produce the coulomb-and-exchange fields in Fˆ. The solutions are said to be self-consistent, and the method is referred to as the self-consistent-field (SCF) method. 11-6 Interpretation of the LCAO-MO-SCF Eigenvalues The physical meaning of an eigenvalue i is best understood by expanding the integral i =fi |Fˆ|fi  (11-13) with Fˆ given by Eq. (11-7). We obtain i =fi  - 1 2.2 1  fi- µ fi Zu/rµ1 fi+ n  j=1 2Jij -Kij  (11-14) 352 Chapter 11 The SCF-LCAO-MO Method and Extensions It is common practice to combine the first two terms of Eq. (11-14), which depend only on the nature of fi , into a single expectation value of the one-electron part of the hamiltonian, symbolized Hii . Thus, i =Hii + n  j=1 (2Jij -Kij ) (11-15) The quantity Hii is the average kinetic plus nuclear-electronic attraction energy for the electron in fi . The sum of coulomb and exchange integrals in Eq. (11-15) contains all the electronic interaction energy. Observe that the index j runs over all the occupied MOs. For a particular value of j , say j =k =i, this gives 2Jij -Kij as an interaction energy. This means that an electron in fi , experiences an interaction energy with the two electrons in fk of 2 fi(1)fk(2)|1/r12|fi(1)fk(2)-fi(1)fk(2) |1/r12|fk(1)fi(2) (11-16) The first part is the classical repulsion between the electron having an orbital charge cloud given by |fi |2 and the two electrons having charge cloud |fk|2. The second part is the exchange term which, as we saw in Chapter 5, arises from the antisymmetric nature of the wavefunction. It enters (11-16) only once because the electron in fi , agrees in spin with only one of the two electrons in fk, [Equation (11-15) applies because we have restricted our discussion to closed-shell systems.] The summation over 2Jij -Kij includes the case j = i. Here we get 2Jii -Kii . However, examination of Eqs. (11-11) and (11-12) shows that Jii =Kii , and so we are left with Jii . This corresponds to the repulsion between the electron in fi (the energy of which we are calculating) and the other electron in fi . Because these electrons must occur with opposite spin, there is no exchange energy for this interaction. In brief, then, the quantity i , often referred to as an orbital energy or a one-electron energy, is to be interpreted as the energy of an electron in fi , resulting from its kinetic energy, its energy of attraction for the nuclei, and its repulsion and exchange energies due to all the other electrons in their charge clouds |fj |2. 11-7 The SCF Total Electronic Energy It is natural to suppose that the total electronic energy is merely the sum of the oneelectron energies, but this is not the case in SCF theory. Consider a two-electron system. The energy of electron 1 includes its kinetic and nuclear attraction energies and its repulsion and exchange energies for electron 2. The energy of electron 2 includes its kinetic and nuclear attraction energies and its repulsion and exchange energies for electron 1. If we sum these, we have accounted properly for kinetic and nuclear attraction energies, but we have included the interelectronic interactions twice as much as they actually occur. (The energy of repulsion, say, between two charged particles, 1 and 2, is given by the repulsion of 1 for 2 or of 2 for 1, but not by the sum of these.) Therefore, if we sum one-electron energies, we get the total electronic energy plus an extra measure Section 11-8 Basis Sets 353 of electron repulsion and exchange energy. We can correct this by subtracting this extra measure away. Thus, for our closed-shell system Eelec = n  i=1 . . 2i - n  j=1 (2Jij -Kij ) . . (11-17) where the summation is over the occupied orbitals. Comparing Eq. (11-17) with (11-18) makes it evident that we can also write Eelec = n  i=1 . . 2Hii + n  j=1 (2Jij -Kij ) . . (11-18) or Eelec = n  i=1 (i +Hii) (11-19) To obtain the total (electronic plus nuclear) energy, we add the internuclear repulsion energy for the N nuclei: Etot = Eelec +Vnn (11-20) Vnn = N-1  µ=1 N  .=µ+1 ZµZ. rµ. (11-21) 11-8 Basis Sets A great deal of research effort has gone into devising and comparing basis sets for ab initio calculations. There are essentially two important criteria: 1. Wewant a basis set that is capable of describing the actualwavefunction well enough to give chemically useful results. 2. Wewant a basis set that leads to integrals Fij and Sij that we can evaluate reasonably accurately and quickly on a computer. Many types of basis set have been examined and two of these have come to dominate the area of ab initio molecular calculations. These two, which we refer to as the gaussian and the Slater-type-orbital (STO) basis sets, are actually very similar in many important respects. Let us consider the STO basis set first. The essence of this basis choice is to place on each nucleus one or more STOs. The number of STOs on a nucleus and the orbital exponent of each STO remain to be chosen. Generally, the larger the number of STOs and/or the greater the care taken in selecting orbital exponents, the more accurate the final wavefunction and energy will be. At the least sophisticated end of the spectrum of choices is the minimal basis set of STOs, which we encountered in Chapter 7. This includes only those STOs that correspond to occupied AOs in the separated atom limit. If we choose a minimal basis set, 354 Chapter 11 The SCF-LCAO-MO Method and Extensions then we must still decide how to evaluate the orbital exponents in the STOs. One way is to use Slater’s rules, which are actually most appropriate for isolated atoms. Another way is to vary the orbital exponents until the energy of the molecular system is minimized. This amounts to performing a nonlinear variational calculation along with the linear variational calculation. For molecules of more than a few atoms, this procedure consumes much computer time, for reasons we will describe shortly, but for small molecules (two or three first-row atoms plus a few hydrogens) it is possible to accomplish this task. From this, one discovers what orbital exponent best suits an STO in a molecular environment. This leads us to the third way of choosing orbital exponents— choose the values that were found best for each type of atom in nonlinear variational calculations in smaller molecules. One may improve the basis by adding additional STOs to various nuclei. Suppose, for example, each carbon 2pAO were represented as a linear combination of two p-type STOs, each having a different orbital exponent. An example of the basic principle involved is indicated in Fig. 11-1. If we treat these functions independently and do a linear variational calculation, they will both be mixed into the final wavefunction to some degree. If the linear coefficient for the “inner” STO is much larger, it means that the p-type charge cloud around this atom in the molecule is calculated to be fairly contracted around the nucleus. To describe a more diffuse charge cloud, the wavefunction would contain quite a lot of the “outer” STO, and not so much of the “inner” STO. Thus, we have a linear variation procedure that, in effect, allows for AO expansion and contraction. It is akin to optimizing an orbital exponent, but it does not require nonlinear variation. Of course, one still has to choose the values of the “larger . ” and “smaller . ” of Fig. 11-1. This is normally done by optimizing the fit to very accurate atomic wavefunctions or by a nonlinear variation on atoms. A basis set in which every minimal basis AO is represented by an “inner-outer” pair of STOs is often referred to as a “double-zeta” basis set. Afurther kind of extension is frequently made. In addition to the above types of STO, one includes STOs with symmetries different from those present in the minimal basis. This has the effect of allowing charge to be shifted in or out of bond regions in new ways. For example, one could add p-type STOs on hydrogen nuclei. By mixing this with the s-type STOs there, one can describe a skewed charge distribution in the regions of the protons. We have already seen (Chapter 7) that a hydrogen atom in a uniform Figure 11-1  Radial functions R(r)=r exp(-.r) for 2p-type STOs. The larger . value gives an STO more contracted around the nucleus. Hence, it is sometimes called the “inner” STO. Section 11-8 Basis Sets 355 electric field is polarized in a way that is reasonably well described by an s-p linear combination. Since the hydrogen atom in a molecule experiences an electric field due to the remainder of the molecule, it is not surprising that such p functions are indeed mixed into the wavefunction by the variational procedure if we provide them in the basis set. Similarly, d-type STOs may be added to atoms that, in the minimal basis set, carried only s- and p-type STOs. Functions of this nature are often called polarization functions because they allow charge polarization to occur within the molecule as a result of the internally generated electric field. It should be evident that one could go on indefinitely, adding more and more STOs to the basis, even placing some of them in bonds, rather than on nuclei. This is not normally done because the computing task goes up enormously as we add more basis functions. In fact, the number of integrals to be calculated eventually increases as N4, where N is the number of basis functions. The evaluation of integrals can be a logistic bottleneck in ab initio calculations, and for this reason nonlinear variations (of orbital exponents) are impractical for any but smaller molecules. Each new orbital exponent value requires re-evaluation of all the integrals involving that orbital. In essence, a change of orbital exponent is a change of basis set. In linear variations, the basis functions are mixed together but they do not change. Once all the integrals between various basis functions have been evaluated, they are usable for the remainder of the calculation. The STO basis would probably be the standard choice if it were not for the fact that the many integrals encountered in calculating Fij elements are extremely time consuming to evaluate, even on a computer. This problem has led to the development of an alternative basis set class that is based on gaussian-type functions. Gaussian functions include an exponential term of the form exp(-ar2). The radial dependence of such a function is compared to that for a hydrogenlike 1s function (which is identical to a 1s STO) in Fig. 11-2. There are two obvious problems connected with using gaussian functions as basis functions: 1. They do not have cusps at r =0 as s-type hydrogen-like AOs do. 2. They decay faster at larger r than do hydrogen-like AOs. Both of these deficiencies are relevant in molecules because, at r =0 (on a nucleus) and at r=8, the molecular potential is like that in an atom, so similar cusp and asymptotic behavior are expected for molecular and atomic wavefunctions. Balanced against these deficiencies is an advantage: gaussian functions have mathematical properties that make it extremely easy to compute the integrals they produce in Fij . This has led to a practice of replacing each STO in a basis set by a number of gaussian functions. Figure 11-2  Radial dependence of hydrogen-like and gaussian functions. 356 Chapter 11 The SCF-LCAO-MO Method and Extensions By choosing several values of a in exp(-ar2), one can create a set of “primitive” gaussian functions ranging from very compact to very diffuse, and then take a linear combination of these to build up an approximation to the radial part of an STO function. Multiplication by the standard . and f dependences (spherical harmonics) generates p, d, etc. functions. Once this approximation is optimized, the linear combination of gaussian functions is “frozen,” being treated thereafter as a single function insofar as the subsequent molecular variational calculation is concerned. This linear combination of primitive gaussian functions is called a contracted gaussian function. Once we have a contracted gaussian function corresponding to each STO, we can go through the same hierarchy of approximations as before—minimal basis set, double- . basis set, double-. plus polarization functions-only now using contracted gaussian functions in place of STOs. Typically, ab initio calculations on systems involving only light elements, e.g., H–Ne, involve anywhere from 1 to 15 primitive gaussian functions for each contracted gaussian function. Basis sets for heavier elements, however, can contain more than 30 primitive gaussians for each contracted gaussian. We have already described a certain amount of quantum-chemical jargon. Some of the basis set descriptions that one commonly encounters in the modern literature are as follows: • DZP [double-. gaussian basis with polarization] • STO-3G [each STO approximated as a linear combination of three gaussian primitives] • 6-31G [each inner shell STOrepresented by a sum of six gaussians and each valence shell STO split into inner and outer parts (i.e., double-. ) described by three and one gaussian primitives, respectively] • 6-31G* [the 6-31G basis set augmented with six d-type gaussian primitives on each heavy (Z >2) atom, to permit polarization] • 6-31G** [same as 6-31G* but with a set of gaussian p-type functions on H and He atoms. Good for systems where hydrogen is a bridging atom, as in diborane or in hydrogen bonds] • 6-31+G* [the 6-31G* basis set augmented with a set of diffuse s- and p-type gaussian functions on each heavy atom, to permit representation of diffuse electronic distribution, as in anions] • cc-pVnZ, n=D,T, Q, 5 [correlation consistent polarized valence n-. gaussian basis sets. The inner shell STOs are described by single contracted gaussian functions while the valence STOs are described by n contracted gaussian functions, n=D for double-. , n = T for triple-. , etc. Both the number and angular momentum symmetry type of the polarization functions are increased with each successive correlation consistent basis set in a systematic manner. For example, the cc-pVDZ basis set has a set of 5 d-type gaussian primitive functions on each heavy atom, while the cc-pVTZ basis set has 2 sets of d-type gaussian functions and one set of 7 f-type gaussian primitives. These families of basis sets are designed to converge the total energy to the complete basis set limit for the SCF method and its extensions.] Section 11-10 Correlation Energy 357 • aug-cc-pVnZ [cc-pVnZ basis sets augmented with one set of diffuse primitive gaussian functions for each angular momentum symmetry present in the cc-pVnZ basis set, to provide an accurate description of anions and weak interactions, e.g., van derWaals forces and hydrogen bonding.] 11-9 The Hartree–Fock Limit It should be apparent that different choices of basis set will produce different SCF wavefunctions and energies. Suppose that we do an SCF calculation on some molecule, using a minimal basis set and obtain a total electronic energy E1. If we now choose a double-. basis and do a new SCF calculation, we will obtain an energy E2 that normally will be lower than E1. (If one happens to choose the first basis wisely and the second unwisely, it is possible to find E2 higher than E1. We assume here that each improvement to the basis extends the mathematical flexibility while including the capabilities of all preceding bases.) If we now add polarization functions and repeat the SCF procedure, we will find E3 to be lower than E2. We can continue in this way, adding new functions in bonds and elsewhere, always increasing the capabilities of our basis set, but always requiring that the basis describe MOs in a single determinantal wavefunction. The electronic energy will decrease with each basis set improvement, but eventually this decrease will become very slight for any improvement; that is, the energy will approach a limiting value as the basis set approaches mathematical completeness. This limiting energy value is the lowest that can be achieved for a single determinantal wavefunction. It is called the Hartree–Fock energy. The MOs that correspond to this limit are called Hartree–Fock orbitals (HF orbitals), and the determinant is called the HF wavefunction. Sometimes the term restricted Hartree–Fock (RHF) is used to emphasize that the wavefunction is restricted to be a single determinantal function for a configuration wherein electrons of a spin occupy the same space orbitals as do the electrons of ß spin. When this restriction is relaxed, and different orbitals are allowed for electrons with different spins, we have an unrestricted Hartree–Fock (UHF) calculation. This refinement is most likely to be important when the numbers of a- and ß-spin electrons differ. We encountered this concept in Section 8-13, where we noted that the unpaired electron in a radical causes spin polarization of other electrons, possibly leading to negative spin density. 11-10 Correlation Energy The Hartree–Fock energy is not as low as the true energy of the system. The mathematical reason for this is that our requirement that . be a single determinant is restrictive and we can introduce additional mathematical flexibility by allowing. to contain many determinants. Such additional flexibility leads to further energy lowering. There is a corresponding physical reason for the HF energy being too high. It is connected with the independence of the electrons in a single determinantal wavefunction. To understand this, consider the four-electron wavefunction . = f1(1)f1(2)f2(3)f2(4) (11-22) 358 Chapter 11 The SCF-LCAO-MO Method and Extensions Recall from Chapter 5 that the numbers in parentheses stand for the spatial coordinates of an electron; that is, f1(1) really means f1(x1, y1, z1)a(1) or f1(r1, .1,f1)a(1).1 In other words, if we pick values of r, ., and f for each of the four electrons and insert them into Eq. (11-22) we will be able to evaluate each function and we will obtain a determinant of numbers which can be evaluated to give a numerical value for . and .2. The latter number (times d.) can be taken as the probability for finding one electron in the volume element around r1, .1, and f1, another electron simultaneously in d.2 at r2, .2, and f2, etc. The important point to notice is that the effect on .2 of a particular choice of r1, .1, and f1, is not dependent on choices of r, ., f for other electrons because the form of the wavefunction is products of functions of independent coordinates. Physically, this corresponds to saying that the probability for finding an electron in d.1, at some instant is not influenced by the presence or absence of another electron in some nearby element d.2, at the same instant. This is consistent with the fact that the Fock operator Fˆ [Eq. (11-7)] treats each electron as though it were moving in the time-averaged potential field due to the other electrons. Because electrons repel each other, there is a tendency for them to keep out of each other’s way. That is, in reality, their motions are correlated. The HF energy is higher than the true energy because the HF wavefunction is formally incapable of describing correlated motion. The energy difference between the HF and the “exact” (for a simplified nonrelativistic hamiltonian) energy for a system is referred to as the correlation energy. 11-11 Koopmans’ Theorem Despite the fact that the total electronic energy is not given by the sum of SCF oneelectron energies, it is still possible to relate the i ’s to physical measurements. If certain assumptions are made, it is possible to equate orbital energies with molecular ionization energies or electron affinities. This identification is related to a theorem due to Koopmans. Koopmans [1] proved2 that the wavefunction obtained by removing one electron from fk, or adding one electron to the virtual (i.e., unoccupied) MO fj in a Hartree– Fock wavefunction is stable with respect to any subsequent variation in fk, or fj . Notice that this ignores the question of subsequent variation of all of the MOs f with unchanged occupations. It is not necessarily true that they remain optimized, since the potential they experience is changed by addition or removal of an electron. Nevertheless, Koopmans’ theorem suggests a model. It suggests that we approximate the wavefunction for a positive ion by removing an electron from one of the occupied HF MOs for a neutral molecule without reoptimizing any of the MOs. Let us do this and compare the electronic energies for the two wavefunctions. For the neutral molecule, which we assume is a closed-shell system, E = i . . 2Hii + j (2Jij -Kij ) . . (11-23) 1Note that f1 in parentheses represents a coordinate of electron 1, whereas f1 outside the parentheses represents an MO. 2See also Smith and Day [2]. Section 11-11 Koopmans’ Theorem 359 For the cation, produced by removing an electron from fk, E+k = i=k . . 2Hii + j =k (2Jij -Kij ) . .+Hkk + i=k (2Jik -Kik) (11-24) The first sum in Eq. (11-24) gives the total electronic energy due to all but the unpaired electron in fk. Hkk gives the kinetic and nuclear attraction energies for the unpaired electron and the final sum gives the repulsion and exchange energy between this electron and all the others. Nowwe note that the last sum is exactly equal to the void produced in the first sum due to the restriction j =k. Therefore, we can combine these by removing the index restriction and deleting the last sum. This gives E+k = i=k . . 2Hii + j (2Jij -Kij ) . .+Hkk (11-25) To compare this with E of (11-23) we should remove the remaining index restriction. We do this by allowing i to equal k in the sum and simultaneously subtracting the new terms thus produced: E+k = i . . 2Hii + j (2Jij -Kij ) . .-Hkk - j (2Jkj -Kkj ) (11-26) But, by virtue of Eqs. (11-15) and (11-23), this is E+k =E -k (11-27) Hence, the ionization energy I 0 k , for ionization from the fk is I 0 k =E+k -E=-k (11-28) This illustrates that, within the context of this simplified model, the negative of the orbital energies for occupied HF MOs are to be interpreted as ionization energies. Another way to see the relation between I 0 k and-k, is to recognize that the physical interactions lost upon removal of an electron from .k, are precisely those that constitute k, [See Eq. (11-15).] A similar result holds for orbital energies of unoccupied HF MOs and electron affinities. (However, this is less successful in practice; see Problem 11-3.) In actuality, the relation (11-28) is only approximately obeyed. One reason for this has to do with our assumption that doubly occupied SCF MOs produced by a variational procedure on the neutral molecule will be suitable for the doubly occupied MOs of the cation as well. These MOs minimize the energy of the neutral molecule but give an energy for the cation that is higher than whatwould be produced by an independent variational calculation. For this mathematical reason, we expect the Koopmans’ theorem prediction for the ionization energy to be higher than the value predicted by taking the difference between separate SCF calculations on the molecule and cation (which we will symbolize SCF). The corresponding physical argument is that use of Eq. (11-28) views ionization as removal of an electron without any reorganization of the remaining 360 Chapter 11 The SCF-LCAO-MO Method and Extensions TABLE 11-1  Ionization Energies (in electron volts) of Water as Measured Experimentally and as Predicted from SCF Calculations SCF (near HF limit)b Cation state Observeda Koopmans SCF 2B2 12.62 13.79 11.08 2A1 14.74 15.86 13.34 2B2 18.51 19.47 17.61 aFrom Potts and Price [3]. bFrom Dunning et al. [4]. electronic charge. This neglects a process that stabilizes the cation and lowers the ionization energy. Whichever argument we choose, we have here a reason for expecting - to be an overestimate of the value obtained by independent calculations, SCF. Another error results from the neglect of change in correlation energy. We have seen that the total SCF energy for the molecule is too high because the single determinantal form of thewavefunction cannot allowfor correlated electronic motion. The SCF energy for the cation is too high for the same reason, but the error is different for the two cases because there are fewer electrons in the cation. We expect the neutral molecule to have the greater correlation energy (since it has more electrons)3 so that proper inclusion of this feature would lower the energy of the neutral molecule more than the cation, making the true I 0 k larger than that obtained by neglect of correlation. Hence, this leads us expect SCF to underestimate I 0 k . Since - overestimates SCF, and SCF underestimates the ionization energy, we can expect some cancellation of errors in using Eq. (11-28). An illustration of these relations is provided in Table 11-1, where observed vertical ionization energies (i.e., no nuclear relaxation), the appropriate values of -, and the values of SCF are compared. 11-12 Configuration Interaction There are several techniques for going beyond the SCF method and thereby including some effects of electron correlation. Some extremely accurate calculations on small atoms and molecules, making explicit use of interparticle coordinates, were described in Section 7-8. There is one general technique, however, that has traditionally been used for including effects of correlation in many-electron systems. This technique is called configuration interaction (CI). The mathematical idea of CI is quite obvious. Recall that we restricted our SCF wavefunction to be a single determinant for a closed-shell system. To go beyond the optimum (restricted Hartree–Fock) level, then, we allow the wavefunction to be a linear 3This reasoning is rather naive. Significant correlation energy contribution can result from a small energy-level separation between filled and empty MOs (rather than from merely the number of electrons), but production of a cation should normally increase this gap and lead to reduced correlation. Section 11-12 Configuration Interaction 361 combination of determinants. Suppose we choose two determinants D1 and D2, each corresponding to a different orbital occupation scheme (i.e., different configurations). Then we can let . =c1D1 +c2D2 (11-29) and minimize E as a function of the linear mixing coefficients c1 and c2. If we go through the mathematical formalism and express E¯ as .|Hˆ |./.|., expand this as integrals over D1 and D2, and require .E¯/.ci =0, we obtain the same sort of 2×2 determinantal equation that we find when minimizing an MO energy as a function of mixing of two AOs. That is, we obtain  H11 -E¯S11 H12 -E¯S12 H21 -E¯S21 H22 -E¯S22  =0 (11-30) where now Hij = Di  ˆ H Dj (11-31) Sij = Di |Dj  (11-32) We see that, whereas before we might have had two AOs interacting to form two MOs, here we have two configurations (i.e., two determinantal functions) interacting to form two approximate wavefunctions. Our example involves only two configurations, but there is no limit to the number of configurations that can be mixed in this way. Since each configuration D contains products of MOs, each of which is typically a sum ofAOs, the integrals Hij and Sij can result in very large numbers of integrals over basis functions when they are expanded. This is the sort of situation where a computer is essential, and CI on atoms and molecules, while still expensive compared to SCF, have become routine on modern computers. Our purpose in this chapter is not to describe how to carry out a CI calculation, but rather to convey what a CI calculation is and what its predictive capabilities are. Therefore, we will not concern ourselves with the mathematical complexities of evaluating Hij and Sij .4 But we will consider one practical aspect of CI calculations, namely, how one goes about choosing which configurations should be mixed together, and which ones may be safely ignored. We begin by considering the H2 molecule. The LCAO-MO-SCF method expresses the ground state wavefunction for H2 as .(1, 2)=  1sg(1)a(1) 1sg(2)a(2) 1sg(1)ß(1) 1sg(2)ß(2)  (11-33) that is, as the configuration 1s2 g . The SCF procedure mixes the AO basis functions together in the optimum way to produce the 1sg MO. We have noted at several points in this book that, if one begins with a basis set of n linearly independent functions, one ultimately arrives at n independent MOs. Hence, the 1sg MO of Eq. (11-33) is but one of several MOs produced by the SCF procedure. 4In most actual calculations, the D’s are orthonormal, and Sij = dij . 362 Chapter 11 The SCF-LCAO-MO Method and Extensions It is called an occupied MO because it is occupied with electrons in this configuration. All the other MOs in this case are unoccupied or virtual MOs. The virtual MOs of H2 have symmetry properties related to the molecular hamiltonian, just as does the occupied MO. Thus, we can refer to 1su, 2sg, 2su, 1pu, 1pg, etc., virtual MOs of H2. Which of these virtual MOs are produced by an SCF calculation depends on the number and nature of the AO basis set provided at the outset. If no p-type AOs are provided, no p-type MOs will be produced. If only a minimal basis (1sa and 1sb) is provided, 1su will be the only virtual MO produced. It is important to distinguish between the physical content of occupied versus virtual SCF MOs. The SCF procedure finds the set of occupied MOs for a system leading to the lowest SCF electronic energy. The virtual orbitals are the residue of this process. The virtual MOs span that part of the basis set function space that the SCF procedure found least suitable for describing.. The subspace is sometimes referred to as the orthogonal complement of the occupied orbital subspace. (Note that this situation differs from that pertaining to H¨uckel-type calculations, where MOs and energy levels are calculated without regard for electron occupancy. Only after the variational procedure are electrons added.) Our concern with virtual MOs is due to the fact that they provide a ready means for constructing new configurations to mix with our 1s2 g configuration for H2. Thus, using some of the above-mentioned virtual MOs, we could write determinantal functions corresponding to the excited configurations 1sg1su, 1sg2sg, 1sg2su, 1sg1pu, etc.5 These are commonly referred to as singly excited configurations because one electron has been promoted from a ground-state-occupied MO to a virtual MO. (This is not meant to imply that the orbital energy difference is equal to the expected spectroscopic energy of the transition.) It is also possible to construct doubly excited configurations, such as 1s2 u , 1su2sg, 2s2 g , 1su2su, 1su1pu, etc. For systems having more electrons, one can write determinants corresponding to triple, quadruple, etc., excitations. If one has a reasonably large number, say 50, of virtual orbitals and, say, 10 electrons to distribute among them, then there is an enormous number of possible configurations. A major step in doing a CI calculation is deciding which configurations might be important in affecting the results and ought therefore to be included. We can gain insight into this problem by considering our minimal basis set H2 problem in more detail. We have 1sg = Ng(1sA +1sB) (11-34) 1su = Nu(1sA -1sB) (11-35) where Ng and Nu are normalization constants. The spatial part of the ground configuration is .space =1sg(1)1sg(2) (11-36) 5As was shown in Chapter 5, the symmetry requirements of the wavefunction require that each of these open shell configurations be expressed as a linear combination of two 2×2 determinants; for example, 1sg2su stands for the combination 1/v2  1sg(1)2 ¯su(2) ± 1 ¯sg(1)2su(2)  Section 11-12 Configuration Interaction 363 which expands to .space =N2 g [1sA (1) 1sA (2)+1sB (1) 1sB (2)+1sA (1) 1sB (2)+1sB (1) 1sA (2)] If both electrons are near nucleus A, the first term is quite large. This may be rephrased to say that .2 gives a sizable probability for finding both electrons near nucleusA. The second term gives a similar likelihood for finding both electrons near B. These two terms are referred to as ionic terms because they become large whenever the instantaneous electronic dispositions correspond to H-AH+B and H+AH-B , respectively. The last two terms cause .2 to be sizable whenever an electron is near each nucleus. Hence, these are called covalent terms, and their presence means that . contains significant “covalent character.” In fact, because all four terms have the same coefficient, the configuration 1s2 g is said to have 50% covalent and 50% ionic character. Is this bad? It turns out to be no problem at all when the nuclei are close together. Indeed, in the united-atom (helium) limit, the ionic-covalent distinction vanishes. But at large internuclear separations it is very inaccurate to describe H2 as 50% ionic. In reality, H2 dissociates to two neutral ground state H atoms—that is, 100% “covalent,” with an electron near each nucleus. In short, the SCF-MO description does not properly describe the molecule as it dissociates. This means that the calculation of E¯ versus RAB for H2 will deviate from experiment more and more as RAB increases. This defect in the SCF treatment of H2 occurs for many other molecular species also. Can we correct this defect through use of CI?We ask the question this way: “What configuration could we mix with 1s2 g in order to make the mixture of covalent and ionic character variable?” Since 1s2 g expands to give us covalent and ionic terms of the same sign, we need an additional configuration that will give them with opposite sign. Then admixture of the two configurations will affect the two kinds of term differently. The configuration that will accomplish this is 1s2 u : 1su (1) 1su (2) = N2 u [1sA (1) 1sA (2)+1sB (1) 1sB (2)-1sA (1) 1sB (2) -1sB (1) 1sA (2)] (11-37) Mixing these two configurations together gives . (c1/c2) = c11sg (1) 1sg (2)+c21su (1) 1su (2) = c1N2 g +c2N2 u  [1sA (1) 1sA (2)+1sB (1) 1sB (2)] + c1N2 g -c2N2 u  [1sA (1) 1sB (2)+1sB (1) 1sA (2)] (11-38) If c1/c2 is readjusted at each value of RAB to minimize E¯, it is evident that the relative weights of covalent and ionic character in Eq. (11-38) will change to suit the circumstances. Actual calculations on this system show that, as RAB gets large, c1/c2 approaches a value such that c1N2 g + c2N2 u approaches zero, so that the ionic component of . vanishes. This example illustrates that CI of this sort has an associated physical picture. It suggests that, in any CI calculation involving the dissociation (or extensive stretching) of a covalent bond, important configurations are likely to include double excitations into the antibonding virtual “mates” of occupied bonding MOs. 364 Chapter 11 The SCF-LCAO-MO Method and Extensions What about other configurations for H2? What will 1sg2sg do for the calculation, assuming now an extended basis set has produced a 2sg MO? Suppose we take as our trial function . =c11s2 g +c21sg2sg (11-39) where the configurations are understood to stand for determinants. If the 1sg MO has been produced by an SCF calculation on the ground state, and 2sg is a virtual MO from that SCF calculation, then it is possible to show that the CI energy minimum occurs when c2 in Eq. (11-39) is zero. In other words, these determinants will not mix when they are combined in this way. An equivalent statement is that the mixing element H12 = 1s2 g |Hˆ |1sg2sg vanishes. Hence, the CI determinant (11-33) is already in diagonal form, and no variational mixing will occur. This is an example of Brillouin’s theorem, which may be stated as follows: EXAMPLE 11-1 If D1 is an optimized single determinantal function and Dj is a determinant corresponding to any single excitation out of an orbital fj occupied in D1 and into the virtual subspace (orthogonal complement) ofD1, then no improvement in energy is possible by taking . =c1D1 +c2Dj . The proof of Brillouin’s theorem is very simple. We start with a basis set that spans a function space. An SCF calculation is performed, which produces the best single-determinantal wavefunction we can possibly get within this function space. This isD1. Dj differs fromD1 in only one orbital, which means they differ in only one row. A general property of determinants is that, if two of them differ in only one row or column, any linear combination of the two can be written as a single determinant (see Problem 11-4). This means that any combination c1D1 +c2Dj is still expressible as a single determinant. Since Dj makes no use of functions outside our original basis set, c1D1 + c2Dj is a single determinant within our original function space. However,D1 is already known to be the single determinant within this function space that gives the lowest energy, and c1D1 +c2Dj cannot do better. QED. A doubly excited configuration differs from D1 in two rows, and mixing such a configuration with D1 produces a result that cannot be expressed as a single determinant. Because of Brillouin’s theorem, one might decide to omit all single excitations from CI calculations. But it is important to recognize that singly excited configurations can affect the results of CI calculations in the presence of doubly excited configurations. This comes about because nonzero mixing elements can occur between singly and doubly excited configurations in the CI determinant. To illustrate, let .0 be an SCF single determinant, .1 be a singly excited configuration, and .2 be a “double.” Then the CI determinant could be, assuming orthogonal determinants,  H00 -E 0 H02 0 H11 -E H12 H02 H12 H22 -E  =0 (11-40) Section 11-13 Size Consistency 365 The zeros result from Brillouin’s theorem. However, H12 does not necessarily vanish, and solution of this 3×3 determinantal equation leads to a wavefunction of the form . =c0.0 +c1.1 +c2.2 (11-41) with c1 not zero. .1 comes in on the coattails of .2 and is referred to as a secondorder correction. This is not a guarantee that it will be unimportant, however. (See Example 7-4 for similar behavior in a different context.) Another rule that is useful for recognizing configurations that may be omitted is the rather obvious one that each configuration must share the same set of eigenvalues for operators commuting with the hamiltonian. That is, if . is to be associated with a particular symmetry, angular momentum, spin angular momentum, etc., then each configuration in . must have that same symmetry, angular momentum, etc. This means that, for the ground state of H2, 1s2 g will not mix with 1sg1su because the latter has overall u symmetry. 1su2su could contribute, but the symmetrized combination corresponding to the singlet state (|1su2 ¯su|-|1 ¯su2su|) must be used rather than the (positive) triplet state combination. The configuration 1su1pu will not contribute because it has the wrong total angular momentum. Even with the aid of all these rules, a calculation on a molecule such as N2 or O2 using a reasonably extended basis set gives rise to an enormous number of possible configurations. Additional rules of thumb have been found to help choose the major configurations. It has been found, for example, that triply or higher excited configurations are usually of lesser importance than doubly excited configurations. [Since the hamiltonian contains only one- and two-electron operators, interaction elements must vanish between the ground-state configuration and all triply or higher-excited configurations. But, like singly excited configurations, these can, in principle, come in on the coattails of doubly (or other) excited configurations.] In addition, a study of the energy change in some process involving primarily the valence electrons (e.g., stretching N2) really does not require calculation of the correlation energy of the 1s electrons since they are fairly unaffected by the change. Any correlation energy for these electrons tends to cancel itself when initial and final state energies are subtracted. Therefore, in a CI calculation of such a process, it is reasonable to omit configurations corresponding to excitation of a 1s electron unless high accuracy is desired. The acronym CID refers to a CI calculation in which only all doubly excited configurations are included. Inclusion of all singly and doubly excited configurations is referred to as a CISD calculation. Full CI (FCI) means all excited configurations have been included, and this is the limit that gives all of the correlation energy within the chosen basis set. The combination of full CI and a complete basis gives the exact energy (generally nonrelativistic and within the Born–Oppenheimer approximation). 11-13 Size Consistency and the Møller–Plesset and Coupled Cluster Treatments of Correlation Whenever certain parts of a well-defined procedure are omitted, as when full CI is truncated to CID or CISD, one must consider whether systematic errors are introduced. This is indeed the case in the above example. Suppose CID calculations are made for the energy of N2 as a function of internuclear distance. At short distances, we treat the 366 Chapter 11 The SCF-LCAO-MO Method and Extensions system as a 14-electron molecule, including configurations in which 12 of the electrons are in their HF-occupied MOs. At very large distances we have two nitrogen atoms, which we normally treat as having twice the energy of one atom. Now CID on atomA includes the HF configuration, DA 0 , as well as doubly excited configurations in which five of the seven electrons are in their HF-occupied AOs. Let DA 2 represent this class of configuration. Then .A = c0DA 0 + c2DA 2 . Atom B has a similar CID wavefunction: .B =c0DB 0 +c2DB 2 . The wavefunction for the overall, noninteracting system is the antisymmetrized product of these wavefunctions. It will contain terms like DA 0 DB 0 , DA 0 DB 2 , and DA 2 DB 2 . There are no terms present corresponding to a single excitation at each atom, DA 1 DB 1 , and such terms would be present in a CID treatment of the combined system. Also, there is a class of terms present corresponding to four promoted electrons,DA 2 DB 2 , and these termswould not be present in a CID treatment of the combined system. The dilemma is that, if we treat the system as a single 14-electron “molecule,” which is appropriate at small R, we mix in different terms than if we treat it as two separate atoms, which is appropriate at large R. If we choose some fairly large R value to redefineN2 as two separate atoms, we change the nature of the CI in a discontinuous way at an arbitrary point. This feature of truncated CI is called the problem of size consistency; CIDandCISDmethods are not size consistent. Doing separate calculations on each of two separated atoms and combining the energies yields a different result from doing a calculation on one system made up of two separated atoms. A correlation method that is size consistent has been developed by Pople and co-workers.6 It is based on perturbation theory that was introduced many years ago by Møller and Plesset.7 This approach divides the process of treating correlation into a series of corrections to an unperturbed starting point. If one chooses to do such a calculation to, say, third order (MP3, standing for Møller–Plesset to third order), then the set of configurations to be included is determined by the perturbation formulas.8 It does not require further decision by the person doing the calculation and can be wholly managed by a computer program. Møller–Plesset perturbation theory is different from standard CI in at least two important respects: It is size consistent, and it is not variationally bound. One cannot assume, therefore, that going to higher and higher orders of perturbation will cause the calculated energy to approach closer and closer to the true energy from above. Because of the way MP theory defines the unperturbed system, the starting point energy (MP0) is the sum of HF one-electron energies. The first-order correction to the energy (MP1) brings in the appropriate electronic coulomb and exchange integrals, giving the correct HF energy. MP2 brings in contributions wherein doubly excited configurations “interact with” (i.e., occur in the same integral with) the ground configuration. MP3 adds contributions due to doubly excited configurations interacting with each other. MP4 brings in interactions involving also single, double, triple, and quadruple excitations. The selection of interaction terms by the perturbation formalism is what produces size consistency, but it leaves out certain terms at each level that would be included in a variational calculation. In coupled cluster (CC) approaches, which are also size consistent and generally not variationally bound, instead of including all configurations to a particular order as 6See Binkley and Pople [5]. 7See Møller and Plesset [6]. 8Perturbation theory is presented in Chapter 12. The present discussion avoids mathematical details. Section 11-14 Multideterminant Methods 367 in MP theory, each class of excited configurations is included to infinite order. This is accomplished via an exponential excitation operator, CC =eTˆ.0 = 1+Tˆ + ˆ T 2 2! + ˆ T 3 3! +. . .  .0 (11-42) where .0 is the HF determinant for an N-electron system, and Tˆ = Tˆ1 + Tˆ2 + Tˆ3 +···+TˆN. Tˆ1 produces singly excited determinants, Tˆ2 doubly excited ones, and so on. Because of the exponential nature of the excitation operator, each class of excitations is included to all orders, e.g., terms in Tˆ2 would include products of double excitations (Tˆ2 2 ) that would be considered a subset of the possible quadruple excitations in CI. This is what makes CC theory size consistent. Usually coupled cluster theory is truncated to include just Tˆ1 and Tˆ2, i.e., CCSD. One of the most accurate post-HF methods has been shown to be the CCSD(T) method,in which a CCSD calculation is followed by a contribution due to triple excitations (Tˆ3) via perturbation theory. 11-14 Multideterminant Methods Up to this point, the methods that have been presented for describing electron correlation effects have been constructed with the single determinant SCF wavefunction as a starting point. For most molecules near their equilibrium geometries, this is a very good zeroth-order approximation, but as we saw earlier for the H2 molecule, as covalent bonds are stretched towards dissociation multiple determinants are required for even a qualitative description. This puts much stronger demands on these so-called single reference methods, and their accuracy can be much degraded or even unphysical in these regions. In a multiconfigurational SCF (MCSCF) calculation one writes the wavefunction as a linear combination of determinants exactly as in a CI calculation, and the energy is minimized as a function of the linear CI coefficients. However, in an MCSCF calculation one also simultaneously optimizes the MO coefficients of the orbitals that are used to construct the determinants, using methods analogous to SCF theory. Because this greatly adds to the complexity of the calculation, the number of determinants used in MCSCF is generally much smaller than in a standard HF-based CI calculation. In the simplest case, only the additional determinants that allow for a qualitative treatment of the process under study are included, e.g., one would include only the determinants corresponding to excitations of bonding electrons into their respective antibonding orbitals when stretching the triple bond of N2. This procedure results in a set of MCSCF molecular orbitals (some strongly occupied, some weakly occupied) that smoothly changes in character from equilibrium to dissociation. In multireference CISD (MRCISD) calculations, the wavefunction is written as . = i ci.i + s cs.s + d cd.d (11-43) wherei ci.i is the set of MCSCF reference determinants, .s are new determinants formed by single excitations into the virtual orbitals relative to all of the reference determinants, and .d are doubly excited determinants. An MRCISD calculation of this type can yield a very balanced and accurate description of a molecule’s potential energy surface, but often at a relatively steep cost in terms of computational requirements. 368 Chapter 11 The SCF-LCAO-MO Method and Extensions 11-15 Density Functional Theory Methods The wavefunction . for an n-electron molecule is a function of 3n spatial coordinates and n spin coordinates. From . we can produce the molecule’s spin-free electron density function, .(1), by integrating .*. over all of the spin coordinates and all the the space coordinates except those for one of the electrons:9 .(1)= |.(1, 2, . . . ,N)|2d.1dt2 . . . dtn (11-44) which is a function of only the three spatial coordinates.10 We have seen that, in the early days of quantum chemistry, a major challenge was the evaluation of integrals over the interelectronic-repulsion term in the hamiltonian, as well as dealing with the related problem of electron correlation. Several methods were devised that attempted to approximate these quantities from the density function .(1), with moderate success. However, the continuing progress in computer speed and the development of sophisticated ab initio methods gradually shifted attention away from approaches using the density function. In 1964, proof by Hohenberg and Kohn [7] of a connection between the ground state energy, E0, for a system and .0, the ground state density function,11 sparked new interest in finding a rigorous way to go from knowledge of the attractively simple three-dimensional density function to a value for E0. Recall that, for a system having n electrons and N nuclei, the hamiltonian operator for the electronic energy is H =- 1 2 n  i=1 .2 i + n  i=1 N  a=1 -Za ria + n-1  i=1 n  j=i+1 1 rij (11-45) The first and last terms can be written down immediately if we know how many electrons are present, but the middle term depends onN a=1 -Za ria , which is a function of nuclear charges and locations. This quantity is called the external potential, symbolized vext ( r), because it results from the presence of fields produced by particles not included in the group of electrons. Hohenberg andKohn were able to prove that there is a uniqueness relation between .0 and the external potential: Notwo external potentials could give the same .0. This raises the possibility that one could work backwards from .0 to find vext ( r) and then E0. The following route comes first to mind: Integrate .0 to get the number of electrons n. Figure out vext ( r) from .0. This would allow one to write down the hamiltonian operator. Then, using ab initio methods, one could get to an accurate E0 and .0, and from .0 one could calculate T0, Vne0 , Vee0 , and all the other properties of interest for the system. Two problems exist with this scenario. First, there is no generally applicable procedure known for getting from .0 to vext ( r). We can posit that vext ( r) is a functional of .0, which we symbolize vext [.0], but we don’t know what the functional relationship is. Second, even if we could get back to the hamiltonian operator, it would simply 9Because . is antisymmetric for exchange of electrons, the density function is independent of our choice as to which electron’s coordinates should be spared from integration. 10Recall that . is the spin coordinate and t is the coordinate for space and spin. dt = dv d.. 11We henceforth suppress the electron index in .. Section 11-15 Density Functional Theory Methods 369 land us back on square one: We would still have to solve the whole problem in the traditional way. Nevertheless, the hopes raised by this uniqueness theorem have led to the current goal of density functional theory, which is to find a procedure that takes us from . to E in a rigorous way that avoids the complexities of landing on square one and proceeding using standard ab initio methods. [8] Asubsequent relation proved by Hohenberg andKohn [7] indicated away to proceed. They proved that an approximate density function, .0,approx, when subjected to the (unknown) procedure that relates the exact .0 to the exact E0, must yield an energy higher than the exact E0 :E0,approx =E0, so a variational bound exists. Note that the unknown process referred to here is one that assumes vext ( r) to be the same for the analysis of .0 and .0,approx, which means that the same nuclear framework applies in both cases. If a procedure were known for finding E from ., then the existence of a variational bound would allow a variational procedure analogous to what we have applied earlier. One would start with a trial ., calculate its energy, and vary . to locate the . that gives the lowest energy. The barrier to proceeding is the lack of a way to get E from .. Hence, the development of approximate functionals that relate the energy to the electron density is an extremely active area of current research and probably will be for some time to come. In analogy to wavefunction methods, the functional that connects E to ., E[.], can be separated into an electronic kinetic energy contribution, T [.], a contribution due to nuclear-electron attractions, Ene[.], and the electron-electron repulsions, Eee[.]. The latter term can be further decomposed into Coulomb and exchange terms, J [.] and K[.]. Both the nuclear-electron attraction and the interelectronic Coulomb terms can be easily written in terms of the density using their classical expressions as in wavefunction methods. For an accurate treatment of the electronic kinetic energy term, however, one must differentiate a wavefunction,12 and this has led to the practice first proposed by Kohn and Sham [9] of expressing the density in terms of one-electron orbitals f (constructed numerically or from a basis set of Slater or gaussian functions). These orbitals serve two purposes. They allow us to calculate a value of the kinetic energy within a single Slater determinant framework similar to Hartree–Fock theory, TS = n  i=1 fi|- 1 2.2|fi  (11-46) and to obtain the electron density, defined in terms of these Kohn–Sham orbitals as .s = n  i |fi |2 (11-47) The final DFT energy expression is then written as EDFT [.]=TS[.]+Ene[.]+J [.]+Exc[.] (11-48) where the exchange correlation functional Exc[.] contains the difference between the exact kinetic energy and TS, the nonclassical (exchange) part of electron-electron repulsions, K[.], and correlation contributions to both K[.] and J [.]. The Kohn–Sham 12As far as we know, we must differentiate a wavefunction to get kinetic energy. If there is a functional that permits us to obtain kinetic energy directly from the density function, we might avoid having to use orbitals. 370 Chapter 11 The SCF-LCAO-MO Method and Extensions orbitals are eigenfunctions of an effective one-electron hamiltonian that is nearly identical in form to the Fock operator in the SCF equations. In theKohn–Sham case, however, the HF exchange operators are replaced by the functional derivative of the exchange correlation energy. Assuming the existence of Exc[.] and an initial guess for the electron density, one then solves the Kohn–Sham eigenvalue equations for the orbitals, which can then be used to define a new electron density and effective hamiltonian. These iterations continue until the density is converged to within a specified threshold. The exact form of Exc[.] is not currently known, however, and a rapidly growing list of approximate exchange correlation functionals have appeared in the literature. Because these are all estimates of a part of the overall energy, the total energy finally calculated is not an upper bound to the true energy. Also, DFT is not sizeconsistent. Generally, most existing exchange correlation functionals are split into a pure exchange and correlation contribution, Ex[.] and Ec[.] and the current functional nomenclature often reflects this with two-part acronyms, e.g., the BLYP DFT method uses an exchange functional from Becke (B) [10] and a correlation functional by Lee, Yang, and Parr (LYP) [11]. In principle, the exchange contribution could be calculated exactly (for a single determinant) in the same manner as TS, but this is generally not done since this disturbs the balance between Ex[.] and Ec[.]. In hybrid DFT, a percentage of this exact exchange is included in Exc[.]. The great benefit of present day DFT methods is computational cost. With the exchange correlation functionals commonly used, the computational effort is similar to a SCF calculation, but since Exc[.] implicitly includes some amount of electron correlation, the accuracy of DFT (depending on the chosen functional) is often similar to that obtained with MP2 or better. The great weakness of DFT at the present time, however, is the inability to systematically improve upon Exc[.] and converge towards the exact Born–Oppenheimer energy like one might conceptually do in a wavefunctionbased CI or CC calculation, e.g., SCF, CCSD, CCSDT, CCSDTQ, etc. with sequences of correlation consistent basis sets. One of the simplest DFT methods is the local density approximation (LDA), which assumes the density behaves locally like a uniform electron gas. Generally this does not lead to an accurate description of molecular properties, but if one makes Ex[.] and Ec[.] depend also on the gradient of the density, yielding gradient corrected DFT or the generalized gradient approximation (GGA), the results are much more accurate. Finally, the definition of Exc[.] also lends itself to semiempirical contributions. One such parameterization that has been very successful is the B3LYP hybrid DFT method, which includes 20% exact exchange and involves three semiempirical parameters that were obtained by fits to experimental thermochemical data (heats of formation, etc.) of small molecules [12]. 11-16 Examples of Ab Initio Calculations Self-consistent-field and correlated calculations have now been made for a very large number of systems. The best way to judge the capabilities of these methods is to survey some of the results.13 13For extensive surveys, see Schaefer [13], Hehre et al. [14], and Raghavachari [15]. Section 11-16 Examples of Ab Initio Calculations 371 Table 11-2 provides information on energies for a number of atoms in their ground states. Self-consistent-field energies are presented for three levels of basis set complexity. In the STO single-. level, a minimal basis set of one STO per occupied AO is used, and the energy is minimized with respect to independent variation of every orbital exponent . . The STO double-. basis set is similar except that there are two STOs for each AO, the only restriction being that the STOs have the same spherical harmonics as the AOs to which they correspond. The Hartree–Fock energies are estimated by extrapolating from more extensive basis sets, and represent the limit achievable for the SCF approach using a complete basis set. We can make the following observations: 1. The improvement in energy obtained when one goes from a single-. to a double-. STO basis set is substantial, especially for atoms of higher Z. 2. The agreement between the optimized double-. data and the HF energies is quite good. Even for neon, the error is only about 10-2 a.u. (0.27 eV). Thus, for atoms, the double-. basis is capable of almost exhausting the energy capabilities of a single-configuration wavefunction. 3. The disagreement between HF and “exact” energies (i.e., the correlation energy) grows progressively larger down the list. For neon it is almost 0.4 a.u. (10 eV), which is an unacceptable error in chemical measurements. One might think that the magnitude of the correlation energy in these examples would make SCF calculations on heavy atoms useless for quantitative purposes, but this TABLE 11-2  Ground-State Energies (in atomic units) for Atoms, as Computed by the SCF Method and from Experiment STO Atom Single . a Double . a Hartree–Focka Exactb Correlationc energy He -2.8476563 -2.8616726 -2.8616799 -2.9037 -0.0420 Li -7.4184820 -7.4327213 -7.4327256 -7.4774 -0.0447 Be -14.556740 -14.572369 -14.573021 -14.6663 -0.0933 B -24.498369 -24.527920 -24.529057 -24.6519 -0.1228 C -37.622389 -37.686749 -37.688612 -37.8420 -0.1534 N -54.268900 -54.397951 -54.400924 -54.5849 -0.1840 O -74.540363 -74.804323 -74.809370 -75.0607 -0.2513 F -98.942113 -99.401309 -99.409300 -99.7224 -0.3131 Ne -127.81218 -128.53511 -128.54705 -128.925 -0.378 Ar -525.76525 -526.81511 -526.81739 -527.542 -0.725 aFrom Roetti and Clementi [16]. b“Exact” equals experimental with relativistic correction but without correction for Lamb shift. See Veillard and Clementi [17]. cCorrelation energy is “exact” minus HF energy. 372 Chapter 11 The SCF-LCAO-MO Method and Extensions is not the case. Most frequently we are not concerned with the value of the total energy of a system so much as with energy changes (e.g., in spectroscopy or in reactions) or else with other properties such as transition moments (for spectroscopic intensities) or, in molecular systems, dipole moments. Let us, therefore, see how well SCF calculations can predict atomic ionization energies. We have already indicated (Section 11-11) that there are two ways we can get ionization energies from SCF calculations. The first, and simplest, is to take the various -i , as suggested by Koopmans’ theorem. Table 11-3 shows that this gives only rough agreement with experimental values for neon. Another way is to do separate SCF calculations for each excited state produced by removal of an electron from an orbital (i.e., for each “hole state”) and equate the ionization energies to the energy differences between these and the neutral ground state ( SCF). This second method requires much more effort. As Table 11-3 indicates, however, the extra effort leads to great improvement in agreement between theoretical and experimental values. We conclude that SCF calculations on atoms and ions give quantitatively useful data on ionization energies, even for ionization out of deep-lying levels. The Koopmans’ theorem approach is less accurate, although still qualitatively useful. A related problem is the calculation of energies of excited states of atoms. Weiss [19] has reported calculations on some of the excited states of carbon, and his results are summarized in Fig. 11-3. Inspection of this figure reveals that near-HF calculations only roughly reproduce the energy spectrum, but CI (with four or five configurations) brings about marked improvement. Weiss has omitted configurations involving excitations of 1s electrons, and so these results ignore correlation energy for the inner-shell electrons. The agreement suggests that these electrons experience almost no change in correlation for transitions among these states. Weiss has also calculated oscillator strengths14 associated with atomic transitions and he finds that CI is necessary before reasonable agreement with experiment is achieved. TABLE 11-3  Ionization Energies of Neona Ionization potential (a.u.) Ion configuration Koopmans SCF Experiment 1s2s22p6 32.7723 31.9214 31.98 1s22s2p6 1.9303 1.8123 1.7815 1s22s22p5 0.8503 0.7293 0.7937 aFrom Bagus [18]. The basis set includes 5 s-type and 12 p-type STOs (4 of each m quantum number). . ’s were varied as well as linear coefficients. The neutral ground state gives E=-128.547 a.u. (compare Table 11-2). 14The oscillator strength is a measure of the probability (i.e., intensity) of a transition. For a transition between states a and b in a 2n-electron system it is commonly given by the formula 2 3 (Eb -Ea)   .*a .. . 2n  i=1 ri .. . .b dt  2 Section 11-16 Examples of Ab Initio Calculations 373 Figure 11-3  Transition energies in the C+ ion as calculated by HF, CI, and as measured. (From Weiss [19].) Ionization from the ground state of C+ occurs at 0.8958 a.u. In brief, then, the evidence indicates that reasonably accurate atomic ionization energies can be obtained by high-quality SCF calculations on the neutral and ionized species ( SCF, not-), but that transition energies and intensities require CI sufficient to account for much of the valence electron correlation. Before we leave the subject of atoms, it should be pointed out that, for any atom, the expectation value T¯ of the kinetic energy operator is equal to -E¯ if the wavefunction has been optimized with respect to a scale factor in the coordinates r1, r2, etc. This relation, called the virial relation, is proved in Appendix 8. It is necessarily satisfied for any level of calculation that cannot be improved by replacing every ri in . by .ri and allowing . to vary. Since the single- and double-. STO solutions have already been optimized with respect to such scale parameters, they satisfy the virial relation. Thus, for the berylliumatom, the single-. STO value for E¯ is (Table 11-2)-14.556740 a.u., and so we know that T¯ = +14.556740 a.u. and V¯ = -29.113480 a.u. for this wavefunction (since E¯ = T¯ + V¯ ). For the double-. wavefunction T¯ =+14.572369 a.u., etc. The Hartree–Fock wavefunction is, by definition, the lowest-energy solution achievable within a restricted (single determinantal15) wavefunction form. Use of a scale factor does not affect the wavefunction form. Hence, no further lowering of E¯ below the HF level is possible in this way, and the HF energies E¯, T¯, and V¯ must satisfy the virial relation also. Finally, the exact energies are the lowest achievable for 15For open-shell systems, more than one determinant may be needed to satisfy symmetry requirements. This is still considered a HF wavefunction. 374 Chapter 11 The SCF-LCAO-MO Method and Extensions any wavefunction. Again, scaling cannot lower the energy further, so these energies also satisfy the virial relation. We turn next to ab initio calculations on molecules. First, let us compare HF and exact energies for molecules as we did for atoms and see how large the errors due to correlation are. The results are not too different from those for atoms having the same number of electrons, as shown in Table 11-4; that is, the correlation energies for molecules having ten electrons (CH4, NH3, H2O, HF) are about the same as that for neon, whereas that for the 18-electron molecule H2O2 is more like the correlation energy for argon. But this is only a very rough rule of thumb. We have already indicated that the correlation energy in a molecule varies with bond length, a factor not present in atomic problems. In order to get a more meaningful idea of the capabilities of ab initio calculations on molecules, we must look more closely at specific examples. A calculation on the OH radical, reported by Cade and Huo [21], provides a good example of the capabilities of the extended basis set LCAO-MO-SCF technique on a small molecule. Their final wavefunction for the ground state at an internuclear separation R =1.8342 a.u. is presented in Table 11-5. A minimal basis set of STOs for OH would include 1s, 2s, 2px, 2py, and 2pz STOs on oxygen and a single 1s AO on hydrogen. Cade and Huo chose a much more extensive basis. Oxygen is the site for two 1s, two 2s, one 3s, four 2p, one 4f, eight 2pp, two 3dp , and four 4fp STOs. On hydrogen, there are two 1s, one 2s, one 2ps , two 2pp , and two 3dp STOs. (The p-type basis functions are indicated in Table 11-5 for only one of the two directions perpendicular to the O–H axis.) The orbital exponents for all of these STOs have been optimized, and the resulting wavefunction is of “near-Hartree–Fock” quality. The optimized . values appear in Table 11-5. The STO labeled s2po is located on oxygen and has the formula s2po =(2.13528)5/2p-1/2r exp(-2.13528r) cos . (11-49) TABLE 11-4  Estimated Hartree–Fock and Correlation Energies for Selected Moleculesa and Atomsb Molecule or atom E (HF) (a.u.) E (correlation) (a.u.) Molecule or atom E (HF) (a.u.) E (correlation) (a.u.) H2 -1.132 -0.043 Ne -128.547 -0.378 He -2.862 -0.042 CO -112.796 -0.520 BH3 -26.403 -0.195 N2 -108.994 -0.540 O(1D) -74.729 -0.262 Si (1D) -288.815 -0.505 CH4 -40.219 -0.291 B2H6 -52.835 -0.429 NH3 -56.225 -0.334 S (1D) -397.452 -0.606 H2O -76.067 -0.364 H2O2 -150.861 -0.688 HF -100.074 -0.373 Ar -526.817 -0.725 aFrom Ermler and Kern [20]. bSee Table 11-2. Section 11-16 Examples of Ab Initio Calculations 375 TABLE 11-5  Near Hartree–Fock Wavefunction for the OH Molecule in Its Ground-State Configuration (1s22s23s21p3) at R = 1.8342 a.u.a .s C1s C2s C3s .p C1p s1so (. = 7.01681) 0.94291 -0.25489 0.07625 p2po (. = 1.26589) 0.37429 s1so (12.38502) 0.09313 0.00358 -0.00153 p2po (2.11537) 0.46339 s2so (1.71794) -0.00162 0.46526 -0.20040 p2po (3.75295) 0.23526 s2so (2.86331) 0.00418 0.55854 -0.18328 p2po (8.41140) 0.01023 s3so (8.64649) -0.03826 -0.02643 0.00550 p3do (1.91317) 0.02871 s2po (1.28508) -0.00055 0.05179 0.30153 p4fo (2.19941) 0.00506 s2po (2.13528) -0.00056 -0.07538 0.37791 p2pH (1.76991) 0.02442 s2po (3.75959) 0.00115 0.01874 0.18390 p3dH (3.32513) 0.00282 s2po (8.22819) 0.00059 0.00229 0.00952 s3do (1.63646) -0.00047 0.02437 0.04676 s3do (2.82405) 0.00016 0.00845 0.01595 s4fo (2.26641) -0.00013 0.00882 0.01232 s1sH (1.31368) 0.00150 -0.04651 0.21061 s1sH (2.43850) -0.00034 0.09413 0.05113 s2sH (2.30030) 0.00000 0.07654 0.04539 s2pH (2.8052) 0.00018 0.01182 0.00999 aFrom Cade and Huo [21]. There are three s-type MOs and two p-type MOs to accommodate the nine electrons of this radical. One p-type MO is f1py = 0.37429p2poy +0.46339p2poy +0.23526p2poy +0.01023p2poy + 0.02871p3doy +0.00506p4foy +0.02442p2pHy +0.00282p3dHy (11-50) and the other occupied p-type MO would be the same except with x instead of y. (The z axis is coincident with the internuclear axis.) It is evident that writing out the complete wavefunction given in Table 11-5 would result in a very cumbersome expression. It is a nontrivial problem to relate an accurate but bulky wavefunction such as this to the kinds of simple conceptual schemes chemists like to use. One solution is to have a computer produce contour diagrams of the MOs. Such plots for the valence MOs 2s, 3s, and 1p of Table 11-5 are presented in Fig. 11-4. Cade and Huo [21] carried out similar calculations for OH at 13 other internuclear distances and also for the united atom (fluorine) and the separated atoms in the states with which the Hartree–Fock wavefunction correlates. Some of their data are reproduced in Table 11-6. A plot of the electronic-plus-nuclear repulsion energies is given in Fig. 11-5 along with the experimentally derived curve. It is evident that the near HF curve climbs too steeply on the right, leading to too “tight” a potential well for nuclear motion and too small an equilibrium internuclear separation. This comes about because, as mentioned earlier, the HF solution dissociates to an incorrect mixture of states, some 376 Chapter 11 The SCF-LCAO-MO Method and Extensions Figure 11-4  Contour plots of HF valence orbitals for OH as given in Table 11-5. (From Stevens et al. [22].) TABLE 11-6  Spectroscopic Parameters and Dipole Moment for OH from Theoretical Curves and from Experimenta Wavefunction Dipole moment (debyes) Re (a.u.) De (eV) .e (cm-1) .exe (cm-1) ae (cm-1) SCF 1.780 1.795 8.831 4062.6 165.09 0.661 CI 1.655 1.838 4.702 3723.6 83.15 0.628 Experimental 1.66±.01 1.834 4.63 3735.2 82.81 0.714 aFrom Stevens et al. [22]. of which are ionic. It is possible to use the HF curve of Fig. 11-5 to derive theoretical values for molecular constants that can be compared to spectroscopic data. The results are displayed in Table 11-6, and they reflect the inaccuracy in the HF energy curve. Included there are the SCF and experimental values for the molecular dipole moment. We turn now to the behavior of V¯ /T¯ for the HF wavefunctions of Cade and Huo at various internuclear separations. The data appear in Table 11-7. Observe that the Section 11-16 Examples of Ab Initio Calculations 377 Figure 11-5  Theoretical and experimental energy curves for OH (from Stevens et al. [22].) value of -2.00000 for V¯ /T¯ occurs at three values of R: 0,8, and the point where E¯ is a minimum. At R =0 and 8, we are dealing with one or two atoms, for which we have already seen the HF solution should give V¯ /T¯ =-2. At intermediate R we have a diatomic molecule, for which the virial relation is (see Appendix 8) 2T¯ +V¯ +R .E¯ .R =0 (11-51) There are three cases to consider. If .E¯/.R = 0, then V¯ /T¯ = -2. This will occur at the minimum of the potential energy curve (and also at any subsidiary maxima or minima). If .E¯/.R is negative, then, since V¯ /T¯ = -2 - (R/T¯)(.E¯/.R) and T¯ is TABLE 11-7  HF Total Energies and V¯/T¯ for OH as a Function of Internuclear Distancea R (a.u.) E (a.u.) V¯/T¯ R (a.u.) E (a.u.) V¯/T¯ 0 -99.40933 -2.00000 1.90 -75.41837 -2.00129 1.40 -75.34382 -1.99076 2.00 -75.41140 -2.00225 1.50 -75.38378 -1.99398 2.10 -75.40163 -2.00300 1.60 -75.40696 -1.99651 2.25 -75.38372 -2.00380 1.70 -75.41829 -1.99850 2.40 -75.36367 -2.00433 1.75 -75.42065 -1.99933 2.60 -75.33582 -2.00474 1.795 -75.42127 -2.00000 2.80 -75.30822 -2.00492 1.8342 -75.42083 -2.00052 8 -75.30939 -2.00000 aFrom Cade and Huo [21]. 378 Chapter 11 The SCF-LCAO-MO Method and Extensions positive, V¯ /T¯ will be algebraically higher than -2 (e.g.,-1.98). If .E¯/.R is positive, V¯ /T¯ will be lower than -2. Thus, the values of V¯ /T¯ in Table 11-7 reflect the slope of a line tangent to the potential energy curve at each R value. As mentioned earlier, it is possible to at least partly include the effects of electron correlation by allowing determinants corresponding to other configurations to mix into the wavefunction. Such calculations have been performed for the OH radical by several groups, and the results of Stevens et al. [22] are included in Table 11-6 and Fig. 11-5. These data come from intermixing 14 configurations. It is evident that the inclusion of correlation through CI has markedly improved the agreement with experiment. Many diatomic molecules have been treated at a comparable and higher level, and it is clear that ab initio calculations including electron correlation are capable of giving quite accurate molecular data. In cases in which the diatomic system is experimentally elusive, such calculations may be the best source of data available. Afurther example of this is provided in Table 11-8, in which are listed dipole moments for ground and some excited states of diatomic molecules. The dipole moments computed from near-HF wavefunctions contain substantial errors. It can be seen that CI greatly improves dipole moments. It has been observed that inclusion of singly excited configurations is very important in obtaining an accurate dipole moment. As a general rule, CID correlates electron motion and therefore has a significant energy-lowering effect but has little effect on the one-electron distribution or related properties, like dipole moment. Inclusion of singly excited configurations (CISD) allows the one-electron distribution to shift in response to the change in calculated interelectronic repulsion. For example, the value of the ground-state dipole moment of CO (entry 4 of Table 11-8) is calculated at the CID level to be -0.20D and at the CISD level to be +0.12D. Thus CID may be a suitable level of computational effort if the interest is in energy, but CISD is better if the interest is in one-electron properties. TABLE 11-8  Calculated and Experimental Dipole Moments of Diatomic Molecules (in Debyes) Molecule and polarity State HF at Re a CI at Re a Experiment Reference Li+H- X 1+ 6.002 5.86 5.83 [23] C+N- X 2+ 2.301 1.465 1.45±0.08 [24] C-N+ B 2+ — 0.958 1.15±0.08 [24] C-O+ X 1+ -0.274 0.12 0.112±0.005 [23] C+O- A 3 2.34 1.43 1.37 [25] C-S+ X 1+ 1.56 2.03 1.97 [23] C-S+ A 1 -0.09 0.63 0.63±0.04 [26] C-H+ X 2 1.570 1.53 1.46±0.06 [27] O-H+ X 2 1.780 1.655 1.66±0.01 [27] F-H+ X 1 1.942 1.805 1.797 [27] N-H+ X 3- 1.627 1.537 Unknown [27] aExperimental Re Value used. Section 11-16 Examples of Ab Initio Calculations 379 Highly accurate properties can be obtained with more sophisticated electron correlation methods, such as CCSD(T) or MRCISD. Other examples of the use of ab initio methods on small molecules are shown in Table 11-9, which displays some calculated properties for the electronic ground states of H2 andN2 as a function of method with the cc-pVTZ basis set. In the case ofH2, the SCF bond length and harmonic frequency are both slightly too large compared to experiment, but the dissociation energy is underestimated by more than 30 kcal/mole due to the lack of electron correlation. For this two-electron system, the CISD and CCSD methods are equivalent to a FCI and exhibit marked improvement compared to SCF. The remaining deviations from experiment at this level of theory can be attributed to the use of the finite cc-pVTZ basis set. The MP methods show systematic improvement with each order of perturbation theory, but even a fourth-order treatment of single and double excitations results in non-negligible errors compared to FCI for this simple system. The B3LYP hybrid density functional method is observed to yield very reliable properties in this case. As might be expected due to its triple bond, the N2 molecule is considerably more challenging for ab initio methods. With the cc-pVTZ basis set, the SCF dissociation energy is smaller than experiment by nearly a factor of 2. Appreciable differences are now observed between the CISD and CCSD results, with the latter being somewhat closer to experiment. In addition, triple excitations, as measured by the difference between CCSD and CCSD(T), are relatively important for N2, raising the dissociation energy by nearly 9 kcal/mole. In contrast to the H2 case, the results for N2 using perturbation theory (MP2, MP3, MP4) display a disturbing oscillatory behavior. This type of result with MP methods has been the subject of several previous studies.16 TABLE 11-9  Calculated Equilibrium Bond Lengths, Harmonic Vibrational Frequencies, and Equilibrium Dissociation Energies for the Ground States of H2 and N2 with the cc-pVTZ Basis Set Compared to Experiment H2 N2 re(Å) .e(cm-1) De (kcal/mole) re(Å) .e(cm-1) De (kcal/mole) SCF 0.734 4587 83.7 1.067 2732 120.4 CISD 0.743 4409 108.4 1.089 2509 193.1 MP2 0.737 4526 103.6 1.114 2195 228.7 MP3 0.739 4476 107.1 1.090 2532 206.0 MP4 0.741 4441 108.0 1.113 2192 221.2 CCSD 0.743 4409 108.4 1.097 2424 207.7 CCSD(T) 1.104 2346 216.5 B3LYP 0.743 4419 110.3 1.092 2449 229.6 Expt. [28] 0.741 4403 109.5 1.098 2359 228.4 16See Dunning and Peterson [29] and references therein. 380 Chapter 11 The SCF-LCAO-MO Method and Extensions Lastly, it is again the case that the B3LYP method yields relatively accurate results for this molecule and appears to be comparable in quality to MP2. The results shown in Table 11-10 explore the choice of basis set with the CCSD and CCSD(T) methods for the H2 and N2 molecules, respectively. A large dependence on basis set is observed in each case. The use of a minimal basis set, STO-3G, leads to large errors since it provides very fewvirtual orbitals for electron correlation. Just a double-. basis set, either 6-31G** or cc-pVDZ, is observed to be a great improvement. The systematic convergence of the correlation consistent basis sets is readily observed in these results. One should note that increasing the size of the basis set from cc-pVTZ to cc-pV5Z in N2 results in an increase in De by nearly 9 kcal/mole. This implies that the highly accurate result for De shown in Table 11-9 for the MP2 level of theory with the cc-pVTZ basis set was clearly fortuitous. From these results it should be obvious that errors due to basis set incompleteness can often rival those due to inadequate electron correlation. As shown above, the hybrid DFT method B3LYP can be competitive in accuracy to more computationally expensive methods, such as CCSD(T). In fact, recent benchmark calculations [30] have shown that for the calculation of thermochemical quantities like enthalpies of formation, B3LYP exhibits average errors of only 1–5 kcal/mol. While these are still more than a factor of two larger than the accuracy obtainable with coupled cluster methods, the much lower computational cost of B3LYP makes it a very attractive alternative. The accuracy of equilibrium bond lengths and harmonic vibrational frequencies calculated by B3LYP have also been shown to be very satisfactory. The accurate calculation of some molecular properties, however, is still a great challenge to hybrid DFT methods. In particular, reaction activation energies are often too small and van derWaals interactions can be qualitatively incorrect. Correcting these deficiencies is the goal of many second generation hybrid DFT functionals.17 We have seen that inclusion of electron correlation often improves the E¯ versus R curve because it allows for variable ionic-covalent character in the wavefunction. TABLE 11-10  Dependence on Basis Set Choice for the CCSD and CCSD(T) Properties of H2 and N2, respectively H2/CCSD N2/CCSD(T) re(Å) .e(cm-1) De (kcal/mole) re(Å) .e(cm-1) De (kcal/mole) STO-3G 0.735 5002 128.1 1.190 2145 147.8 6-31G** 0.738 4504 105.9 1.120 2342 201.6 cc-pVDZ 0.761 4383 103.6 1.119 2339 200.6 cc-pVTZ 0.743 4409 108.4 1.104 2346 216.5 cc-pVQZ 0.742 4403 109.1 1.100 2356 222.9 cc-pV5Z 0.742 4405 109.3 1.099 2360 225.1 Expt. [28] 0.741 4403 109.5 1.098 2359 228.4 17See, for instance, Zhao et al. [31] and Xu et al. [32]. Section 11-16 Examples of Ab Initio Calculations 381 However, there are some diatomic molecules that maintain a high degree of ionic character even when the nuclei are quite widely separated. NaCl is an example. For such systems, the Hartree–Fock energy curve is quite nearly parallel to the exact energy curve throughout the minimum energy region (i.e., the correlation energy is almost constant) and the theoretical values of spectroscopic constants agree quite well with experimental values. (In vacuo, an electron ultimately transfers from Cl- to Na+, and the experimental curve leads to neutral dissociation products, whereas the HF curve does not. This theoretical error affects the curve only at large R, however, and so has little effect on spectroscopic constants.) Schaefer [13] has reviewed this situation. Of course, ab initio calculations have been performed on molecular systems much larger than the molecules referred to above. However, as one moves to molecules having four or more nuclei, one encounters a new difficulty: Integrals now appear that have the form ab | cd=.a(1).b(2) |1/r12|.c(1).d (2) (11-52) where .a is a basis function located on nucleus a, etc. Such integrals have basis functions on four different nuclei and are referred to as four-center integrals. If the basis functions . are STOs, such integrals are relatively slow to evaluate on a computer. If they are gaussian functions, the computation is much faster, and this is the main reason for using gaussian basis functions. But the number of such integrals becomes enormous for a reasonable basis set and a medium sized molecule. In fact, the number of such integrals grows as the fourth power of the number of basis functions. Thus, replacing each STO by, say, three gaussian functions, will lead to 34 times as many integrals to evaluate. Even though such integrals can be evaluated very rapidly, we eventually come to molecules of such a size that the sheer number of integrals makes for a substantial computing effort. The efficient calculation of molecular integrals continues to be an active research area, however, and newtechniques have nowdiminished the importance of this bottleneck with reasonably sized gaussian basis sets on systems up to hundreds of atoms. Modern quantum chemical programs have made high-quality calculations on reasonably large molecules tractable, but one is always balancing the level of accuracy against the computer time needed to achieve it. While a Hartree–Fock calculation on benzene with a cc-pVTZ basis set (264 contracted gaussian functions) might require just 4 minutes to complete on a given computer, inclusion of electron correlation at the MP2, CCSD, and CCSD(T) levelswould require an additional 0.1, 4.3, and 11 times 4 minutes, respectively. Numerous calculations have been reported for barriers to internal rotation in various molecules. The theoretical barriers agree best with experiment for molecules having threefold symmetry in the rotor. Self-consistent-field values are compared with experimental barrier values inTable 11-11. In every case, the theoretical energy curve predicts the correct stable conformation and even does reasonably well at predicting barrier height. The disagreement between different computed values of the barrier for the same molecule reflects differences in basis sets and, sometimes, differences in choices for bond length and angle made by different workers. The evidence to date suggests that ab initio calculations approaching the HF limit will ordinarily be within 20% of the experimental barrier. Even this level of accuracy is useful because experimental measurements of barriers in transient molecules or for excited molecules are often very rough, ambiguous, or nonexistent. Given the favorable cost and relative accuracy of 382 Chapter 11 The SCF-LCAO-MO Method and Extensions TABLE 11-11  Internal Rotation Barriers from Experiment and as Calculated by the LCAO-MO-SCF Method Barrier (kcal/mole) Molecule SCF Experiment Reference CH3–CH3 2.58 2.88 [33] 2.88 — [34] CH3–NH2 1.12 1.98 [35] 2.02 — [34] CH3–OH 1.59 1.07 [34] CH3–CH2F 2.59 3.33 [33] CH3–N=O 1.05 1.10 [36] CH3–CH=CH2 1.25 1.99 [37] cis-CH3–CH=CFH 1.07 1.06 [37] trans-CH3–CH=CFH 1.34 2.20 [37] CH3–CH=O 1.09 1.16 [38] DFT approaches compared to HF, even higher quality results might be expected with the use of methods such as B3LYP; hence, DFT is often now the method of choice for calculations on medium to large organic molecules. A large number of ab initio calculations have been made on clusters of molecules. Many of these have sought to delineate the distance and angle dependence of the hydrogen bond strength between molecules like water or hydrogen fluoride. Xantheas et al. [39] have reported large basis set MP2 calculations on small water clusters, (H2O)n, where n ranged from 2–6. These calculations predict that there are four distinct isomers of the water hexamer (n = 6) whose relative energies lie within ~1 kcal/mole of each other. These kinds of results are of great usefulness in defining new effective interaction potentials involving water that can be used in large-scale molecular simulations of solvation phenomena. Re et al. [40] have calculated the structures and relative energies of sulfuric acid solvated by 1–5 water molecules using the B3LYP method to provide a fundamental understanding of acid ionization. In addition to investigating the interaction of water with both the cis and trans conformers of H2SO4, they found that just five water molecules were sufficient tomake dissociation intoHSO-4 andH3O+ energetically favorable. The field of materials science is also benefitting from ab initio calculations, and studies of metal clusters and their absorbates are currently areas of high interest. A great deal of attention has been given to the calculation by ab initio methods of energy surfaces for chemical reactions. For many years, such efforts were limited to reactions, such as D+H2.HD+H, which involve only a small number of electrons and nuclei. Much more complicated systems are now being explored. In setting out to perform such a calculation, one likes to have some idea of whether the correlation energy of the system will change significantly with nuclear configuration. If it does not, then a Hartree–Fock or MCSCF calculation will parallel the true energy surface. If the correlation energy does change, it is necessary to include some treatment of electron correlation in the calculation. Section 11-16 Examples of Ab Initio Calculations 383 As a rough rule of thumb, one expects the correlation energy to change least when the reactants, the intermediate or transition state complex, and the products are all closedshell systems, hence all approximately equally well described by a single-determinantal wavefunction. Some calculations on SN2 and radical reactions are summarized in Table 11-12. It can be seen that the SN2 reactions, which do involve closed-shell systems in the three stages mentioned above, are fairly insensitive to the inclusion of CI, whereas the radical reactions undergo extensive change of correlation energy. The determination of the potential energy surface for the unimolecular rearrangement HOClHClO by Peterson et al. [43] provides an example of a very accurate and exhaustive calculation on a fairly small molecule. Because there are only three nuclei, there are only three structural variables to explore, so the number of calculations needed to map out the surface is not too large. (Note that, with three geometric variables, the energy “surface” is really a four-dimensional hypersurface.) These authors were also interested in the reactions occurring on this surface, i.e., Cl+OH.HCl+O and Cl +OH.ClO+H, which required a global representation of the surface that was constructed from over 1500 individual energies. Since the full energy surface involves bond breaking processes, MCSCFand MRCISD methods were utilized. Accurate relative energetics between HOCl, HClO, and the various dissociation asymptotes were obtained by carrying out calculations with a series of three correlation consistent basis sets at each geometry. This produced an approximate complete basis set (CBS) MRCISD energy surface. At the MRCISD CBS limit, HOCl was found to be more stable than HClO by 53.7 kcal/mole and the barrier for HOCl.HClO was predicted to be 73.5 kcal/mole above the HOCl minimum. After determining an analytical representation of this surface from the individual energies, these authors carried out calculations of the full anharmonic vibrational spectrum of HOCl and HClO by solving the Schr¨odinger equation for nuclear motion. The HClO molecule has not yet been experimentally observed, but these calculations predict that the lowest three vibrational levels of this species lie below its dissociation threshold, so it should be detectable. TABLE 11-12  Reaction Barrier Energies for Reactions as Calculated by ab Initio Methods Reaction barrier (kcal/mole) Reactant Transition Product Reaction type SCF CI (no.config.) Reference H- +CH4 (CH5)- CH4 + H- SN2 59.3 55.2(6271) [41] F- +CH3F (FCH3F)- CH3F + F- SN2 5.9 5.9(26910) [41] H• +CH4 CH•5 CH•3 + H2 Radical abstraction (axial) 35.2 18 (692) [42] H• +CH4 CH•5 CH4 + H• Radical exchange (inversion) 63.7 41.7(692) [42] 384 Chapter 11 The SCF-LCAO-MO Method and Extensions The decisions regarding basis set and level of correlation can be daunting and in the past this sometimes discouraged nonspecialists from taking advantage of ab initio methods. However, there are now a wide range of programs that are available, which have made ab initio calculations amenable to theoreticians and experimentalists alike. The best known of these is undoubtedly the GAUSSIAN series of programs originally developed by the group of J.A. Pople. In this and other programs, one can conveniently choose from a large variety of available basis sets and methods to carry out energy evaluations or geometry optimizations and harmonic frequency calculations. These programs have brought about a revolution in the way that chemical research is done. For small molecules (~1–5 nonhydrogen atoms) ab initio methods are sometimes more precise and reliable than experiment, especially for unstable systems. The saga of the energy difference between ground and excited CH2 is one of the best known of these experimental–theoretical confrontations.18 In summary, ab initio calculations provide useful data on bond lengths and angles, molecular conformation and internal rotation barriers, for ground and excited states of molecules. They are also very useful for calculating accurate thermochemistry, ionization energies, oscillator strengths, dipole moments (as well as other one-electron properties) and excitation energies. If one has access to large blocks of computer time, ab initio calculations can reveal the nature of energy surfaces pertaining to chemical reactions or molecular associations, as in fluids. The accuracy of the calculation and the magnitude of the system are limited ultimately by computer speed and capacity. 11-17 Approximate SCF-MO Methods At the beginning of this chapter it was stated that ab initio calculations require exact calculation of all integrals contributing to the elements of the Fock matrix, but we have seen that, as we encounter systems with more and more electrons and nuclei, the number of three- and four-center two-electron integrals becomes enormous, driving the cost of the calculation out of the reach of most researchers. This has led to efforts to find sensible and systematic simplifications to the LCAO-MO-SCF method—simplifications that remain within the general theoretical SCF framework but shorten computation of the Fock matrix. Since many of the multicenter two-electron integrals in a typical molecule have very small values, the obvious solution to the difficulty is to ignore such integrals. But we wish to ignore them without having to calculate them to see which ones are small since, after all, the reason for ignoring them is to avoid having to calculate them. Furthermore, we want the selection process to be linked in a simple way to considerations of basis set. That is, when we neglect certain integrals, we are in effect omitting certain interactions between basis set functions, which is equivalent to omitting some of our basis functions part of the time. It is essential that we know exactly what is involved here, or we may obtain strange results such as, for example, different energies for the same molecule when oriented in different ways with respect to Cartesian coordinates. A number of variants of a systematic approach meeting the above criteria have been developed by Pople and co-workers, and these are nowwidely used. The approximations 18See Goddard [44] and Schaefer [45]. Section 11-17 Approximate SCF-MO Methods 385 are based on the idea of neglect of differential overlap between atomic orbitals in molecules. Differential overlap dS between twoAOs, .a and .b, is the product of these functions in the differential volume element dv: dS =.a(1).b(1)dv (11-53) The only way for the differential overlap to be zero in dv is for .a or .b, or both, to be identically zero in dv. Zero differential overlap (ZDO) between .a and .b in all volume elements requires that .a and .b can never be finite in the same region, that is, the functions do not “touch.” It is easy to see that, if there is ZDO between .a and .b (understood to apply in all dv), then the familiar overlap integral S must vanish too. The converse is not true, however. S is zero for any two orthogonal functions even if they touch. An example is provided by an s and a p function on the same center. It is a much stronger statement to say that .a and .b have ZDO than it is to say they are orthogonal. Indeed, it is easy to think of examples of orthogonal AOs but impossible to think of any pair ofAOs separated by a finite or zero distance and having ZDO. Because AOs decay exponentially, there is always some interpenetration. The attractive feature of the ZDO approximation is that it causes all three- and fourcenter integrals to vanish. Thus, in a basis set of AOs . having ZDO, the integral .a(1).b(2) |1/r12|.c(1).d (2) will vanish unless a =c and b =d. This arises from the fact that, if a =c, .*a (1).c(1) is identically zero, and this forces the integrand to vanish everywhere, regardless of the value of (1/r12).*b (2).d (2). It is not within the scope of this book to give a detailed description or critique of the numerous computational methods based on ZDO assumptions. An excellent monograph [46] on this subject including program listings is available. Some of the acronyms for these methods are listed in Table 11-13. In general, these methods have been popular because they are relatively cheap to use and because they predict certain properties (bond length, bond angle, energy surfaces, electron spin resonance hyperfine splittings, molecular charge distributions, dipole moments, heats of formation) reasonably well. However, they generally do make use of some parameters evaluated from experimental data, and some methods are biased toward good predictions of some properties, while other methods are better for other properties. For a given type of problem, one must exercise judgment in choosing a method. As an example of the sort of chemical system that becomes accessible to study using such methods, we cite the valence-electron CNDO/2 calculations of Maggiora [56] on free base, magnesium, and aquomagnesium porphines. Such calculations enable us to examine the geometry of the complex (i.e., is the metal ion in or out of the molecular plane, and how is the water molecule oriented?), the effects of the metal ion on ionization energies, spectra, and orbital energy level spacings, and the detailed nature of charge distribution in the system. Use of a combination of methods is often convenient. Novoa andWhangbo [57] studied theoretically the relative stabilities of di- and triamides in various hydrogen-bonded and nonhydrogen-bonded conformations, in both the absence and presence of solvent (CH2Cl2) molecules. There are many structural parameters to optimize in each of the conformations, and so high-level ab initio calculations for energy minimization of each class of structure would be prohibitively expensive. Instead, AM1 was used to determine the optimum geometry for each configuration, and then ab initio calculations 386 Chapter 11 The SCF-LCAO-MO Method and Extensions TABLE 11-13  Acronyms for Common Approximate SCF Methods Acronym Description CNDO/1 Complete neglect of differential overlap. Parametrization Scheme no. 1 (Pople and Segal [47]). CNDO/2 Parametrization scheme no. 2. Considered superior to CNDO/1 (Pople and Segal [48]). CNDO/BW Similar to above with parameters selected to give improved molecular structures and force constants. (See Pulfer and Whitehead [49] and references therein.) INDO Intermediate neglect of differential overlap. Differs from CNDO in that ZDO is not assumed between AOs on the same center in evaluating one-center integrals. This method is superior to CNDO methods for properties, such as hyperfine splitting, or singlet-triplet splittings, which are sensitive to electron exchange (Pople et al. [50]). MINDO/3 Modified INDO, parameter scheme no. 3. Designed to give accurate heats of formation (Bingham et al. [51] and also Dewar [52]). NDDO Neglect of diatomic differential overlap. Assumes ZDO only between AOs on different atoms (Pople et al. [53]). MNDO Modified neglect of diatomic overlap. A semiempirically parametrized version of NDDO.Yields accurate heats of formation and many other molecular properties, but fails to successfully account for hydrogen bonding (Dewar and Thiel [54]). AM1 “Austin Model 1.” A more recent parametrization of NDDO that overcomes the weakness of MNDO in that it successfully treats hydrogen bonding. (Dewar et al. [55].) (e.g., 6-31G** with MP2) were done for a few near-optimum geometries for each conformation to check the AM1 results. Additional helpful information on standard programs available at ab initio and semiempirical levels—where to get them, how to use them, what they have been used for—is available in the very well-written reference handbook by Clark [58]. 11-17.A Problems 11-1. Use the data in Table 11-3 to calculate the theoretical transition energies for Ne+ when 1s and 2s electrons are excited into the 2p level. The experimental values are 2p.2s, 0.989 a.u.; 2p.1s, 31.19 a.u. 11-2. Use the data in Table 11-1 to estimate separately the errors in ionization energies for the three states due to a) omission of electron correlation. b) failure to allow electronic relaxation. 11-3. In Section 11-11, it is argued that neglect of electron correlation and electronic relaxation in setting I 0 k =-k causes errors of opposite sign that partly cancel. Section 11-17 Approximate SCF-MO Methods 387 Would this also occur whenKoopmans’theorem is used to predict electron affinities? Why? 11-4. Demonstrate that, if D1 =  a c b d  and D2 =  a e b f  then D1 +.D2 =  a c+.e b d+.f  11-5. A singly excited configuration .1 differs by one orbital from the ground state .0 and also by one orbital from certain doubly excited configurations .2. Brillouin’s theorem gives .0|Hˆ |.1=0, but not .1|Hˆ |.2=0. Where does the attempted proof to show that .1|Hˆ |.2=0 break down? 11-6. Show that, if . = c0.0 + c1.1 + c2.2 +···+cn.n, and if . is to be an eigenfunction of Aˆ with eigenvalue a1, then it is necessary that all the .i(i = 0, . . . , n) also be eigenfunctions of Aˆ with eigenvalues a1. 11-7. How many distinct four-center coulomb and exchange integrals result when one has four nuclei, each being the site of five basis functions? Make no assumptions about symmetry or basis function equivalence or electron spin. 11-8. For a homonuclear diatomic molecule, which of the following singly excited configurations would be prevented for reasons of symmetry from contributing to a CI wavefunction for which the main “starting configuration” is 1s2 g 1s2 u 2s2 g 1p4 u ? a) 1s2 g 1s2 u 2s2 g 1p3 u 1pg (i.e., 1pu.1pg) b) 2sg.3sg c) 2sg.1pg d) 1sg.3sg 11-9. Write down the hamiltonian operator for electrons in the water molecule. Use summation signs with explicit index ranges. Use atomic units. 11-10. An SCF calculation on ground state H2 at R =1.40 a.u. using a minimal basis set gives a sg and a su MO having energies sg =-0.619 a.u. su =+0.401 a.u. The nonvanishing two-electron integrals over these MOs are   sg(1)sg(2)(1/r12)sg(1)sg(2)dv(1)dv(2) = 0.566 a.u.   sg(1)su(2)(1/r12)sg(1)su(2)dv(1)dv(2) = 0.558 a.u.   sg(1)su(2)(1/r12)sg(2)su(1)dv(1)dv(2) = 0.140 a.u.   su(1)su(2)(1/r12)su(1)su(2)dv(1)dv(2) = 0.582 a.u. a) Write down the Slater determinant for the ground state of H2. b) Calculate the SCF electronic energy for H2 at R =1.40 a.u. 388 Chapter 11 The SCF-LCAO-MO Method and Extensions c) Calculate the total (electronic plus nuclear repulsion) energy for H2. d) What is the bond energy for H2 predicted by this calculation, assuming that the minimum total energy occurs at R =1.40 a.u.? e) Estimate the (vertical) ionization energy for H2. f) What is the value of the kinetic-plus-nuclear-attraction energy for one electron in ground-state H2 according to this calculation? References [1] T. Koopmans, Physica 1, 104 (1933). [2] D.W. Smith and O.W. Day, J. Chem. Phys. 62, 113 (1975). [3] A.W. Potts andW. C. Price, Proc. Roy. Soc. (London) A326, 181 (1972). [4] T. H. Dunning, Jr., R. M. Pitzer, and S. Aung, J. Chem. Phys. 57, 5044 (1972). [5] J. S. Binkley and J. A. Pople, Int. J. Quantum Chem. 9, 229 (1975). [6] C. Møller and M. S. Plesset, Phys. Rev. 46, 618 (1934). [7] P. Hohenberg andW. Kohn, Phys. Rev. 136, B864 (1964). [8] R. G. Parr and W. Yang, Density Functional Theory of Atoms and Molecules. Oxford University Press, Oxford, UK, 1989. [9] W. Kohn and L. J. Sham, Phys. Rev. 140, A1133 (1965). [10] A. D. Becke, Phys. Rev. A. 38, 3098 (1988). [11] C. Lee,W.Yang and R. G. Parr, Phys. Rev. B 37, 785 (1988); B. Miehlich, A. Savin, H. Stoll, and H. Preuss, Chem. Phys. Lett. 157, 200 (1989). [12] A. D. Becke, J. Chem. Phys. 98, 5648 (1993); P. J. Stephens, F. J. Devlin, C. F. Chabalowski, and M. J. Frisch, J. Phys. Chem. 98, 11623 (1994). [13] H. F. Schaefer, III, The Electronic Structure of Atoms and Molecules: A Survey of Rigorous Quantum-Mechanical Results. Addison-Wesley, Reading, Massachusetts, 1972. [14] W. J. Hehre, L. Radom, P. v.R. Schleyer and J. A. Pople, Ab initio Molecular Orbital Theory.Wiley, NewYork, 1986. [15] K. Raghavachari, Annu. Rev. Phys. Chem. 42, 615 (1991). [16] C. Roetti and E. Clementi, J. Chem. Phys. 60, 4725 (1974). [17] A. Veillard and E. Clementi, J. Chem. Phys. 49, 2415 (1968). [18] P. S. Bagus, Phys. Rev. 139, A619 (1965). [19] A.W.Weiss, Phys. Rev. 162, 71 (1967). [20] W. C. Ermler and C.W. Kern, J. Chem. Phys. 61, 3860 (1974). Section 11-17 Approximate SCF-MO Methods 389 [21] P. E. Cade andW. M. Huo, J. Chem. Phys. 47, 614 (1967). [22] W. J. Stevens, G. Das, and A. C.Wahl, J. Chem. Phys. 61, 3686 (1974). [23] S. Green, J. Chem. Phys. 54, 827 (1971). [24] S. Green, J. Chem. Phys. 57, 4694 (1972). [25] S. Green, J. Chem. Phys. 57, 2830 (1972). [26] S. Green, J. Chem. Phys. 56, 739 (1972). [27] W. J. Stevens, G. Das, A. C. Wahl, M. Krauss, and D. Neumann, J. Chem. Phys. 61, 3686 (1974). [28] K. P. Huber and G. Herzberg, Molecular Spectra and Molecular Structure IV. Constants of Diatomic Molecules. Van Nostrand, Princeton, 1979. [29] T. H. Dunning, Jr. and K. A. Peterson, J. Chem. Phys. 108, 4761 (1998). [30] V. N. Staroverov and G. E. Scuseria, J. Tao, and J. P. Perdew, J. Chem. Phys. 119, 12129 (2003); N. X. Wang and A. K. Wilson, J. Chem. Phys. 121, 7632 (2004). [31] Y. Zhao, B. J. Lynch, and D. G. Truhlar, J. Phys. Chem. A 108, 2715 (2004). [32] X. Xu, Q. Zhang, R. P. Muller, andW. A. Goddard III, J. Chem. Phys. 122, 014105 (2005). [33] L. C. Allen and H. Basch, J. Amer. Chem. Soc. 93, 6373 (1971). [34] L. Pedersen and K. Morokuma, J. Chem. Phys. 46, 3741 (1967). [35] W. H. Fink and L. C. Allen, J. Chem. Phys. 46, 2261 (1967). [36] P. A. Kollman and L. C. Allen, Chem. Phys. Lett. 5, 75 (1970). [37] E. Scarzafava and L. C. Allen, J. Amer. Chem. Soc. 93, 311 (1971). [38] R. B. Davidson and L. C. Allen, J. Chem. Phys. 54, 2828 (1971). [39] S. S. Xantheas, C. J. Burnham, and R. J. Harrison, J. Chem. Phys. 116, 1493 (2002). [40] S. Re,Y. Osamura, and K. Morokuma, J. Phys. Chem. A 103, 3535 (1999). [41] A. Dedieu, A. Veillard, and B. Roos, Jerusalem Symp. Quantum Chem. Biochem. 6, 371 (1974). [42] K. Morokuma and R. E. Davis, J. Amer. Chem. Soc. 94, 1060 (1972). [43] K. A. Peterson, S. Skokov, and J. M. Bowman, J. Chem. Phys. 111, 7446 (1999). [44] W. A. Goddard, III, Science 227, 917 (1985). [45] H. F. Schaefer, III, Science 231, 1100 (1986). 390 Chapter 11 The SCF-LCAO-MO Method and Extensions [46] J.A. Pople and D. L. Beveridge, Approximate Molecular Orbital Theory. McGraw- Hill, NewYork, 1970. [47] J. A. Pople and G. A. Segal, J. Chem. Phys. 43, S136 (1965). [48] J. A. Pople and G. A. Segal, J. Chem. Phys. 44, 3289 (1966). [49] J. D. Pulfer and M. A. Whitehead, Can. J. Chem. 51, 2220 (1973). [50] J. A. Pople, D. L. Beveridge, and P. A. Dobosh, J. Chem. Phys. 47, 2026 (1967). [51] R. C. Bingham, M. J. S. Dewar, and D. H. Lo, J. Amer. Chem. Soc. 97, 1285 (1975). [52] M. J. S. Dewar, Science 187, 1037 (1975). [53] J. A. Pople, D. P. Santry, and G. A. Segal, J. Chem. Phys. 43, S129 (1965). [54] M. J. S. Dewar andW. Thiel, J. Amer. Chem. Soc. 99, 4899 (1977). [55] M. J. S. Dewar, E. G. Zoebisch, E. F. Healy, and J. J. P. Stewart, J. Amer. Chem. Soc. 107, 3902 (1985). [56] G. M. Maggiora, J. Amer. Chem. Soc. 95, 6555 (1973). [57] J. J. Novoa and M.-H. Whangbo, J. Amer. Chem. Soc. 113, 9017 (1991). [58] T. Clark, A Handbook of Computational Chemistry.Wiley, NewYork, 1985. Chapter 12 Time-Independent Rayleigh–Schr ¨ odinger Perturbation Theory 12-1 An Introductory Example Imagine a city having one million resident wage earners. The city government plans to raise additional revenue by assessing each such resident a wage tax. This tax will not apply to wage earners residing in suburbs. The government estimates that a $10 assessment will bring in new revenues of $10 million. This estimate assumes that the city population before imposition of the tax will hold after the tax is imposed as well. But the tax will produce a slight change, or perturbation, in the economic climate of the city. It is true that the tax is small, and so few people are likely to move to the suburbs as a result of it. Therefore, it is probably fairly accurate to use the population of the city before the perturbation to calculate the change in revenue brought about by the perturbation. But, if the perturbation were large, say $1000 a head, we would expect a substantial migration of wage earners to the suburbs, and so the estimate produced by using the original unperturbed population would contain substantial error. Corrections should be made, therefore, to account for population changes produced by the perturbation. This use of the unperturbed population to calculate the change in revenue is a crude example of a certain level of estimation (called “first order”) in Rayleigh–Schr¨odinger perturbation theory.1 We will now proceed to develop the theory more formally in the context of wavefunctions and energies. The above example has been presented to encourage the reader to anticipate that there is a lot of simple good sense in the results of perturbation theory even though the mathematical development is rather cumbersome and unintuitive. 12-2 Formal Development of the Theory for Nondegenerate States Perturbation theory involves starting with a system with known hamiltonian, eigenvalues, and eigenfunctions, and calculating the changes in these eigenvalues and eigenfunctions that result from a small change, or perturbation, in the hamiltonian for the 1Other perturbation methods exist, but the Rayleigh–Schr¨odinger (R-S) theory is the oldest and the most widely used in quantum chemistry. 391 392 Chapter 12 Time-Independent Rayleigh–Schr ¨ odinger Perturbation Theory system. We restrict the discussion to stationary states of systems having hamiltonians that are not time dependent. Let the known, unperturbed system have H0 as hamiltonian, and let the eigenfunctions .i be orthonormal. Then H0.i = Ei.i (12-1)  .*i .j dt = dij (12-2) and the functions . form a complete set as discussed in Chapter 6. We are interested in the system with hamiltonian H =H0 +.H (12-3) where .H is the perturbation.2 (The parameter . is a scalar quantity that will be convenient in the mathematical development of the theory. When the derivation is complete, we will set . equal to unity so that it no longer appears explicitly in any formula, and then H must account entirely for the perturbation.) The eigenvalues and eigenfunctions of the perturbed hamiltonian H are unknown. Let us symbolize them as W and f, respectively. Then Hfi =Wifi (12-4) It is clear that, when .=0, then H =H0, fi =.i and Wi =Ei. As . is increased from zero, Wi and fi change in (we expect) a continuous way. In other words, Wi and fi are continuous functions of the variable parameter ., and they are known at the particular value .=0. Therefore, we can expand them3 as series in powers of . about the point .=0. Thus, for a given state i,4 Wi =.0W(0) i +.1W(1) i +.2W(2) i +.3W(3) i +··· (12-5) Here we must remember that . is a variable and W(0) i , W(1) i , etc. are constants. The superscript in parentheses is simply a label to tell us for which power of . this constant is the coefficient. Since we know that Wi =Ei when .=0, we see at once that W(0) i in Eq. (12-5) is equal to Ei . Our problem is to evaluate W(1) i , W(2) i , etc. It is traditional to call W(0) i (or Ei ) the unperturbed energy or, sometimes, the energy to zeroth order. .W(1) i (which is just W(1) i after . is set equal to unity) is the first-order correction to the energy, and W(0) i +.W(1) i is the energy to first order. .2W(2) i , .3W(3) i , . . . , etc. are the second-, third-, etc., order corrections to the energy. Normally, for expansion in a series to be useful, it is necessary for the series to converge at a reasonable rate. In most simple applications of perturbation theory, only a few orders of correction are made. Thus, one very commonly reads of energies calculated “to first order,” or “to second order.” Calculations to much higher orders are also made, but these are not as common. 2Some treatments expand H as H0 +.H +.2H+··· and ultimately achieve working formulas that appear different from those we will achieve. In fact, they are equivalent. For an example of this alternative formulation, see Pauling andWilson [1, Chapter 6]. 3In effect, we assume fi and W to be analytic functions of . in the range 0=.=1. 4It is important to recognize that henceforth the subscript i will refer to the state that we are studying as a function of .. Section 12-2 Formal Development of the Theory for Nondegenerate States 393 In precisely the same manner, we can expand the unknown eigenfunction fi as a power series in .: fi =f(0) i +.f(1) i +.2f(2) i +.3f(3) i +··· (12-6) Since fi is a function of particle coordinates, f(0) i , f(1) i , etc. are also functions, but they are invariant to changes in .. Again, it is clear that f(0) i = .i . f(0) i (or .i) is the unperturbed or zeroth-order wavefunction, .f(1) i is the first-order correction to the wavefunction, etc. Now we substitute Eqs. (12-3), (12-5), and (12-6) into (12-4) and obtain (H0 +.H) f(0) i +.f(1) i +.2f(2) i +··· =W(0) i +.W(1) i +.2W(2) i +···f(0) i +.f(1) i +.2f(2) i +··· (12-7) The variable in Eq. (12-7) is ., and each power of . is linearly independent of all other powers of .. As indicated in Section 3-4D, this means that Eq. (12-7) can be satisfied for all values of . only if it is satisfied for each power of . separately. Collecting terms having the zeroth power of . gives H0f(0) i =W(0) i f(0) i (12-8) However, we have already recognized that f(0) i =.i, W(0) i =Ei (12-9) and Eq. (12-8) is simply a restatement of Eq. (12-1). Collecting terms from Eq. (12-7) containing . to the first power we obtain . Hf(0) i +H0f(1) i -W(0) i f(1) i -W(1) i f(0) i =0 (12-10) This equality must hold for any value of ., so the term in parentheses is zero. Hence, rearranging and making use of Eqs. (12-9), we have the first-order equation H -W(1) i .i +(H0 -Ei)f(1) i =0 (12-11) Let us multiply this from the left by .*i and integrate:  .*i H.i dt -W(1) i  .*i .i dt + .*i H0f(1) i dt -Ei  .*i f(1) i dt =0 (12-12) Using the hermitian property of H0, it is easy to show that the third and fourth terms cancel, leaving W(1) i = .*i H.i dt (12-13a) 394 Chapter 12 Time-Independent Rayleigh–Schr ¨ odinger Perturbation Theory Thus we have arrived at an expression for the first-order correction to the energy in terms of known quantities. It is the expectation value for the perturbation operator calculated using the wavefunction of the unperturbed system. The analogy between this formula and the use of the population of the unperturbed city to calculate additional revenues from a new tax should be apparent. H corresponds to the tax per wage earner, and .*i .i dt corresponds to the sum of wage earners in the city before the tax was imposed. EXAMPLE 12-1 A one-dimensional box potential is perturbed so that it is raised by a constant amount, d, in the left half of the box, and lowered by d in the right half. What is the first-order change in energy for the lowest-energy state .1? for .2? SOLUTION  H is antisymmetric for reflection through the box center, and .2 1 is symmetric, so the integral givingW(1) 1 equals zero by symmetry. The same argument applies to .2. Another way to argue is to recognize that .2 n is symmetric in the box (for all n), which means that half of the distribution responds to the energy increase and half to the equal energy decrease, so the net energy change to first order is zero for all states.  Equation (12-13a) can be written in bra-ket notation (see Appendix 9): W(1) i =.i H .i  (12-13b) or in the notation wherein an integral is indicated by affixing subscripts to the operator (i.e., as a matrix element) W(1) i =H ii (12-13c) In most discussions of perturbation theory one of these alternative notations is used. We will continue our formal development from this point using bra-ket notation for integrals. To find an expression for f(1) i , the first-order correction to the wavefunction for the ith state, we first recognize that we can expand f(1) i in terms of the complete set of eigenfunctions .: f(1) i = j c(1) ji .j (12-14) The summation symbol suggests that . is a discrete set of functions. This need not be true. Contributions from functions whose eigenvalues are in a continuumwould require integration rather than summation. However, we will use the sum symbol since most actual applications of perturbation theory in quantum chemistry invoke only discrete functions. We now insert Eq. (12-14) into (12-11) to obtain H -W(1) i .i +(H0 -Ei) j c(1) ji .j =0 (12-15) Section 12-2 Formal Development of the Theory for Nondegenerate States 395 Multiplying from the left by .*k and integrating yields .k|H|.i-W(1) i .k|.i+ j c(1) ji .k|H0|.j-Ei .k|.j  =0 (12-16) If k=i, this reduces to Eq. (12-13), as was shown above. If k =i, the terms in the sum all vanish except when j =k. Thus, .k|H|.i+c(1) ki .k|H0|.k-Ei .k|.k =0, (k=i) (12-17) or c(1) ki = .k|H|.i  Ei -Ek , k=i (12-18) Inserting this into Eq. (12-14) gives an expression for f(1) i : f(1) i = j =i .j |H|.i  Ei -Ej .j (12-19) This formula prescribes the way the first-order correction to the wavefunction is to be built up from eigenfunctions of the unperturbed system. We discuss this formula in detail later when considering an example. Note that, if the perturbed state of interest (the ith) is degenerate with another state (the lth), Eqs. (12-18) and (12-19) blow up for k or j equal to l unless .l |H|.i  vanishes. Therefore, we restrict the theoretical discussion of this section to states of interest that are nondegenerate and discrete. (States of the system other than the ith may be degenerate or continuum states, however.) If we extract the terms containing .2 from Eq. (12-7) and proceed, in the same way as above, to expand f(2) i as a linear combination of unperturbed eigenfunctions, f(2) i = j c(2) ji .j (12-20) we arrive, after some manipulation, at the following formula for W(2) i : W(2) i =  j (=i) .i |H|.j .j |H|.i  Ei -Ej (12-21) Comparing this with Eq. (12-18) allows us to write W(2) i =  j (=i)  c(1) ji  2 (Ei -Ej ) (12-22) or comparing Eqs. (12-19) and (12-21) gives W(2) i =.i |H|f(1) i  (12-23) 396 Chapter 12 Time-Independent Rayleigh–Schr ¨ odinger Perturbation Theory The formula that emerges for c(2) ji of Eq. (12-20) is c(2) ji =  k(=i) .j |H|.k.k|H|.i  Ei -Ek Ei -Ej - .i |H|.i .j |H|.i  (Ei -Ej )2 , i=j (12-24) The boxed equations are “working equations” since they enable us to calculate the correction terms from the known eigenvalues and eigenfunctions of the unperturbed system. Equation (12-23) is interesting because it indicates that the second-order correction to the energy is calculable if we know the zeroth and first-order functions.5 Comparing Eqs. (12-13) and (12-23) shows that, whereas the first-order correction for the energy is the average value for the perturbation operator with the unperturbed wavefunction, the second-order correction is an interaction element between two functions, and not an average value in the usual sense. Higher-order correction terms may be found by proceeding in a similar way with .3,.4, etc. terms from Eq. (12-7). The equations become progressively more cumbersome and will be of no interest to us for applications to be considered in this book. Having made use of the parameter . to keep terms properly sorted, we can now dispense with it by setting it equal to unity. Then H = H0 +H (12-25) Wi = Ei +W(1) i +W(2) i +··· (12-26) fi = .i +f(1) i +f(2) i +··· (12-27) Notice that fi is not normalized. The normalization coefficient needed will depend on the order to which fi has been calculated.6 12-3 A Uniform Electrostatic Perturbation of an Electron in a “Wire” 12-3.A Description of the System Suppose that a small uniform electric field is applied to an electron somehowconstrained to move on a line segment of length L. In the absence of this field, we assume the electron states to be described by the one-dimensional “box” wavefunctions discussed 5L¨owdin [2] has shown that, if we know all the fi ’s up to f (n) i , we can calculate all theWi ’s up to and including W (2n+1) i . 6One can guarantee normality up to second order in f2 i by setting c (1) ii =0, c (2) ii =-1 2k c (1) ki  2 (see Schiff [3]). Section 12-3 A Uniform Electrostatic Perturbation of an Electron in a “Wire” 397 Figure 12-1  Potential of an electron in a line segment of length L in the presence of a uniform electric field. The perturbation is “small” if the potential change u across L is small compared to p2/2L2, the energy in atomic units of the lowest unperturbed state. in Chapter 2. The electric field will be treated as a perturbation. The potential energy of the electron is sketched in Fig. 12-1. A uniform field produces a constant gradient (in potential energy) along the line segment. For purposes of discussion, we let the perturbation rise from zero at x =0 to u at x =L, but we shall see later that there is a degree of arbitrariness here. The perturbation, then, is given by H =ux/L, 0=x =L (12-28) where u, x, and L are measured in atomic units. 12-3.B The Energy to First Order We now ask, what is the effect, to first order, of H on the energy of the lowestenergy state of the electron? As described in the preceding section, this is obtained by calculating the average value of H using the unperturbed wavefunction: W(1) 1 = L o 2/Lsin(px/L) (ux/L) 2/Lsin(px/L) dx (12-29) We now describe three ways to evaluate this integral. One way is to integrate explicitly, the other two ways involve simple inspection. 1. Explicit Integration. Factoring constants from Eq. (12-29) gives W(1) 1 =2u/L2  L 0 x sin2(px/L) dx (12-30) To achieve a common variable, we multiply x and dx each by p/L and outside by L2/p2, thereby keeping the value unchanged: W(1) 1 =2u/p2  x=L x=0 (px/L) sin2(px/L)d(px/L) (12-31) Letting px/L=y and noting that y =0, p when x =0, L, we have W(1) 1 =2u/p2  p 0 y sin2 y dy (12-32) 398 Chapter 12 Time-Independent Rayleigh–Schr ¨ odinger Perturbation Theory Standard tables (see also Appendix 1) lead to a value of p2/4 for the integral, and W(1) 1 =(2u/p2)(p2/4)=u/2 (12-33) To first order, then, the energy of the lowest-energy perturbed state is, in atomic units, W =(p2/2L2)+u/2 (12-34) 2. Evaluation by Inspection: First Method. Equation (12-34) is certainly a reasonable result since, as the potential is increased everywhere in the box (except at x =0), we expect that the energy of the electron should also increase. The fact that the increase is such a simple quantity (u/2) suggests that there might be a simple way to understand this result, and this is indeed the case. Consider the distribution of the electron in the lowest unperturbed state, shown in Fig. 12-2. This distribution is symmetric about the midpoint of the line segment. Consequently, for each instant of time the unperturbed electron spends in element dx1 of Fig. 12-2, it spends an equal instant in the symmetrically related element dx2. In other words, for each instant the electron experiences a perturbation potential less than u/2, it experiences an instant of potential greater (by an equal amount) than u/2. Hence, the average potential must be precisely u/2. Since we know that .2 is symmetric for every state in the unperturbed box, we can immediately extend our result and say that the first-order correction to the energy of every state is u/2. The ability to evaluate first-order energies by inspection is very useful. Even in cases where exact evaluation by this technique is not possible, it may still be useful in making an estimate or in checking the reasonableness of a computed result. 3. Evaluation by Inspection: Second Method. A variation of the above approach is sometimes useful. We begin by recognizing that, whereas H is neither symmetric nor antisymmetric about the midpoint of the wire, we can make it antisymmetric by subtracting the constant u/2, as indicated in Fig. 12-3. Figure 12-2  .2(n=1) for a particle in the unperturbed box. Section 12-3 A Uniform Electrostatic Perturbation of an Electron in a “Wire” 399 Figure 12-3  The function ux/L is unsymmetric about L/2, but ux/L-u/2 is antisymmetric about L/2. By writing H =ux/L-u/2+u/2=Hanti +u/2 (12-35) we express H as an antisymmetric function plus a constant. Our integral for W(1) 1 becomes W(1) 1 = L 0 .n=1Hanti.n=1 dx + L 0 .n=1(u/2).n=1 dx (12-36) The first integral vanishes because .2 is symmetric. The second integral is just u/2 times unity since . is normalized. Hence, W(1) 1 =u/2. EXAMPLE 12-2 A one-dimensional box potential is perturbed so that it is raised by a constant amount, d, on the left side of the box but is unchanged on the right side. What is the first-order change in energy for .1? for .2? SOLUTION  One can argue, similarly to the reasoning in Example 12-1, that half of the distribution responds to the increase in potential, half to zero change, giving a net first-order increase of d/2. Or, one can turn the perturbation into an antisymmetric function plus a constant: H =H -d/2+d/2=Hanti +d/2, leading toW(1) 1 =d/2. The same result applies to .2 and all higher states because .2 n is symmetric in the box for all n.  12-3.C The First-Order Correction to .1 How should we expect the lowest-energy wavefunction to change in response to the perturbation? Since we are dealing with the lowest-energy state, we might expect the electron to spend more time in the low-potential end of the box (the nonclassical result), and so the wavefunction should tend to become skewed, as shown in Fig. 12-4. In this figure it is demonstrated how the perturbed wavefunction can be resolved into an unperturbed wavefunction and a correction, or difference, function. Since this correction function must increase . on the low potential side and decrease . on the high potential side, it is clear that it must be close to antisymmetric in nature. According to Eq. (12-19), the first-order approximation f(1) 1 to this correction function is formed by adding together small amounts of higher-energy wavefunctions. Comparing the correction function in Fig. 12-4 and these higher-energy wavefunctions (Fig. 12-5) leads us to expect that .2 will be a heavy contributor, whereas .3, being 400 Chapter 12 Time-Independent Rayleigh–Schr ¨ odinger Perturbation Theory Figure 12-4  Sketches of the lowest-energy wavefunction (a) before and after perturbation and (b) the difference between them. Figure 12-5  Sketches of the (a) second, (b) third, and (c) fourth (n=2, 3, 4, respectively) wavefunctions . for the particle in the unperturbed box. symmetric, will not contribute strongly. The mixing coefficient for .2 is, according to Eq. (12-18), c(1) 21 =  L 0 v2/Lsin 2px/L ux/L v2/Lsinpx/L dx -3p2/2L2 = 32L2u 27p4 (12-37) Thus, c(1) 21 is positive, and .2 contributes to f(1) 1 in the manner expected. The mixing coefficient for .3 is given by c(1) 31 = .1|Hanti|.3 E1 -E3 + .1|u/2|.3 E1 -E3 =0+0 (12-38) where we have used expression (12-35) for H. The first integral vanishes because .1 and .3 are symmetric. The second integral vanishes because they are orthogonal. Clearly, no symmetric state will contribute to f(1) 1 . Since .4 is antisymmetric, it can contribute to f(1) 1 . On evaluation, we find that c(1) 41 is about 2% of c(1) 21 (Problem 12-5). c(1) 41 is so much smaller than c(1) 21 for two reasons. First, the integral .1|H|.4 in the numerator of c(1) 41 is much smaller than .1|H|.2 in c(1) 21 . This means that the shifting of charge produced by adding .4 to .1 is much less helpful for lowering the energy than is that produced by adding .2 to .1. Examination of .2 and .4 (Fig. 12-5) reveals why this is so. .2 causes removal of charge from the right-hand half of the box and accumulation of charge in the left-hand half. .4 causes removal of charge from the second and fourth quarters (numbering from the left) and buildup of charge in the first and third quarters. On balance .4 helps, but charge buildup in the third quarter is not desirable nor is removal of charge from the second quarter, and so .4 is much less helpful than .2. The second reason for c(1) 41 being so small is that E1 -E4 in the denominator of c(1) 41 is five times as big as E1 -E2 in c(1) 21 . Section 12-3 A Uniform Electrostatic Perturbation of an Electron in a “Wire” 401 In general, mixing between states of widely different energies is discouraged by the formula for f(1) i . The fact that a large contribution by .j to f(1) i is favored by large .i |H|.j  and small |Ei -Ej | is strikingly similar to the situation found for variational calculations (Chapter 7). There, the mixing between two basis functions .i and .j. is favored if .i |H|.j=Hij is large and if |Hii -Hjj | is small. EXAMPLE 12-3 What sign should be expected for c(1) 21 for the perturbation of Example 12-1? of 12-2? SOLUTION  Assuming that .1 will behave anticlassically, and be skewed so as to decrease the probability distribution in the higher-energy region (left side), we expect c (1) 21 to have the opposite sign of that in Eq. 12-37. Hence, it should be negative. This applies for both Examples 12-1 and 12-2.  12-3.D The Role of an Additive Constant in H Review of the results of Sections 12-3B and C will show that addition of a constant to H will change W(1) i by the same constant for all states and have no effect on f(1) i . The change in energy of all states by a constant is equivalent to a relocation of the zero of energy and is normally of no interest. Our initial statement that the perturbation is produced by a uniform electric field left the additive constant in H unspecified. We chose to set H = 0 at x = 0, but we could have made any of an infinite number of choices for H at x =0. A more sensible choice would have been the antisymmetric function H =ux/L-u/2 because this still leads to changes in wavefunctions due to the perturbation, but introduces no energy change to first order (W(1) i =0 for all states using thisH). In other words, the antisymmetric function includes the relevant physics of the problem and excludes the trivial effects of a constant (u/2) potential change. Our choice of H =ux/L was made for pedagogical reasons. 12-3.E The Calculation ofW(2) 1 Since the constant first-order contribution to the energies of all the states is not physically interesting, let us examine the second-order contribution to the ground-state energy, W(2) 1 . Equation (12-23) shows that this is related to the first-order correction to the wavefunction, f(1) 1 , which, as we have already seen, causes thewavefunction to become skewed toward the low-potential end of the box. It is clear from Eq. (12-22) that, in calculating the coefficients for f(1) 1 , we have already done most of the work needed to find W(2) 1 . We saw earlier that f(1) 1 is made primarily from .2. For simplicity, we will neglect all higher-energy contributions, and so f(1) 1 ~= c(1) 2 .2 =(32L2u/27p4).2 (12-39) 402 Chapter 12 Time-Independent Rayleigh–Schr ¨ odinger Perturbation Theory and, using Eq. (12-22), W(2) 1 ~= (32L2u/27p4)2(-3p2/2L2) (12-40) Thus, the effect of W(2) 1 is to lower the energy. We pause at this point to summarize our results. The perturbation raises the potential everywhere in the box. (This depends on our choice of an arbitrary constant.) The energy to first order is increased by the same constant amount for every state. This is easily seen by inspection, utilizing simple symmetry features ofH and the unperturbed wavefunctions. The wavefunction for the lowest-energy state is skewed, to first order, in a way that is energetically favorable as far as interaction with H is concerned. The second-order contribution to the energy for this state is negative, reflecting this energetically favorable shift of charge. Thus far, everything behaves sensibly. We next examine the behavior of the second-lowest energy state, where some important new features occur. EXAMPLE 12-4 For the .1 case of Example 12-1, should the energy to second order be higher or lower than the unperturbed energy? SOLUTION  The energy to second order is E1 +W (1) 1 +W (2) 1 . We have already seen that W (1) 1 =0. Thus the question comes down to asking whetherW (2) 1 is positive or negative. The firstorder correction to the wavefunction shifts charge from the higher-energy regions, hence produces a negative W (2) 1 .  12-3.F The Effects of the Perturbation on .2 We begin by examining f(1) 2 , the first-order correction to .2. Inspection of Eq. (12-18) leads at once to the observation that c(1) 21 =-c(1) 12 . This means that, since .2 contributes to f(1) 1 with a positive coefficient, .1 contributes to f(1) 2 with a negative coefficient. A sketch of .2 minus a small amount of .1 will show that this has the effect of shifting charge from the left half to the right half of the box. This is just the reverse of what we found for the lowest-energy state. The antisymmetry of Eq. (12-18) for interchange of i and k allows us to make the following general statement. Let a perturbation occur that raises or lowers the potential more in one region of space than in another. The first-order correction to a given wavefunction will contain higher-energy wavefunctions in a manner to cause charge to shift into regions of lowered (or less raised) potential and it will contain lowerenergy wavefunctions in a manner to cause charge to shift into regions of raised (or less lowered) potential. We have not yet completed our construction of f(1) 2 . We must calculate coefficients for contributions from the higher-energy functions .3, .4, etc. The state .3 should contribute fairly strongly since it has the proper symmetry and is not too distant in energy from E2: c(1) 32 = .2|H|.3 E2 -E3 = 3 5 32L2u 25p4 (12-41) Section 12-4 The Ground-State Energy to First-Order of Heliumlike Systems 403 Therefore, .3 contributes to f(1) 2 with a coefficient about 35 the magnitude of the contribution from .1. The sign of c(1) 32 is positive and a sketch of .2 plus a small amount of .3 will demonstrate that this causes charge shifting to the left in accord with our general statement above. Since.4 contributes nothing (by symmetry) and.5 is fairly distant in energy, we will neglect all contributions to f(2) 2 above .3. We have, then, two sizable contributions to f(1) 2 , each favoring charge shifts in opposite directions. Let us see how W(2) 2 reflects this: W(2) 2 ~= c(1)2 12 (E2 -E1)+c(1)2 32 (E2 -E3) (12-42a) ~= c(1)2 12 (3)+( 3 5 c(1) 12 )2(-5)p2/2L2 (12-42b) ~= 1.2c(1)2 12 p2/2L2 (12-42c) W(2) 2 is the difference between energy contributions of opposite sign. The net result (energy increases) comes about in this case because .1 contributes more heavily than .3. The fact that contributing wavefunctions from lower and higher energies affectW(2) i oppositely is made evident by Eq. (12-22). Hence, we can extend our general statement above by adding that the second-order contribution to the energy from states below .i in energy cause the energy of the j th state to go up, contributions from above cause it to go down. Because the unperturbed energies of the particle in the box increase as n2, any state (except the lowest) is closer in energy to states below than to states above. Hence, for at least some kinds of perturbation, we might expect these states to “feel” the effects of states at lower energies more strongly and to rise in energy (as far asW(2) is concerned). This is what happens in this example. There are no states below the lowest, and so W(2) 1 cannot be positive, but W(2) 2 is positive and it turns out that W(2) i is positive for all higher i as well. It is interesting to compare these results with those from classical physics. Classically, the particle moves most slowly at the top of the potential gradient and therefore spends most of its time there. The lowest-energy state has responded in the opposite manner, in a way we might call anticlassical. The second and all higher states have responded classically. 12-4 The Ground-State Energy to First-Order of Heliumlike Systems The hamiltonian for a two-electron atom or ion with nuclear chargeZ a.u. is (neglecting relativistic effects and assuming infinite nuclear mass) H(1, 2)=- 1 2 .2 1 +.2 2 -Z/r1 -Z/r2 +1/r12 (12-43) This may be written as a sum of one-electron operators and a two-electron operator: H(1, 2)=H(1)+H(2)+1/r12 (12-44) 404 Chapter 12 Time-Independent Rayleigh–Schr ¨ odinger Perturbation Theory where H(i) is simply the hamiltonian for the hydrogenlike system with nuclear charge Z: H(i)=- 1 2.2 i -Z/ri (12-45) Since we know the eigenvalues and eigenfunctions for H(i), we can let H(1)+H(2) be the unperturbed hamiltonian with 1/r12 the perturbation. Such a perturbation is not very small, but it is of interest to see how well the method works in such a case. We have, therefore, for the lowest-energy state of the system: H0 = H(1)+H(2) (12-46) H = 1/r12 (12-47) .1 = (Z3/p) exp [-Z(r1 +r2)] (12-48) E1 = -Z2/2-Z2/2=-Z2 (12-49) W(1) 1 = .1|1/r12|.1 (12-50) All quantities are in atomic units. The unperturbedwavefunction [Eq. (12-48)] is simply the product of two one-electron 1sAOs. Because this is an eigenfunction of the system in the absence of interelectronic repulsion, it is too contracted about the nucleus. The first-order correction to the energy is the repulsion between the two electrons in this overly contracted eigenfunction [Eq. (12-50)]. We have encountered this same repulsion integral in our earlier variational calculation on helium-like systems. The evaluation of this integral is described in Appendix 3. Its value is 5Z/8. Hence, to first order, W1 =E1 +W(1) 1 =-Z2 +5Z/8 (12-51) In Table 12-1, this result is compared with exact energies for the first ten members of this series. Several points should be noted: 1. The effect of the perturbation to first order is to increase the ground-state energy. This is expected since 1/r12 is always positive (i.e., repulsive). 2. The energy to first order is never below the exact ground-state energy. This is a general property of perturbation calculations as is easily proved (Problem 12-1). 3. The energy to first order is in error by a fairly constant amount throughout the series. For H-, this gives a substantial percentage of error and fails to show H- stable compared with anHatom and an unbound electron. For higherZ, this error becomes relatively smaller since 1/r12 becomes relatively less important compared withZ/r... The assumption that H is a small perturbation is thus better fulfilled at large Z and results in W(1) 1 being a much smaller correction relative to the total energy W1. Because “large” and “small” are relative terms, they can be misleading. Since energies of chemical interest are often small differences between large numbers (see the discussion at the end of Chapter 7), errors that were originally relatively small can become relatively large after the subtraction. Therefore, even though perturbation terminology would suggest that the results at Z =10 are better than those at Z =1, this may not be the case for some practical applications. TABLE 12-1  Comparison of Exact Energy (in atomic units) with Energy to First Order when H =r-1 12 Z System E1 W(1) 1 E1 +W(1) 1 Ea exact Eexact -E1 -W(1) 1 %Error 1 H- -1.0000 5/8=0.6250 -0.3750 -0.52759 -0.15259 28.92 2 He -4.0000 5/4=1.2500 -2.7500 -2.90372 -0.15372 5.29 3 Li+ -9.0000 15/8=1.8750 -7.1250 -7.27991 -0.15491 2.13 4 Be2 + -16.0000 5/2=2.50000 -13.5000 -13.65556 -0.15556 1.14 5 B3+ -25.0000 25/8=3.1250 -21.8750 -22.03097 -0.15597 0.71 6 C4+ -36.0000 15/4=3.7500 -32.2500 -32.40624 -0.15624 0.48 7 N5+ -49.0000 35/8=4.3750 -44.6250 -44.78144 -0.15644 0.35 8 O6+ -64.0000 5.0000 -59.0000 -59.15659 -0.15659 0.26 9 F7+ -81.0000 45/8=5.6250 -75.3750 -75.53171 -0.15671 0.21 10 Ne8+ -100.0000 25/4=6.2500 -93.7500 -93.90680 -0.15680 0.17 aEexact is the nonrelativistic energy to thirteenth order in Z-1, truncated at the fifth decimal place. See Scherr and Knight [4]. 405 406 Chapter 12 Time-Independent Rayleigh–Schr ¨ odinger Perturbation Theory 12-5 Perturbation at an Atom in the Simple H¨uckel MO Method Perturbation theory can be used to estimate the effect of a change in the value of the coulomb integral Hkk at carbon atom k. This is normally given the value of a, but it might be desirable to consider a different value due, for example, to replacement of an attached hydrogen by some other atom or group. We take the unperturbed H¨uckel MOs and orbital energies as our starting point and let the perturbed value of Hkk be a +da. Therefore, H =da (at center k only) (12-52) The first-order correction for the energy of fi , the ith MO, is W(1) i =fi |H|fi  (12-53) Now fi is a linear combination of the AOs .: fi = j cji.j (12-54) and so W(1) 1 = j  l c*jicli .j |H|.l (12-55) but H is zero except at atom k, where it is da, and so the sum reduces to one term: W(1) i =c*kicki da (12-56) Summing over all the MOs times the number of electrons in each MO gives the firstorder correction to the total energy: W(1) =da i nic*kicki =daqk (12-57) where qk is the p-electron density at atom k. Thus the energy change to first order is equal to the change in Hkk times the unperturbed electron density at that atom. This can also be seen to be an immediate consequence of the alternative energy expression E = l qlal +2 lEb, negative if Ea or < than atomic average IE atoms Does experiment agree with theory? H2 b 15.427 > 13.598 Yes He2 a ~22.0 < 24.46 Yes Li2 b 5.12 > 5.363 No B2 b ~9.5 > 8.257 Yes C2 b 12.0±0.6 > 11.267 Yes N2 b 15.576 > 14.549 Yes O2 a 12.06 < 13.618 Yes F2 a 15.7 < 17.426 Yes Ne2 a 20.1 < 21.47 Yes Si2 b 7.4±0.3 > 8.15 No Cl2 a 11.48±0.1 < 13.02 Yes Br2 a 10.53 < 11.85 Yes I2 a 9.3 < 10.457 Yes CN b 14.5±0.5 > 11.27 (C), 14.55 (N) Yes NO a 9.25 < 13.62 (O), 14.55 (N) Yes CO b 14.013 > 13.62 (O), 11.27 (C) Yes CS b 11.8, 119 > 11.27 (C), 10.36 (S) Yes ICl a 10.3 < 10.46 (I), 13.02 (Cl) Yes IBr a 9.98 < 10.46 (I), 11.85 (Br) Yes aData are from [3] or from Table 7-2. 494 Section 14-7 Shapes of Polyatomic Molecules: Walsh Diagrams 495 14-7 Shapes of Polyatomic Molecules: Walsh Diagrams In this section we will describe howthe rules and concepts ofQMOT enable one to rationalizeandpredictmolecularshapes. Theearliestsystematictreatmentofthisproblemwas givenbyWalsh,2 whoseapproachhasbeenextendedbyothers, particularlyGimarc[6].3 We begin by considering the symmetric triatomic class of molecules HAH, where A is any atom. Such molecules can be linear or bent. Walsh’s approach predicts which are linear, which are bent, and sometimes which of two bent molecules is more bent. We approach the problem in the following way. First we sketch the valence MOs for the generalized linear molecule HAH, deciding which is lowest, second lowest, etc. in energy. Then we imagine bending the molecule and argue whether each MO should go up or down in energy on the basis of our QMOT rules. This produces a chart of orbital energies versus bond angle—a one-sided correlation diagram. Finally, we use this diagram to argue that HAH will be linear or bent, depending on how many valence electrons HAH has and on how they are distributed among the MOs. The first problem is to sketch the MOs for linear HAH and decide their energy order. Probably the simplest way to do this is through use of symmetry and perturbation arguments. We know that the linear molecule belongs to the D8h point group, possessing a center of inversion and a reflection plane through the central atom and perpendicular to the HAH axis. This means that the two hydrogen 1sAOs (1s1 and 1s2) will appear in MOsin the symmetry combinations fg=1s1+1s2,fu =1s1-1s2, where g and u stand for gerade and ungerade, respectively (see Chapters 7 and 13 for a background discussion). Recognizing this, we can next consider which MOs will result from interactions between the valenceAOson atomAand the symmetry orbitalsfg andfu. Aperturbationtype diagram for this appears in Fig. 14-7. We have assumed that only the valence s and pAOs onA are involved in bonding. Extension to include dAOs is possible. On the right side of Fig. 14-7, the symmetry orbitals fg and fu are shown to be slightly split. This reflects the greater stability of the in-phase, or bonding, combination. However, the splitting is slight because the hydrogen atoms are quite far apart (so that atom A can fit between them). These two levels sandwich the separated-atom limit of -1 2 a.u. or -13.6eV. The AOs of atom A are sketched on the left. Their energies are arranged so that the 2p energies are about the same as the fg,fu symmetry orbital energies on the right. (For example, AO energies used for nitrogen in EHMO calculations are: 2s~-25eV, 2p~-13 eV.) For the linear molecule, we can label the s and p AOs on atom A as s or p and g or u. To generate the MO energy-level pattern from the interactions between these AOs onA and fg,fu, we use the following rules from perturbation theory (see Chapter 12): 1. Interactions occur only between orbitals of identical symmetry. 2. Interactions lead to larger splittings if the interacting orbitals are closer in energy (overlap considerations being equal). The resulting energy levels appear in the central column of Fig. 14-7. We need sketches of the MOs whose energy level pattern we have just approximated. We can guess the qualitative appearance of these by recalling that, when two orbitals 2SeeWalsh [4] and the papers immediately following. See also Mulliken [5]. 3For a critical review of the theoretical validity ofWalsh’s method, see Buenker and Peyerimhoff [7]. 496 Chapter 14 Qualitative Molecular Orbital Theory Figure 14-7  Orbital energies of MOs formed by interaction of su and sg symmetry orbitals on H1 . . .H2 with valence s and p AOs on central atom A. The p AOs and MOs are degenerate but are sketched as slightly split. interact to give splitting, the lower energy corresponds to a bonding interaction, the higher energy to antibonding. Thus, for example, Fig. 14-7 indicates that the 1sg MO is a bonding combination of the 2sAO onA and the sg symmetry orbital, whereas 2sg is the antibonding combination. The pu MOs are simply the pp AOs on A, since there is nothing of the same symmetry for them to interact with. The six MOs for the HAH molecule are sketched in Fig. 14-8a. Before proceeding, notice that the lowest two MOs are A–H bonding, the next two are nonbonding, the highest two are antibonding. This is an example of the way in which topological, or nearest-neighbor, interactions govern the gross features of energy ordering. In fact, we could have generated this same set of MOs and energy order by simply sketching all of theMOswe could think of that were symmetric or antisymmetric for relevant symmetry operations and then putting the bonding ones lowest (with s lower than p), nonbonding next, and antibonding highest (again recognizing that greater s character should yield lower energy). (Note that the nonbondingMOsare not symmetric or antisymmetric for arbitrary rotations about the C8 axis. Hence, they must form a basis for a representation of dimension greater than one. Hence, they are degenerate. See Chapter 13 for detailed discussion.) Electrons in the two lowest MOs produceA–H bonding. Electrons in the next two MOs have little effect on A–H bonding and, in fact, constitute what a chemist normally thinks of as lone pairs. Electrons in the two highest MOs tend to weaken the A–H bonds. We normally do not worry about questions of Section 14-7 Shapes of Polyatomic Molecules: Walsh Diagrams 497 Figure 14-8  The Walsh-type correlation diagram for HAH: (a) linear D8h; (b) bent C2v. The cross-hatched parts of MOs have opposite sign from open parts. shape for systems where these highest two MOs are filled because such an HAH system is not even bonded (i.e., we do not worry about second-order structure if there is no stable first-order structure). Consequently, these two highest MOs will be omitted in many energy-versus-angle diagrams, although there are certain cases where they can be useful (e.g., in singly excited configurations) (see Problem 14-5). EXAMPLE 14-2 Referring to Fig. 14.8, give the QMOT rationale for the MOs in the lower-half of the figure having energies below those in the upper half. What is the rationale for the lowest (a1g) MO being at lower energy that the second-lowest (a2u) MO, and why are these both below the next (eu) MOs? 498 Chapter 14 Qualitative Molecular Orbital Theory SOLUTION  The lowest-energy half of the set are all C–H bonding MOs, the other half all being C–H antibonding. Because there are so many C–H bonds, their nature determines this separation. Within the C–H bonding set, the lowest-energy pair differ from the others in having 2s AOs on carbon, which are sufficiently lower in energy than 2pAOs to give them lower energy. The a1g MO is C–C bonding, hence has lower energy than the a2u MO, which is C–C antibonding.  We now consider how the MO energies change upon bending the molecule. As 1sg is bent, the two hydrogen AOs move closer together. Because they have the same phase, this leads to an overlap increase, but it occurs over a fairly long distance, being a second-nearest neighbor interaction. Therefore, there is an energy lowering, but it is not very large. Bending 1su leads to two changes. First, the 1s AOs move away from the axis of maximum concentration of the 2p AO. This causes a substantial loss of overlap and a substantial increase in energy. Second, the 1s AOs move closer to each other. They disagree in phase, and this also tends to increase the energy, although it is a relatively small effect because it occurs between second-nearest neighbors. In the linear molecule, 1pux and 1puy MOs contain no contribution from hydrogen 1s because the hydrogens are in a nodal plane in each case. If we imagine that, in bending the molecule, we keep the hydrogens in the xz plane, then we see that the hydrogens are remaining in the nodal plane for 1puy, but have moved away from the nodal plane of 1pux. Once this happens, the 1s AOs are no longer forbidden by symmetry from contributing to the px MO, and a “growing in” of 1sAOs occurs, leading to an MO like the 2a1 MO drawn in Fig. 14-8. This behavior is more complicated than we observed for the lower-energy MOs because it involves more than a mere distortion of an existing MO. With a little experience, this additional complication is easily predicted (or, one can do an EHMO calculation and “peek” at the answer by sketching out the MOs contained in the output; see Problem 14-6). The effect on the energy of this 2a1 orbital is quite large because several things happen, all of which are energy lowering: 1. For a given amount of 1s AO, the bonding overlap with 2p increases with bending. 2. For a given amount of 1s AO, the bonding overlap between the two hydrogens increases with bending. 3. Since the amount of 1sAO present is not constant, but increases with bending (from zero in the linear configuration) the rate of energy lowering due to (1) and (2) is further augmented. Also, this is a fairly high-energy MO, and QMOT rule (3) tells us to expect such MOs to respond more dramatically to overlap changes. Finally, the overlap of a 1s AO with a 2p AO on another nucleus varies as cos ., where . is zero when the 2p AO points directly at the 1sAO. This means that the rate of change of overlap with angle is much greater around . =90. than at . =0. (see Problem 14-8). This is another reason for thinking that 1pux.2a1 will drop in energy much faster than 1su.1b2 will rise. The other p MO, 1puy, undergoes no changes in overlap since the hydrogen atoms remain in the nodal plane throughout the bending process. Therefore,QMOTarguments predict no energy change for this MO. The highest two MOs change in energy in ways that should be obvious to the reader, based on the above examples. Since these are the highest-energy MOs, they should show further enhanced sensitivity to overlap changes. The MOs for the bent form are labeled in accordance with the symmetry notation for the C2v point group, with a and b meaning symmetric and antisymmetric, respectively, Section 14-7 Shapes of Polyatomic Molecules: Walsh Diagrams 499 for rotation about the two-fold axis and 1 and 2 being analogous symbols for reflection in the plane containing theC2 axis and perpendicular to the molecular plane. The lowest valence MO of each symmetry type is numbered “1” despite the fact that lower-energy inner-shell orbitals exist. Because of the very qualitative nature of the arguments leading to Fig. 14-8, no effort is made to attach a numerical scale, either for energy or angle. We are now in a position to see how predictions based on ourWalsh-type correlation diagram compare with experimental data. For molecules having only one or two valence electrons, we expect the preferred shape in the ground state to be bent. Examples areH+3 and LiH+2 , both of which have been shown by experiment and/or accurate calculation to be bent. Molecules with three or four valence electrons should be linear, since the 1su -1b2 energy rise is much greater than the energy change for the lower MO. Examples are BeH+2 ,BeH2, and BH+2 , which are indeed linear. Addition of one or two more electrons nowbrings the 1pu-2a1 MOinto play, and we have already argued that the energy change of this MO should be considerably greater than that of 1su -1b2. Basically we have here a competition between a filled MO that favors the linear form and a higher, partially or completely filled MO favoring the bent form. We therefore might reasonably expect molecules in which 2a1 is singly occupied to be bent, and molecules in which it is doubly occupied to be more bent. Occupancy of the 1pu -1b1 MO should have no effect on angle. Thus, that a molecule like BH2(1a1)2(1b2)2(2a1) has an equilibrium bond angle of 131., whereas SiH2(1a1)2(1b2)2(2a1)2 has an angle of 97.,NH2(1a1)2(1b2)2(2a1)2(1b1) has 103. and H2O(1a1)2(1b2)2(2a1)2(1b1)2 has 105. is in pleasing accord with these simple ideas. Changes of angle upon electronic excitation also agree well with the correlation diagram. The triplet state of SiH2 resulting from the 2a1.1b1 excitation has a wider angle (124.) than does the ground state (97.). The excited singlet corresponding to the same excitation has a comparable angle (126.). NH2, when excited from . . . (2a1)2(1b1) to (2a1)(1b1)2 opens from 103. to 144.. The isoelectronic PH2, under similar excitation, opens from 92. to 123.. There are other examples to support the validity of the HAH diagram, but these suffice to illustrate that this qualitative approach has considerable generality and utility. Note again that singly charged cations appear to fit QMOT predictions despite the fact that there is less theoretical basis for success here. EXAMPLE 14-3 SiH2 and H2O have rather similar H-A-H angles of 97. and 105., respectively. How would you expect these angles to compare for SiH+2 and H2O+? SOLUTION  SiH2 loses an electron from the 2a1 MO, and should open up to an angle in the 130. range. H2O loses an electron from the 1b1 MO, so its angle should not be greatly affected. (The value for H2O+ is 110.5..)  Walsh-type correlation diagrams have been constructed and discussed for many systems, among them AH3, HAB, HAAH, BAAB, H2AAH2, B2AAB2, H3AAH3.4 It is not appropriate that these all be described here. We will briefly discuss two more cases that bring in some additional features. Molecules with HAB configuration lack the high symmetry of HAH, and this means that the MOs are not as highly symmetry-determined as in HAH. Probably the simplest 4See Gimarc [6]. 500 Chapter 14 Qualitative Molecular Orbital Theory way to arrive at sketches for linear HAB MOs is to start with AB MOs (similar to A2 MOs) and add the 1s AO of H in bonding and antibonding modes to form linear MOs. The results for the seven lowest-energy MOs (all A–H bonding or nonbonding) are seen at the left side of Fig. 14-9. (Diatomic MOs were discussed in Chapter 7.) We now imagine the hydrogen atom to move away from theAB axis, as shown, and use our QMOT rules to decide whether the energies should rise or fall. As before, we expect overlap changes between H and A (nearest neighbors) to have a greater effect on energy than those between H and B. The bent molecule has only one symmetry element, namely a reflection plane containing the nuclei. An a MO is symmetric under this reflection, a is antisymmetric. Figure 14-9  TheWalsh-type diagram for the HAB system: (a) linear C8v; (b) bent Cs. (After Gimarc [6].) Reprinted with permission from Accounts Chem. Res. 7, 384 (1974). Copyright by the American Chemical Society. Section 14-7 Shapes of Polyatomic Molecules: Walsh Diagrams 501 This paucity of symmetry types means that many of the correlation lines in the diagram refer to the same symmetry. Hence, it is not too surprising that some of our correlations run into conflict with the noncrossing rule. In Fig. 14-9, dashed lines are drawn from 3s to 4a and from 1px to 3a. These dashed lines connect theMOdrawings in the manner expected if we ignore the noncrossing rule and simply bend the MOs along with the molecule. They are, as it were, intended correlations. But these lines are associated with the same symmetry a and hence cannot cross. Instead we have an avoided crossing, as 3s switches course and connects with 3a and 1px goes to 4a (solid lines in Fig. 14-9). Avoided crossings are not uncommon in quantum chemistry, and they occur in curves referring to state energy as well as orbital energy. A generalized sketch exemplifying the idea is shown in Fig. 14-10. The actual energy change (as a function of bond length, angle change, or whatever process is occurring) may show an intermediate maximum or minimum as a result of the avoided crossing. (The dashed lines are energies we predict by “forgetting” to allow the two functions of the same symmetry to be mixed in the variational procedure. The error involved in this is small if the two functions are of dissimilar energy. As they grow closer in energy, the error grows worse, and the deviation between solid and dashed lines gets bigger as the dashed lines converge.) A classic example of such an intermediate maximum is seen in an excited state of H2, illustrated in Fig. 14-11. Such maxima are important in understanding highenergy processes because they provide a means for some molecules to exist in bound vibrational states even while unstable with respect to dissociation products. Such states are called metastable states. The HAB Walsh diagram of Fig. 14-9 rationalizes the fact that the ten-valence electron HCN is linear in the ground state (due to 1px -4a) but bent in the first excited state. Such molecules as HNO, HNF, and HOCl, with 12–14 electrons, are bent in both ground and excited states because there is always at least one electron in the 5a level. The final system we shall consider is the H3AAH3 system. We will show how QMOT can be used to understand why diborane (B2H6) has a bridged structure whereas ethane (C2H6) does not, why ethane prefers to be staggered (D3d) rather than eclipsed (D3h), and what geometry changes we might expect if ethane is forced into the eclipsed conformation. Figure 14-10  Intended (dashed lines) and actual correlations between orbitals or states having the same symmetry. 502 Chapter 14 Qualitative Molecular Orbital Theory Figure 14-11  The four lowest states of H2. Note the curve for the first excited 1+g state. This shape results from an avoided crossing. (From Sharp [8].) Gimarc’s diagram relating MOs for A2H6 in D3d staggered and D2h bridged shapes is shown in Fig. 14-12. The MOs on the left are all H–A bonding and are arranged pretty much in the order one would expect on the basis of theA–A bonding. The lowest two are composed mainly of valence s AOs on atoms A. Next come the p bonding combinations, then the ps bond, followed finally by the p antibonds. The 1eup bonding levels lie below the 2a1gps bonding level because the former have greater ability to overlap with the hydrogens. There are only two MOs that show much energy change the molecule goes from D3d to D2h geometry. These are the 2a1g.2ag and the 1eg.1b2g. In the former case, the energy drops because two hydrogens have moved from positions off axis of one p lobe into positions off axis of two p lobes, thereby increasing the total amount Section 14-7 Shapes of Polyatomic Molecules: Walsh Diagrams 503 Figure 14-12  Walsh-type correlation diagram forD3d -D2h shapes ofH3AAH3. (After Gimarc [6].) Reprinted with permission from Accounts Chem. Res. 7, 384 (1974). Copyright by theAmerican Chemical Society. of overlap. The energy in the latter case rises very markedly because the change in geometry places all six hydrogens into nodal planes, greatly reducing the overlap. The prediction is that a 10- or 12-valence electronA2H6 system should favor a bridged D2h geometry over D3d but a 14-valence electron system should prefer D3d over D2h. Diborane (12 valence electrons) and ethane (14) have structures consistent with this. EXAMPLE 14-4 What structure should be expected for B2H-6 ? 504 Chapter 14 Qualitative Molecular Orbital Theory SOLUTION  Assuming that the extra electron occupies the 1b2g MO, one might expect that B2H-6 would have the D3d structure of ethane. It depends on whether or not one electron in the 1b2g MO suffices to overcome the effects of the doubly occupied MOs. (Experiments and ab initio calculations indicate that B2H-6 has the same shape as ethane.)  In structural problems such as this, one must be careful that, when comparing two possible molecular shapes, one is not overlooking other possibilities that might be even more stable. For instance, even though the diagram in Fig. 14-12 indicates that ethane should prefer D3d geometry to D2h, it says nothing about D3d relative to D3h, the eclipsed form. For this we must construct another diagram, shown in Fig. 14-13. Here the principal energy changes occur in the doubly degenerate e- type MOs. Recall that Figure 14-13  Walsh-type diagram for D3d–D3h A2H6: (a) D3d (staggered); (b) D3h (eclipsed). Note that the e-type MOs for the two forms do not turn into each other by rotating about the C–C bond. This is really a two-sided correlation diagram, the high symmetry of each form determining the MOs. Then QMOT rules are used to decide how similar MOs on the two sides should relate in energy. (See Lowe [9].) Section 14-8 Frontier Orbitals 505 degenerate MOs need not be symmetric or antisymmetric for all symmetry operations of the molecular point group. This results, in this case, in the e-type MOs being much more unbalanced, or lopsided, than the nondegenerate a-type MOs. As a result, the changes in overlap population upon rotation are much bigger for e-than for a-type MOs. The higher-energy eg -e set of MOs dominates the lower energy eu -e set for two reasons; it is at higher energy and hence more sensitive to overlap change, and the coefficients on the hydrogens are bigger due to the central nodal plane, which reduces the size of the MO in the A–A bond, forcing it to be larger elsewhere. In sum, a 14-valence electron A2H6 molecule prefers the staggered (D3d) form because the long-range H···H antibonding in 1e dominates the long range bonding in 1e. We can describe this as “nonbonded repulsion” between hydrogens at opposite ends of the molecule. The energy changes in Fig. 14-12 are much larger than those in Fig. 14-13. In the former, we are charting energy changes associated with changes in bond angle and even molecular topology. Overlap changes are large and occur between nearest neighbors. In Fig. 14-13 we are charting energy changes associated with internal rotation. Here the overlap changes occur between third-nearest neighbors and are very small. The order in which the possibilities have been examined—D3d versusD2h followed byD3d versusD3h—is thus sensible in that we are considering the grosser energy changes first. One can go even further and guess the qualitative changes in C–C, C–H distances, and C–C–H angle if ethane is forced into the eclipsed conformation. We argue that the 1eg.1e pair of MOs suffer the greatest overlap change, losing population between the vicinal hydrogens. We must renormalize the MO to compensate for this loss, just as we had to in H+2 , discussed earlier. To renormalize, the MO 1e is multiplied by a factor slightly greater than unity. This magnifies the p antibond between the carbons and the bonding between carbon and hydrogens. Thus, we expect eclipsed ethane to have a slightly lengthened C–C bond, slightly shorter C–H bonds, and a larger C–C–H angle (the latter presumably mainly due to the increased vicinal repulsion brought about by overlap changes in the original rotation). Ab initio calculations5 support these predictions. 14-8 Frontier Orbitals We have indicated that higher-energy MOs tend to undergo more pronounced energy changes upon overlap change due to distortion of the nuclear frame. This fact has led to a shortcut method for guessing the results of full orbital correlation diagrams of the sort we have already discussed. One merely considers what the energy behavior will be for the highest occupiedMO(HOMO) and bases the prediction entirely on that MO, ignoring all the others. Fukui6 was the first to draw attention to the special importance of the HOMO. He also noted that, in certain reactions in which the molecule in question acted as an electron acceptor, the lowest unfilledMO(LUMO) of the molecule (before it has accepted the electrons) is the important one. These two MOs are called the frontier MOs. It sometimes happens that the second-highest-energy occupied MO undergoes a 5Stevens [10] finds that the C–C distance increases by 0.01 Å, the C–H distance decreases by 0.001 Å, and the C–C–H angle opens by 0.3.. 6See Fujimoto and Fukui [11]. 506 Chapter 14 Qualitative Molecular Orbital Theory Figure 14-14  HOMO and LUMO frontier MOs and subjacent and superjacent MOs. much bigger overlap change than theHOMOdoes, and therefore dominates the process. Recognition of this fact has led to a special name for the MOs just belowtheHOMOand just above the LUMO. They are called subjacent and superjacent MOs (see Fig. 14-14). An example of the frontier MO approach is provided by reconsidering the staggered versus eclipsed conformation for ethane. We expect the highest-energy, lopsided MO to be the p antibonding eg -e degenerate pair. Qualitative molecular orbital theory rules lead us to expect this pair to have higher energy in the eclipsed form. Therefore, we expect ethane to be staggered. The same approach can be used to predict the conformation of dimethylacetylene, H3C–C=C–CH3. The HOMO is again a degenerate pair of p-type MOs. (Generally speaking, one assumes that occupied p MOs are higher in energy than occupied s MOs, and this is often true. Even when it is not, however, the p-type MOs often tend to be more lopsided and hence to dominate because their overlap changes are greater.) The two degenerate HOMOs are delocalized over the entire molecule and can be expected to have the following characteristics: 1. They will be orthogonal to each other. 2. They will be bonding in the central C=C region, helping to establish multiple bond character there. 3. They will be antibonding in the C–C single bond regions, thereby cancelling out double-bond character from a lower set of p-type MOs. 4. They will be C–H bonding. [Use of rules (2)–(4) often suffices to establish the qualitative nature of HOMOs of hydrocarbons.] The results of all these conditions are the MOs sketched in Fig. 14-15. Observe that the end-to-end hydrogen overlap is most positive in the eclipsed conformation. This leads to the prediction that this molecule is more stable in the eclipsed conformation. Since the hydrogens are so far apart, the overlap change is expected to be very small. Ab initio calculations indicate that dimethylacetylene is more stable in the eclipsed form and that it has a barrier of less than 0.02 kcal/mole. As another example of frontier orbital usage, consider the methyl rotation barrier in propene, H3C–CH=CH2. Here the HOMO should be p-bonding in the double bond, antibonding in the single bond, and C–H bonding in the methyl group. This MO is sketched in Fig. 14-16 for the two possible conformations. The end-to-end antibonding in this MO is greatest for case (b), and so conformation (a) is favored. Indeed, it has been observed that, in general, a threefold rotor attached to a double bond Section 14-8 Frontier Orbitals 507 Figure 14-15  Degenerate HOMOs for dimethylacetylene, (a) eclipsed, (b) staggered, lead to the prediction that this molecule should prefer the eclipsed conformation. Figure 14-16  HOMO of propene in two conformations. Methyl group (a) eclipses and (b) staggers the double bond. The energy is lower in (a). prefers to eclipse the double bond. A few examples are acetaldehyde (H3C–CH=O), N-methylformaldimine (H3C–N=CH2), nitrosomethane (H3C–N=O), and vinyl silane (H3Si–CH=CH2). Notice that the HOMO of Fig. 14-16 is qualitatively similar to one of the 1eg -1e HOMOs of ethane. The QMOT frontier orbital argument for the stability of staggered ethane is basically the same as that for the stability of form (a) in propene. Observe that this MO also resembles the HOMO of 1,3-butadiene, and would lead to the prediction that the trans form of this molecule is more stable than the cis. This is, in fact, observed to be the case. It is important to bear in mind that the frontier orbital approach is an approximation to an approximation. It is not always easy to know when one is on safe ground. Of the examples mentioned here, ethane and dimethylacetylene are safest because the overlap changes are small. Hence, the perturbation is slight, and our assumption that the MOs of the two forms are essentially identical, except for AO overlap changes, is quite accurate. Also, symmetry is high, and so we know that s MO overlap changes will be smaller than lopsided p MO overlap changes. Propene and butadiene are risky. Here the whole molecular framework is lopsided, so overlap changes are large in s as 508 Chapter 14 Qualitative Molecular Orbital Theory well as in p MOs. Indeed, if one performs EHMO calculations on these molecules, one finds that s-type MO energies change much more than does the p HOMO energy. This comes about because of the fairly close approach by some of the hydrogens in these molecules. Much of this s energy change cancels out among the several s MOs. In propene, the cancellation is so complete that the p HOMO energy change is almost the same as the total EHMO energy change. In butadiene, however, the p HOMO accounts for only about one-third of the total energy change. Even though the frontier orbital method has an astonishing range of qualitative usefulness (we shall see more applications shortly), it is clear that caution is needed. EXAMPLE 14-5 How should the bond lengths and angles of propene change if the methyl group rotates from the stable conformation (Fig. 14-16a) to the unstable one (Fig. 14-16b)? SOLUTION  The HOMO for form b has a bit more negative overlap, which requires a slightly larger normalizing coefficient. This increases the influence of the HOMO, so we expect a slight shortening of the two out-of-plane C–H bonds and the C=C bond, a lengthening of the C–C single bond, and a slight opening of the H–C–H angle for the out-of-plane C–H bonds. No changes are predicted for the in-plane C–H bond lengths or angles as a result of HOMO renormalization. (However, other MOs are also being renormalized, so other changes will result. However, the HOMO-induced changes should dominate.)  14-9 Qualitative Molecular Orbital Theory of Reactions It has been found possible to extend and amplify QMOT procedures so that they apply to chemical reactions. One of the most striking examples of this was application to unimolecular cyclization of an open conjugated molecule (e.g., cis-1,3-butadiene, closing to cyclobutene). This type of reaction is called an electrocyclic reaction. The details of the electrocyclic closure of cis-1,3-butadiene are indicated in Fig. 14-17. If we imagine that we can keep track of the terminal hydrogens in butadiene (perhaps by deuterium substitution as indicated in the figure) then we can distinguish between two products. One of them is produced if the two terminal methylene groups have rotated in the same sense, either both clockwise or both counterclockwise, to put the two inside atoms of the reactant (here D atoms) on opposite sides of the plane of the four carbon atoms in the product. This is called a conrotatory (c.on · r ¯o · t ¯a · tory) closure. The other mode rotates the methylenes in opposite directions (disrotatory) to give a product wherein the inside atoms appear on the same side of the C4 plane. A priori, we do not know whether the reaction follows either of these two paths. Figure 14-17 depicts processes where both methylene groups rotate by equal amounts Figure 14-17  Two idealized modes of electrocyclic closure of cis-1,3-butadiene. Section 14-9 Qualitative Molecular Orbital Theory of Reactions 509 as the reaction proceeds. This is an extreme case of what is known as a concerted process. The two processes occur together, or in concert. The opposite extreme is a nonconcerted, or stepwise process, wherein one methylene group would rotate all the way (90.) and only after this was completed would the other group begin to rotate. This process would lead to an intermediate having a plane of symmetry (ignoring the difference between D and H), which means that the second methylene group would be equally likely to rotate either way, giving a 50–50 mixture of the two products pictured in Fig. 14-17. One can make a case for the reaction having some substantial degree of concertedness (by which we mean that the second methylene should be partly rotated before the first one is finished rotating). The reaction involves destruction of a four-center conjugated p system and formation of an isolated p bond and a new C–C s bond. Energy is lost in the dissolution of the old bonds, and gained in formation of the newones. Therefore, we expect the lowest-energy path between reactants and products to correspond to a reaction coordinate wherein the new bonds start to form before the old ones are completely broken. But the new s bond cannot form to any significant extent until both methylene groups have undergone some rotation. Thus, concertedness in breaking old bonds and forming new ones is aided by some concertedness in methylene group rotations. (Note that concertedness does not necessarily imply absence of an intermediate. If the reaction surface had a local minimum at a point at which both methylenes were rotated by 45., it would not affect the argument at all.) Because they knewthat many electrocyclic reactions are observed to be stereospecific (i.e., give ~100% of one product or the other in a reaction like that in Fig. 14-17), Woodward and Hoffmann [12] sought an explanation of a qualitative MO nature. They used frontier orbitals and argued how their energies would change with a con- or disrotatory motion, due to changes in overlap. For butadiene in its ground state, the HOMOis the familiar p MOshown in the center of Fig. 14-18. The figure indicates that the interaction between p–p AOs on terminal carbons is favorable for bonding in the region of the incipient s bond only in the conrotatory case. Therefore, the prediction is that, for concerted electrocyclic closure, butadiene in the ground state should prefer to go by a conrotatory path. When the reaction is carried out by heating butadiene (thermal reaction), which means that the reactant is virtually all in the ground electronic state, the product is indeed purely that expected from conrotatory closure. One can also carry out electrocyclic reactions photochemically. The excited butadiene now has an electron in a p MO that was empty in the ground state. This MO was the lowest unoccupied MO (LUMO) of ground-state butadiene, pictured in Fig. 14-19. One can see that the step to the next-higher MO of butadiene has just introduced one Figure 14-18  The HOMO of ground state cis-1,3-butadiene as it undergoes concerted closure by either mode. 510 Chapter 14 Qualitative Molecular Orbital Theory Figure 14-19  The HOMO of the first excited state of cis-1,3-butadiene as it undergoes closure by either mode. more node, reversing the phase relation between terminal p AOs, and reversing the predicted path from con-to disrotatory. Experimentally., the photochemical reaction is observed to give purely the product corresponding to disrotatory closure. (It is not always obvious which empty MO becomes occupied in a given photochemical experiment. One assumes that the LUMO of the ground state is the one to use, but there is some risk here.) EXAMPLE 14-6 Is the LUMO . HOMO transition dipole-allowed for cis-1,3- butadiene? SOLUTION  The HOMO is antisymmetric for reflection through the symmetry plane that bisects the molecule, and the LUMO is symmetric for this reflection. This is the only symmetry reflection plane where the MOs have opposite symmetry, so the transition is dipole-allowed (and is polarized from one side of the molecule towards the other). The group theory approach for this C2. molecule is that the HOMO has a2 symmetry, the LUMO has b1 symmetry, their product has b1 symmetry, and, since x also has b1 symmetry, the transition is allowed and is x-polarized (where x is colinear with the central C–C bond).  One might worry about the fact that we are looking at only a part of one MO, thereby ignoring a great deal of change in other MOs and other parts of the molecule. However, much of this other change, while large, is expected to be about the same for either of the two paths being compared. The large overlap changes between p–p AOs on terminal and inner carbon atoms, for instance, are about the same for either mode of rotation. This approach, then, is focused first on the frontier orbitals, which are guessed as being most likely to dominate the energy change, and second on those changes in the frontier orbitals that will differ in the two paths. This method is trivially extendable to longer systems. Hexatriene closes to cyclohexadiene in just the manner predicted by the frontier orbitals. The only significant change in going from butadiene to hexatriene is that we go from four to six p electrons. This means that the HOMO for hexatriene has one more node than that for butadiene (or, the HOMO for a 2n p-electron system is like the LUMO for a 2n-2 p-electron system insofar as end-to-end phase relations are concerned). The net effect is that the predictions for hexatriene are just the reverse of those for butadiene. That is, hexatriene closes thermally by the disrotatory mode and photochemically by the conrotatory mode. The general rule, called a Woodward–Hoffmann rule, is this: the thermal electrocyclic Section 14-9 Qualitative Molecular Orbital Theory of Reactions 511 reactions of a k p-electron system will be disrotatory for k =4q +2, conrotatory for k = 4q (q =0, 1, 2, . . . ); in the first excited state these relationships are reversed.7 It is possible to treat electrocyclic reactions in another way, namely, via a two-sided correlation diagram approach. This was first worked out by Longuet-Higgins and Abrahamson [14]. Only orbitals (occupied and unoccupied) that are involved in bonds being made or broken during the course of the reaction are included in the diagram. For butadiene, these are the four p MOs already familiar from simple H¨uckel theory. For cyclobutene, they are the two p MOs associated with the isolated 2-center p bond and the two s MOs associated with the new C–C s bond. These orbitals and their energies are shown in Fig. 14-20. In cyclobutene, the s and s* MOs are assumed to be more widely split than the p and p* because the ps AOs overlap more strongly. Also, the s MO is assumed lower than p1 of butadiene. However, these details are not essential. All we have to be certain of is that we have correctly divided the occupied from the unoccupied MOs on the two sides. The dashed line in Fig. 14-20 separates these sets. Next we must decide which symmetry elements are preserved throughout the idealized reactions we wish to treat. Let us consider first the reactants and products. These have C2v symmetry, that is, a two-fold rotational axis, C2, and two reflection planes s1 Figure 14-20  MOs associated with bonds being broken or formed in the electrocyclic closure of (a) cis-1,3-butadiene to (b) cyclobutene. 7SeeWoodward and Hoffmann [13, p. 45]. 512 Chapter 14 Qualitative Molecular Orbital Theory Figure 14-21  Sketches illustrating that the conrotatory mode (b) preserves the C2 axis while the disrotatory mode (a) preserves the reflection plane s1. and s2 containing the C2 axis (see Fig. 14-21). A conrotatory twist preserves C2, but, during the intermediate stages between reactant and product, s1 and s2 are lost as symmetry operations. A disrotatory twist preserves s1 but destroys C2 and s2. Therefore, when we connect energy levels together for the disrotatory mode, we must connect levels of the same symmetry for s1, but for the conrotatory mode, they must agree in symmetry for C2. The s2 plane applies to neither mode and is therefore ignored. The symmetries for each MO are easily determined from examination of the sketches in Fig. 14-20, and are given in Table 14-3. These assignments lead to two different correlation diagrams, one for each mode. It is conventional to arrange these as shown in Fig. 14-22. TABLE 14-3  Symmetries for C2v MOs MO s1 C2 MO s1 C2 Butadiene Cyclobutene p1 Sa A s S S p2 A S p S A p3 S A p* A S p4 A S s* A A aS is symmetric; A antisymmetric. Section 14-9 Qualitative Molecular Orbital Theory of Reactions 513 Figure 14-22  A pair of two-sided correlation diagrams (one for each mode) for the electrocyclic reactions of cis-1,3-butadiene: (a) cyclobutene; (b) butadiene; (c) cyclobutene. There is curve crossing in these diagrams, but it is always lines of different symmetry that cross, and so no violation of the noncrossing rule occurs. If we are considering a thermal reaction, the lowest two p MOs of butadiene are occupied. These correlate with the lowest two MOs of cyclobutene if the conrotatory mode is followed, and the thermal conversion of cis-butadiene to cyclobutene by a conrotatory closure is said to be symmetry allowed. The other mode correlates p2 with an empty cyclobutene MO (p*). Taking this route moves the reactant toward doubly excited cyclobutene. (Even though we might anticipate de-excitation somewhere along the way, the energy required in early stages would still be much higher than would be needed for the symmetry-allowed mode.) This is said to be a symmetry-forbidden reaction. If we now imagine photo-excitation of cis-butadiene to have generated a state associated with the configuration p2 1 p2p3, and trace the fate of this species for the two 514 Chapter 14 Qualitative Molecular Orbital Theory modes of reaction, we note that the disrotatory route leads to cyclobutene in the configuration s2pp* while the conrotatory mode gives sp2s*. Both of these are excited, but the former corresponds to the lowest excited configuration (p .p*) while the second corresponds to a very high-energy excitation (s .s*). Therefore, the former is “allowed” (since it goes from lowest excited reactant to lowest excited product) and the latter is “forbidden.” The two-sided correlation diagrams of Fig. 14-22 thus lead to the same predictions as the frontier orbital maximization of overlap approach. The difference between these approaches is as follows: The frontier-orbital approach requires sketching the HOMOand then judging overlap changes upon nuclear motion usingQMOT reasoning. The two-sided correlation diagram approach requires sketching all the MOs (occupied and unoccupied) of both reactant and product involved in bonds breaking or forming, ordering the corresponding energy levels, and finding symmetry elements preserved throughout the reaction. Once all this is done, the levels are connected by correlation lines without reliance on QMOT reasoning. Some qualitative reasoning enters in the ordering of energy levels (levels with more nodes have higher energy), but the two-sided correlation diagram technique is the more rigorous method of the two and tends to be preferred whenever the problem has enough symmetry to make it feasible. Reliance on frontier orbitals is more common for processes of lower symmetry. Concern is sometimes expressed about the apparent restrictions resulting from use of symmetry in correlation diagram arguments. One can imagine the butadiene cyclization occurring with less-than-perfect concertedness, the two methylene groups rotating by different amounts as the reaction proceeds. But that would destroy all symmetry elements. Will our symmetry-based arguments still pertain to such an imperfectly concerted reaction coordinate? Again, if we label certain sites by substituting deuteriums for hydrogens as shown in Fig. 14-17, the symmetry will be destroyed. Do our predictions still apply? One can answer these questions affirmatively by reasoning in the following way. If we had a collection of nuclei and electrons, and we could move the nuclei about in arbitrary ways and study the ground-state energy changes, experience tells us that the energy would be found to change in a smooth and continuous way. We can think of the energy as a hypersurface, with hyperdimensional “hills,” “valleys,” and “passes.” Now, in a few very special nuclear configurations, identical nuclei would be interrelated by symmetry operations, and we would be able to make deductions on group-theoretical grounds. Such deductions would only strictly apply to those symmetric configurations, but they would serve as indicators of what the energy is like in nearby regions of configuration space. Thus, the correlation diagram indicates that a perfectly concerted thermal electrocyclic reaction of butadiene will require much less energy to go conrotatory as opposed to disrotatory. The inference that a less-perfectly concerted reaction will have a similar preference is merely an assumption that it is easier to pass through the mountains in the vicinity of a low pass than a high one. Experience also leads us to expect that substituting for H a D (or even a CH3) will have little effect on the MOs, even though, strictly speaking, symmetry is lost. In essence, we work with an ideal model and use chemical sense to extend the results to less ideal situations, just as we do when, in applying the ideal gas equation of state to real gases, we avoid the high-pressure, low-temperature conditions under which we know the oversimplifications in the ideal gas model will lead to significant error. Section 14-9 Qualitative Molecular Orbital Theory of Reactions 515 It is possible to combine information on orbital symmetries and energies to arrive at state symmetries and energies. Then one can construct a correlation diagram for states.8 We now demonstrate this for the dis- and conrotatory reactions just considered. Each orbital occupation scheme is associated with a net symmetry for any given symmetry operation. Character tables could be used to assign these symmetries, but this is not necessary. All we need to use is the fact that, in multiplying functions together, symmetries follow the rules: S×S=S,A×A=S,S×A=A. Thus, any doubly occupied MO in a configuration will contribute symmetrically to the final result. To ascertain the net symmetry, then, we focus on the partly filled MOs. The symmetries for C2 and s1 of ground and some excited configurations of butadiene and cyclobutene are listed in Table 14-4. Included are the cyclobutene configurations that result from intended correlations of various butadiene configurations. (For instance, p2 1 p2 2 butadiene has an TABLE 14-4  Symmetries and Intended Correlations of Some Configurations of Butadiene and Cyclobutene Cyclobutene Symmetry for “intended” configuration Configuration s1 C2 Con Dis Butadiene p1 2p2 2 Sa S p2s2 s2p*2 p1 2p2p3 A A p2ss* s2p*p p1 2p2p4 S S p2sp* s2p*s* p1p2 2p3 S S ps2s* sp*2p p1p2 2p4 A A ps2p* sp*s*2 p1 2p2 3 S S s2s*2 s2p2 ... Cyclobutene s2p2 S S s2pp* A A sp2p* A S sp2p* A S sp2s* A A s2p*2 S S s2p*s* S A ... aS is symmetric; A antisymmetric. 8Actually, we shall be looking at simple products of MOs, or configurations. Each configuration is associated with one or more states and gives the proper symmetry for these states as well as an approximate average energy of all the associated states. Hence, the treatment described here gives a sort of average state correlation diagram. It might be more accurately called a configuration correlation diagram. 516 Chapter 14 Qualitative Molecular Orbital Theory intended correlation with s2p*2 cyclobutene if the disrotatory mode is followed. This is inferred from the orbital correlation diagram, Fig. 14-22.) Assuming that the energies of states associated with these configurations fall into groups roughly given by sums of orbital energies, we obtain the two-sided diagram shown in Fig. 14-23. Only a few of the configurations are interconnected, to keep the diagram simple. Note that the ground-state configuration of butadiene correlates directly with the ground state of cyclobutene for conrotatory closure, but has an intended correlation with a doubly excited configuration in the disrotatory mode. This intended correlation would violate the noncrossing rule by crossing another line of S symmetry, so that the actual curve turns around and joins onto the ground state level for cyclobutene. The effect of the intended correlation with a high-energy state is to produce a significant barrier to reaction. The figure shows that, for the first excited configuration, the highenergy barrier occurs for the opposite mode of reaction. State correlation diagrams Figure 14-23  A state or configuration correlation diagram for the electrocyclic closure of cis- 1,3-butadiene. Section 14-9 Qualitative Molecular Orbital Theory of Reactions 517 thus convert a “symmetry-forbidden” orbital correlation diagram into a high-activationenergy barrier: the conclusions are the same using either diagram. Another kind of reaction that is formally closely related to the electrocyclic reaction is the cycloaddition reaction, exemplified by the Diels–Alder reaction between ethylene and butadiene to give cyclohexene (I). Such reactions are classified in terms of the number of centers between the points of connection. Thus, the Diels–Alder reaction is a [4+2] cycloaddition reaction. One can conceive of several distinct geometrical possibilities for a concerted mechanism for such a reaction. The two new s bonds can be envisioned as being formed on the same face (suprafacial) (II) or opposite faces (antarafacial) (III) of each of the two reactants. The various possibilities are illustrated in Fig. 14-24. Qualitative MO theory is used to judge which process is energetically most favorable. One has a choice between the two-sided correlation diagram and the frontier orbital approach. We demonstrate the latter9 since it is simpler. Both methods lead to the same conclusion. In the course of this reaction, electrons become shared between the p systems of butadiene and ethylene. This is accomplished, to a rough approximation, by interaction between the HOMO of butadiene and the LUMO of ethylene and also between the LUMO of butadiene and the HOMO of ethylene. Let us consider the former interaction. The MOs are shown in Fig. 14-25 and the overlapping regions are indicated for the four geometric possibilities. Inspection of the sketches indicates that the two MOs have positive overlap in the regions of both incipient s bonds only for the [4s+2s] and [4a+2a] modes. Therefore, the prediction is that these modes proceed with less activation energy and are favored. Now let us turn to the other pair of MOs, namely the LUMO of butadiene (IV) and the HOMO of ethylene (V). 9See Hoffmann andWoodward [15]. 518 Chapter 14 Qualitative Molecular Orbital Theory Figure 14-24  Four suprafacial–antarafacial combinations possible for the Diels–Alder 2 + 4 cycloaddition reaction. Figure 14-25  Overlaps betweenHOMOof butadiene and LUMO of ethylene resulting from four interactive modes pictured in Fig. 14-24. The geometries for the four modes would all differ. These drawings are highly stylized. Note that, for each MO, the end-to-end phase relationship is reversed from what it was before. Two symmetry reversals leave us with no net change in the intermolecular phase relations. It is easy to see, therefore, that theseMOsalso favor the [s, s] and [a, a] modes. In cycloaddition reactions of this sort, one need analyze only one HOMO-LUMO pair Section 14-9 Qualitative Molecular Orbital Theory of Reactions 519 in order to arrive at a prediction. Extension to longer molecules or to photochemical cycloadditions proceeds by the same kinds of arguments presented for the electrocyclic reactions. Another type of reaction to which qualitative MO theory has been applied is the sigmatropic shift reaction, where a hydrogen migrates from one carbon to another and simultaneously a shift in the double bond system occurs. An example is given in Fig. 14-26. The usual treatment of this reaction10 involves examining the HOMO for the system at some intermediate stage in the reaction where the hydrogen has lost much of its bonding to its original site and is trying to bond onto its new site. At this stage, the HOMO of the molecule becomes like that of the nonbonding MO of an odd alternant hydrocarbon (Fig. 14-27) with a slightly bound hydrogen on one end. The suprafacial mode is favored in this particular case because the hydrogen can maintain positive overlap simultaneously with its old and new sites—the new bond can form as the old Figure 14-26  The two possible distinct products resulting from a shift of a hydrogen from position 1 to position 5 in a substituted 1,3-pentadiene. Groups A, B, C,Dare deuterium atoms, methyl groups, etc., enabling us to distinguish the products. Figure 14-27  Phase relations in the HOMO for (a) suprafacial and (b) antarafacial [1, 5] sigmatropic shifts. 10SeeWoodward and Hoffmann [16]. 520 Chapter 14 Qualitative Molecular Orbital Theory bond breaks. This is not possible for the antarafacial [1, 5] shift. However, the [1, 7] shift prefers the antarafacial mode. Many other types of chemical reaction have been rationalized using qualitative MO theory. The association of SN2 reactions withWalden inversion (i.e., the adding group attacks the opposite side of an atom from the leaving group) is rationalized by arguing that an approaching nucleophile will donate electrons into the LUMO of the substrate. The LUMO for CH3Cl is shown in Fig. 14-28. A successful encounter between CH3Cl and a base results in a bond between the base and the carbon atom, so the HOMO of the base needs to overlap the p AO of carbon in the LUMO of Fig. 14-28a. Attack at the position marked “1” in the figure is unfavorable because any base MO would be near a nodal surface, yielding poor overlap with the LUMO. Therefore, attack at site 2 is favored. As the previously empty LUMO of CH3Cl becomes partially occupied, we expect a loss of bonding between C and Cl. Also, negative overlap between the forming C-base bond and the three “backside” hydrogens should encourage the latter to migrate away from the attacked side, as indicated in Fig. 14-28b. The tendency of a high-energy, occupied MO of the base to couple strongly with the LUMO of a molecule like CH3Cl depends partly on the energy agreement between these MOs. If they are nearly isoenergetic, they mix much more easily and give a bonded combination of much lower energy. Molecules where theHOMOis high tend to be polarizable bases. A high-energy HOMO means that the electrons are not very well bound and will easily shift about to take advantage of perturbations. Such bases react readily with molecules having a low-energyLUMO(Fig. 14-29a). This corresponds to a “soft-base-soft-acid” interaction in the approach of Pearson [17]. When theHOMOand LUMO are in substantial energy disagreement, orbital overlap becomes less important as a controlling mechanism, and simple electrostatic interactions may dominate. This Figure 14-28  (a) The LUMO of CH3Cl. (b) Positive overlap between HOMO of base B and LUMO of CH3Cl increases antibonding between C and Cl and also repels H atoms from their original positions. Section 14-9 Qualitative Molecular Orbital Theory of Reactions 521 Figure 14-29  LUMO–HOMOinteractions and splitting for (a) nearly degenerate levels, (b) wellseparated levels. is a “hard-acid-hard-base” situation. Thus, we expectQMOT rules to apply to soft-soft, rather than hard-hard interactions. The reader may, by this time, begin to appreciate the very wide scope of QMOT and the large number of variations of a common theme that have been used. In this chapter we have given only a few representative examples. We have not described all the variations or all types of application. For fuller treatment, the reader should consult specialized books on this subject, some of which have been referred to in this chapter [18]. 14-9.A Problems 14-1. Calculate to first order the electronic energy of a hydrogen atom in its 1s state and in the presence of an additional proton at a distance of 2 a.u. What is the total energy to first order? Repeat for distances of 1 and 3 a.u. (See Appendix 3.) 14-2. Evaluate and graph the effects of dividing Haa ±Hab by 1±S for each of the following cases: Haa = 0,-5,-10,+10,-20. In each case, let Hab =-5, S = 0.5. Does the QMOT rule that antibonding interactions are more 522 Chapter 14 Qualitative Molecular Orbital Theory destabilizing than bonding interactions are stabilizing apply in all cases? Is the bonding level always the lower of the two? Does the QMOT expectation appear to be better followed by very low-energy levels, or by higher-energy levels? 14-3. Table P14-3 is a list of electron affinities (in electron volts) of certain molecules and atoms. Can you rationalize the molecular values relative to the atomic values using QMOT ideas? TABLE P14-3  Atomic and Molecular Electron Affinities (in electron volts) H (0.75) Cl (3.61) C2 (3.4) O2 (0.45) Cl2 (2.38) C (1.26) Br (3.36) CN (3.82) F2 (3.08) Br2 (2.6) N (0.0 ± 0.2) I (3.06) N2 (-16) S2 (1.67) I2 (2.55) O (1.46) S (2.08) CO (<-1.8) SO (1.13) ICl (1.43) F (3.40) H2 (~-2) CS (0.21) FCl (1.5) IBr (2.6) 14-4. For some time, it was uncertain whether the ground states of CH2 and NH+2 are singlets (2a1)2 or triplets (2a1)(1b1). The ground-state geometries of these systems have HAH angles of 134.(CH2) and 140–150.(NH+2 ). Based on other data described in the text for HAH systems, would you say these angles are more consistent with a singlet or a triplet ground state? Assuming that the first excited state is the other multiplicity, should the first excited state be more or less bent than the ground state? 14-5. Based on Fig. 14-8, what should happen to the geometry ofH2Oupon 3a1.1b1 excitation? 14-6. Carry out EHMO calculations for CH2 at HCH angles of 180., 150., 120., and 90.. (Use a constant C–H bond distance of about 1Å in all cases.) From an examination of the MO coefficients, sketch and assign symmetry symbols to each MO. Plot the energies versus angle. Critically discuss your computed results compared to Fig. 14-8. If there are differences, try to rationalize them. (Remember that any EHMO orbital energy change can be analyzed in terms of Mulliken population changes, as discussed in Chapter 10.) 14-7. Using the EHMO energy formula [Eq. (10-25)], analyze the contributions to the orbital energy change between 180. and 120. that you calculated for the 1b2 MO in Problem 14-6. What percentage of the energy change comes from loss of overlap between 1s and 2pAOs? From antibonding between hydrogens? Compare this with the discussion in the text. 14-8. If overlap between an s AO and a p AO goes as cos . (for constant and finite R) (Fig. P14-8), what is the mathematical expression for the rate of change of overlap with angle? Calculate the effect on overlap of a 30. shift, starting from . =0. Calculate the effect of a 30. shift from . =90.. Section 14-9 Qualitative Molecular Orbital Theory of Reactions 523 Figure P14-8  14-9. Use symmetry to help establish an energy level pattern and MO sketches for planarAH3 (equilateral triangular). UseQMOT rules to produce the correlation diagram for planar versus pyramidal AH3. Based on your reasoning, which of the following should be planar? BH3,CH+3 ,BeH-3 ,NH3,PH3,H3O+,CH-3 . Can you think of any other shapes that might be examined as possibilities for AH3 systems? 14-10. Use an EHMO program to generate MOs for CO2 at 180., 150., 120., and 90.. Sketch the MOs, characterize their symmetries, and construct an orbital energy correlation diagram for this molecule. Indicate what causes each MO energy to rise or fall. Based on your figure, would you expect the following molecules to be linear or bent? BeCl2,C3,CO2,N-3 ,NO2,O3,F2O. 14-11. Should a pu.pg electronic transition for ozone cause the O–O–O angle to increase or decrease according toWalsh-type arguments? Sketch the MOs and indicate your reasoning. [The notation pg and pu refers to the MOs in the linear molecule. For the equilibrium bent structure, these MOs are: pg . a2, b2;pu.a1, b1.] 14-12. Consider the electrocyclic reaction wherein the allyl anion closes to form a cyclopropenyl p anion (VIII). (The negative charge in the cyclopropyl anion may be thought of as resulting from double occupancy of a p–p AO on the singly protonated carbon.) Sketch the orbitals being formed or destroyed in this process. Determine the orbital symmetries for the symmetry operations conserved in conrotatory and disrotatory modes of closure. Set up an orbital correlation diagram and decide which mode is more likely for thermal and photochemical reactions. 14-13. Construct an orbital correlation diagram for the “broadside” 2+2 cycloaddition of two acetylenes to form cyclobutadiene (IX). Is the reaction likely to proceed 524 Chapter 14 Qualitative Molecular Orbital Theory through an intermediate of square planar geometry? Assuming this geometry, would reaction be easier thermally or photochemically? 14-14. For a system having no symmetry elements (except E), what is the result of the noncrossing rule? 14-15. a) Construct an orbital correlation diagram for the 2 + 4 cycloaddition (Diels–Alder) reaction discussed in the text. b) Construct a state correlation diagram for this reaction. 14-16. Based on the orbital relations discussed in the text and extensions of these relations to other cases, formulate generalized verbal rules (Woodward–Hoffmann rules) for cycloadditions and sigmatropic shift reactions. References [1] D. R.Yarkony and H. F. Schaefer, III, J. Chem. Phys. 61, 4921 (1974). [2] K. Ruedenberg, J. Chem. Phys. 66, 375 (1977). [3] Ionization potentials, appearance potentials, and heats of formation of gaseous positive ions, NSRDS-NBS 26, Natl. Bur. Stand. (1969). [4] A. D.Walsh, J. Chem. Soc. p. 2260 (1953). [5] R. S. Mulliken, Rev. Mod. Phys. 14, 204 (1942). [6] B. M. Gimarc, Accounts Chem. Res. 7, 384 (1974). [7] R. J. Buenker and S. D. Peyerimhoff, Chem. Rev. 74, 127 (1974). [8] T. E. Sharp, Atomic Data 2, 119 (1971). [9] J. P. Lowe, J. Amer. Chem. Soc. 92, 3799 (1970). [10] R. M. Stevens, J. Chem. Phys. 52, 1397 (1970). [11] H. Fujimoto and K. Fukui, in Chemical Reactivity and Reaction Paths (G. Klopman, ed.).Wiley (Interscience), NewYork, 1974. [12] R. B.Woodward and R. Hoffmann, J. Amer. Chem. Soc. 87, 395 (1965). See also R. Hoffmann, Angew. Chem. Int. Ed. 43, 2, 2004, for historical insights. [13] R. B. Woodward and R. Hoffmann, The Conservation of Orbital Symmetry. Academic Press, NewYork, 1970. [14] H. C. Longuet-Higgins and E. W. Abrahamson, J. Amer. Chem. Soc. 87, 2045 (1965). Section 14-9 Qualitative Molecular Orbital Theory of Reactions 525 [15] R. Hoffmann and R. B.Woodward, J. Amer. Chem. Soc. 87, 2046 (1965). [16] R. B.Woodward and R. Hoffmann, J. Amer. Chem. Soc. 87, 2511 (1965). [17] R. G. Pearson, ed., Hard and Soft Acids and Bases. Dowden, Hutchison, and Ross, Stroudsburg, PA, 1973. [18] B. M. Gimarc, Molecular Structure and Bonding. Academic Press, New York 1979. Chapter 15 Molecular Orbital Theory of Periodic Systems 15-1 Introduction A structure is periodic in space, and hence has a periodic potential, if a subunit of the structure can be found that generates the entire structure when it is repeated over and over while traversing one or more spatial coordinates. Thus, a regular polymer can be “generated” mentally by translating a unit cell along an axis, sometimes with accompanying rotations. A crystalline solid results from such translations along three coordinates. In an analogous sense, some molecules are periodic. Benzene is an example since it can be generated by rotating a C–H unit in 60. increments about a point which ultimately becomes the molecular center. We have already seen (Chapters 8 and 13) that the orbital energies, degeneracies, and coefficients in benzene are to a great extent determined by symmetry. But these symmetry constraints can also be viewed as resulting from the cyclic “periodicity” of benzene, and this finite cyclic periodicity is, in essential ways, like the extended infinite periodicity of, say, regular polyacetylene or of graphite. It is reasonable, therefore, that some of the aspects of the energies and orbitals in these extended periodic structures are closely related to those in cyclic periodic systems. The concepts commonly used to describe periodic polymers, surfaces, or solids are closely related to those we have already developed for molecules, but their terminology and depiction are not familiar to most chemists. Our goal in this chapter is to develop an understanding of these concepts by progressing from more familiar cyclic systems, like benzene, to polymers, and to show how qualitative MO concepts apply to periodic systems in general. 15-2 The Free Particle in One Dimension We will first examine the particle moving parallel to the x coordinate with no variation in potential energy. (Let V =0.) This system is periodic because every segment of x is the same as every other, and it provides a convenient starting point for discussion of periodic systems in general. 526 Section 15-2 The Free Particle in One Dimension 527 The quantum-mechanical solutions for this system are discussed in Chapter 2. The relevant points are: 1. All non-negative energies are possible because there are no boundary conditions (beyond the conditions applying to well-behaved functions). 2. Except for E =0, all energy levels are doubly degenerate. 3. The pair of linearly independent wavefunctions associated with each doubly degenerate energy E can be combined in an infinite number of ways to produce different resultant wavefunction pairs. Two of these pairs are especially convenient. These are .+ = expv2mE ¯h ix  (15-1a) .- = exp-v2mE ¯h ix  (15-1b) and .sin = sinv2mE ¯h x  (15-2a) .cos = cosv2mE ¯h x  (15-2b) 4. The exponential forms [Eqs. (15-1ba, b)] are also eigenfunctions for the momentum operator, pˆx = (h¯/i)d/dx. Thus .+ corresponds to a particle moving parallel to the x axis with momentum +v2mE (i.e., toward x =+8), and .- corresponds to motion toward x=-8. We can associate the double degeneracy of the energies with the fact that there is no difference in the potential felt by the particle, regardless of whether it moves from left to right or right to left: The system has two equivalent directions. 5. The real forms [Eqs. (15-2ba, b)] give oscillating particle distributions (except when E =0). In a given pair, the sine puts maximum particle density where the cosine puts its minimum, and vice versa. As E increases, the frequency of nodes increases. These wavefunctions are not “pure” momentum states because they are not eigenfunctions of pˆx (except when E=0). They are formed by mixing of the exponential forms. However, these functions are sometimes more convenient to work with, especially pictorially, and they are just as good as the exponential forms as long as we are concerned with energy rather than momentum. Equations (15-1) and (15-2) are sometimes written with v2mE/¯h replaced by the symbol k, with k having any value from zero to infinity. Evidently, k in this system is proportional to particle momentum, hence to the square root of the kinetic energy, T. (Since V =0, the kinetic energy is identical to the total energy, E.) For a free particle, de Broglie’s relation holds, so p is proportional to 1/., where . is the de Broglie wavelength. This means that k is proportional to the number of de Broglie wavelengths 528 Chapter 15 Molecular Orbital Theory of Periodic Systems Figure 15-1  Three ways of picturing the energies of the free particle. (b) and (c) show the dependence of E on k. In (c), the energies are understood to be degenerate (except at E =0). per unit of distance. Consequently, k is often referred to as the wavenumber. It is the de Broglie-wave-equivalent to the wavenumber of light or of any other classical harmonic wave. Summarizing, for the free particle, k .px .vE =vT .1/. (15-3) There are several ways in which the solutions of the Schr¨odinger equation for periodic structures are displayed graphically. One way is the familiar one of drawing all the energy levels against a vertical energy scale, as in Fig. 15-1a. (Since all energies are allowed for the free particle, this gives a single energy at E = 0 and a degenerate continuum, or “band”, from E >0 to 8, rather than a series of discrete lines.) Alternatively, we can plot E versus k to obtain the parabolic plot of Fig. 15-1b. This can be simplified, though, by plotting only one arm of the parabola (Fig. 15-1c), making implicit the fact that, for each solution with wavenumber k, there is a degenerate solution with wavenumber -k. Graphs that relate energy to k for periodic polymers, surfaces, or solids are called band diagrams. Figure 15-1c is the band diagram for the one-dimensional free particle. Another quantity of great physical importance is the number of states near a particular energy value. This is called the “density of states” (DOS). If we consider Fig. 15-1c, we can see that the states associated with 0<|k|<1 all lie within a certain range which we call E0 -E1 in Fig. 15-2a. If we assume that all values of k are equally likely (we show this to be true in Section 15-5), then there is an equal (infinite) number of states associated with 1<|k|<2, and these lie in the larger range E1 -E2. Hence, the states in the range E1 -E2 are less densely packed than those in the range E0 -E1. It is not hard to show that the density of states for the free particle drops off as 1/vE, as plotted in Fig. 15-2b. Ultimately we will consider one-dimensional periodic structures (polymers) with varying potentials caused by the presence of nuclei and other electrons, and we will find that these systems retain some of the above features. In particular, degeneracy due to directional equivalence and choice of real or complex forms for orbitals persist. Also, the quantity k retains its meaning as a wavenumber (in a restricted sense). However, k loses its simple relation to momentum and energy (because the electrons possess potential energy as well as kinetic energy), and, typically, certain energies become disallowed. This changes the infinite band to band segments separated by gaps in energy. Section 15-3 The Particle in a Ring 529 Figure 15-2  The states between |k|=0 and 1 are equal in number to those between |k|=1 and 2, but the former set is packed into a smaller energy range, producing a greater density of states. 15-3 The Particle in a Ring A particle of mass m constrained to move in the angular coordinate f about a ring of radius r with V (f)=0 is the cyclic analog of the free particle. This system, also described in Chapter 2, is similar in many ways to the one just discussed. Except for the case of E =0, all solutions are doubly degenerate and describable with real (trigonometric) or complex (exponential) functions. The quantum number j (we will use j in cyclic systems, k in linear systems) is proportional to the angular momentum, to the root of the energy, to the reciprocal wavelength, and to the number of de Broglie waves in one circuit of the ring. However, since the number of waves in the ring must be integral, we have a periodic boundary condition that restricts j to integer values and leads to a discrete energy spectrum as opposed to the continuous spectrum of the free particle. The diagrams summarizing relations between energies, j , and number of states are collected in Fig. 15-3. –2 –1 0 +1 +2 j E (a) (b) 0 0 1 2 NOS Figure 15-3  (a) The energy for a particle in a ring has parabolic dependence on j , but exists only when j is an integer. (b) The number of states versus energy. This is the discrete-state analog of the density-of-states plot for a continuum of energies. 530 Chapter 15 Molecular Orbital Theory of Periodic Systems Cyclic structures with varying potentials exist (e.g., benzene), and such systems retain the degeneracy and real or complex orbital features. Also, j continues to have meaning (restricted) with respect towavelength and number of nodes. However, angular momentum and energy are no longer simply related to j . 15-4 Benzene We next examine benzene—a cyclic system having a nonuniform potential. We consider only the p electrons as described by the simple H¨uckel method, since the features we wish to point out are already present at that elementary level. A formula is presented (without derivation) in Chapter 8 [Eq. (8-52)] for the MOs of molecules like benzene. This gives, for the j th MO of benzene, fj = 6  n=1 {6-1/2 exp[2pij (n-1)/6]}2pp (n), j =0, 1, . . . , 5 (15-4) where i =v-1 and 2pp (n) is a 2pp atomic orbital centered on the nth carbon atom. The energy for MO fj is equal to [Eq. (8-51)] j =a +2ß cos(2pj/6). (15-5) If j =0, all the exponentials equal one and Eq. (15-5) gives 0 =a +2ß, which can be recognized as the lowest-energy p MO, and Eq. (15-4) gives f0=(1/v6)[2pp (1)+ 2pp (2)+2pp (3)+2pp (4)+2pp (5)+2pp (6)]. This gives all the 2pp AOs the same phase, which is what we expect for the totally bonding, lowest-energy p MO. If j = 1, 1 = a + 2ß cos(p/3) = a + 2ß(1/2) = a + ß. This is an energy in the second-lowest level, which is a degenerate level. The corresponding MO is f1 = (1/v6){[exp(0)]2pp (1) + [exp(pi/3)]2pp (2) + [exp(2pi/3)]2pp (3) + [exp(pi)] × 2pp (4) + [exp(4pi/3)]2pp (5) + [exp(5pi/3)]2pp (6)}. This somewhat formidablelooking MO is complex. Because f1 is one of a degenerate pair of MOs (the other one is f5), we can mix the complex MOs to obtain real ones that are still eigenfunctions. The real forms of the benzene MOs are described in Section 8-10D. The MO formulas produced by Eq. (15-4) are real for nondegenerate cases but are usually complex for degenerate MOs. However, it is always possible to mix any pair of degenerate complex MOs to produce a pair of degenerate real MOs. Notice that the complex MOs place the same electron density at each carbon whereas the real forms do not. The real MOs show nodes at various points in the ring (see Fig. 8-13) just as the real wavefunctions for the free particle show nodes at various points in x. In these formulas, j is restricted to the integer values ranging from 0 to 5, giving six MOs. If one tries other integer values of j in formulas (15-4) and (15-5), one simply reproduces members of the above set. Indeed, any six sequential integer values for j produces the same set of solutions as does the sequence 0–5. In particular, the set -2,-1, 0, 1, 2, 3 is perfectly acceptable and provides a match with the conventions of solid-state physics and chemistry. The energies for the six unique MOs of benzene are reproduced over and over again if we allow the index j to run beyond the specified range. This is depicted in Fig. 15-4a, and it is easy to see that the same six energies result for any six contiguous j values. Section 15-4 Benzene 531 Figure 15-4  (a) Energies for benzene as a function of j . (b) Unique energies of benzene. (c) Number of states at each energy. All data refer to p energies at the simple H¨uckel level. The set from -2 to 3, alluded to above, can be replotted in condensed form over the range 0–3 (Fig. 15-4b) if we keep in mind the fact that the energies are degenerate except for the first and last. The “number of states” diagram for benzene is sketched in Fig. 15-4c. If we consider a monocycle with thousands of carbon atoms, Eq. (15-5) gives (condensed) results as sketched in Fig. 15-5a. The very large number of energies still lie in the range a +2ß to a -2ß with the cosine wave now stretched over a much larger range of integers. The well-separated points of Fig. 15-4b coalesce into the line of Figure 15-5  (a) H¨uckel energies and (b) density of states for a very large number of carbons in a cycle. 532 Chapter 15 Molecular Orbital Theory of Periodic Systems Fig. 15-5a, which now appears almost continuous. The shape of the curve in Fig. 15-5a makes it evident that the states around j =0 and j =N/2 are closer together in energy than are those around j =N/4, giving the density of states curve shown in Fig. 15-5b. There is a qualitative difference between what we found for the particle in a ring and benzene. In the former case, k can increase without limit, and energy keeps increasing as the square of k. In benzene, increasing j beyond the prescribed range simply causes the energy to cycle back and forth between a +2ß and a -2ß. Why do these systems behave so differently in this regard? The energy of the free particle increases as we fit more and more waves into a circle of fixed radius, obtaining, therefore, shorter de Broglie wavelengths. The H¨uckel energy of a benzene p MO depends on the extent of bonding and antibonding character between adjacent p AOs. When j is zero, all the interactions are bonding and =a+2ß. Asj increases, bonding interactions disappear, ultimately to be replaced by antibonding interactions. At j = 3, all interactions are antibonding and  = a -2ß. It certainly makes sense that this is the highest energy an MO can have because this is the most antibonding arrangement imaginable. But what happens mathematically to make this work out? Let us examine the exponential functions in Eq. (15-4) for two values of j , say j =3 and j =9, to see if we can resolve this question. An immediate problem confronts us: The function is complex, and sketching a complex function is not convenient. However, because the complex MOs can always be mixed to form real MOs, we can let the mixing occur within Eq. (15-4) itself to give sine and cosine equivalents. For example, fj cos = 6  n=1 {6-1/2 cos[2pj (n-1)/6]}2pp (n). (15-6) (For some values of j , this expression will yield MOs that are not normalized.) Equation (15-6) tells us that MO fj has six terms, each term being a 2pp AO times a coefficient. The values of the coefficients are given by the term in curly brackets. This term produces a discrete set of values, since n is a discrete set of integers, but we can sketch a continuous function [by replacing (n-1)/6 with the continuous variable f] and then locate the places on this “coefficient wave” that correspond to the discrete points of interest. Sketches for this “coefficient wave” when j =3 and j =9 are shown in Fig. 15-6. The special points of interest, where actual coefficient values are given Figure 15-6  Coefficient waves for j =3 (dashed curve) and j =9 (dotted curve). Both curves intercept the same coefficient values (±1/v6) at the positions related to carbon atoms 1–6, so the curves produce the same MO. Section 15-5 General Form of One-Electron Orbitals in Periodic Potentials 533 for benzene MOs, are also shown (at the positions labeled 1–6), and it is obvious that the same set of coefficients is produced by each function. Increasing j has caused the coefficient wave to oscillate with a shorter wavelength, but the extra oscillations do not register because they occur between the special points of interest where we are sampling the function (i.e., at the points where the carbon nuclei reside). In other words, the extra oscillations in our benzene coefficient functions are just “empty wiggling” and have no effect on the MO, so we cut off the j range where the meaningful wiggling starts to become empty wiggling. The reason that the particle in a ring does not show this behavior is that the exponential (or sine, cosine) functions in that system actually are the wavefunctions, while in benzene these functions are sampled only at discrete points where AOs are located, and the resulting coefficients are then used to produce MOs. Notice that the MOs for benzene as given by Eq. (15-4) are produced from an equation having the following form: There is an exponential term that, for each MO, supplies the coefficients for various carbons in the molecule, and there is a basis set of functions located on the various carbons. This basis set has the same “periodicity” as does the molecule. (In the case of benzene, we have so far taken this to be six identical 2pp AOs.) Wavefunctions for all periodic systems have this same form—an exponential (or equivalent trigonometric) expression times a periodic basis. This is the content of Bloch’s theorem, which we prove in the next section. EXAMPLE 15-1 What coefficients are generated for a benzene MO by a coefficient wave of the type shown in Fig. 15-6 with j =-3? j =0? SOLUTION  The arguments of the cosine function of Eq. (15-6) will be the negative of those of the j =3 case. Because the cosine is symmetric about an argument of zero, we obtain the same curve for j =-3 as for j =+3, hence the same MO coefficients. This demonstrates that the MO generated by j -6 is the same as that generated by j. For j =0, the cosine arguments all vanish, the cosines all equal +1, and the curve of Fig 15-6 becomes a straight line at a coefficient value of +1/v6. This produces the lowest-energy MO of benzene.  15-5 General Form of One-Electron Orbitals in Periodic Potentials—Bloch’s Theorem We will establish in this section the general mathematical nature of eigenfunctions for periodic hamiltonians. This is the content of Bloch’s theorem. The actual proof will be carried through for cyclic “periodic” systems, and those results will then be extended to noncyclic periodic systems. The rationale for this is that a nearly infinite linear periodic structure can be treated as cyclic without introducing error. That is, a sufficiently long extended chain of atoms can be assumed to have the same wavefunctions and energies as the same chain joined end to end to form a cycle. For short chains and rings, this is not the case; the lowest-energy MO for hexatriene is not as low in energy as that for benzene, nor is it uniform over the whole chain (as it is in benzene). But for long enough chains, the difference becomes negligible. Bloch’s theorem, as it applies to periodic cycles, states that eigenfunctions have the form j (f)=exp(ijf)Uj (f) (15-7) 534 Chapter 15 Molecular Orbital Theory of Periodic Systems where i = v-1, j is an integer and Uj (f) has the periodicity of the cycle. If the cycle belongs to the Cn point group, then U(f) = U(f + 2p/n). Functions of the form of Eq. (15-7) are called Bloch functions. It was stated in Section 13-8 that every wavefunction or orbital is a member of a basis for an irreducible representation for the point group of the system, so it follows that the wavefunctions we seek to characterize have to be bases for irreducible representations of the Cn point group. These groups are discussed in Section 13-11, where it is shown that the functions  =exp(if) and * =exp(-if) are bases for irreducible representations for these groups, which is to say that they are eigenfunctions for the rotation operator, no matter what the value of the integer n in Cn. We consider the more general functions exp(±ijf) with j an integer (so that the function obeys the cyclic continuity condition) and ask what happens when such functions are subjected to a rotation. Let the rotation operator be symbolized Cq n for a rotation of q times 2p/n. The n-fold cyclic system is invariant to this rotation if q is an integer. It is not difficult to show that (see Section 13-11) Cq nf (f)=f (f -2pq/n) (15-8) (For example, clockwise rotation by 60. brings to each point in the circle the value that used to be 60. in the counterclockwise direction.) Then Cq n exp(ijf) = exp[ij (f -2pq/n)] = exp(-2piqj/n) exp(ijf) (15-9) Equation (15-9) shows that exp(ijf) is an eigenfunction for the rotation operator, so exp(ijf) is a basis for an irreducible representation, with j any integer. We can build additonal flexibility into this eigenfunction if we write an exponential function in a more complicated way: exp[i(j +nN)f] with N also an integer. We know that this must still be a basis function because (j +nN) must still be an integer. If we operate with Cq n, we find that (Problem 15-6) Cq n exp [i(j +nN)f]=exp(-2piqj/n) exp [i(j +nN)f] (15-10) This shows that our newest, most general, exponential has the same eigenvalue for Cq n no matter what integer value we choose for N. [Observe that the eigenvalue in Eq. (15-10) depends on q, j, and n, but not N.] Therefore, since we have an unlimited number of choices for the integer N, we have an unlimited number of degenerate eigenfunctions for Cq n for each choice of j . We can mix together such degenerate eigenfunctions in any way we please and still have an eigenfunction. Let us take the linear combination 8 N=-8 AN exp [i(j +nN)f] (15-11) This can be factored into the form exp(ijf) 8 N=-8 AN exp(inNf) (15-12) The sum is a function of f which we can call U(f). It has the same periodicity in f as the cycle because n is equal to the number of identical cells in the cycle and N is Section 15-5 General Form of One-Electron Orbitals in Periodic Potentials 535 an integer. It is actually the general Fourier series expansion of a periodic function. For N =0, the contribution is constant (A0) at all f. For N =±1, the functions repeat n times around the cycle and clearly have the periodicity of the system. For N =±2 the functions repeat 2n times, so they obviously still have the periodicity of the system. Since all the individual members of the sum have the periodicity of the system, so does the sum itself. The specific nature of the resultant function U(f) depends on the choice of coefficients AN. We have shown that exp(ijf)U(f) (15-13) with U(f) n-fold periodic is a general mathematical form for bases for irreducible representations for the Cn group. But wavefunctions, .(f), for a periodic potential must also be bases for such irreducible representations. Therefore, wavefunctions must be expressible in this form: .j (f)=exp(ijf)U(f), j =0,±1,±2, . . . (15-14) We are not claiming that all functions of the form Eq. (15-13) are wavefunctions for the n-fold periodic potential. Rather, we are claiming that all such wavefunctions belong to the class of functions represented by Eq. (15-13). Only certain choices of U(f) will serve to make Eq. (15-13) become a wavefunction for the system, and the choice of an appropriate function U(f) is not necessarily the same at different values of j . To make this dependence on j explicit, we include it as a subscript on U: .j (f)=exp(ijf)Uj (f), j =0,±1,±2, . . . (15-15) with Uj (f) n-fold periodic. This completes the proof of Bloch’s theorem for periodic cycles. Before extending this result to noncyclic systems, a number of comments and clarifications should be made. First, the function Uj (f) may still seem to be something of a mystery. Think of it this way. For benzene p MOs we know we need to create a basis set having two properties: It should be like a 2pp AO near any carbon atom, and it should be the same at each carbon atom. Bloch’s theorem does not comment on whether U should look like a 2pp AO at a carbon atom, but it does require that it be the same at each carbon atom, that it be the same at the midpoints between adjacent carbons, that it be the same at a point one bohr above each carbon atom, etc. In short, U must be symmetric for the six-fold rotation operation. It is up to us to figure out what that six-fold symmetric basis set should look like in detail, and to construct U appropriately. Second, U depends on j . What does this mean? Suppose we compare the lowestand highest-energy p MOs for benzene. The lowest is totally bonding and the highest is totally antibonding. Ordinarily, we use the identical set of 2pp AOs as basis functions for both MOs, but this is not required. Perhaps the pp AOs for the bonding MO would be more appropriately chosen to be slightly larger, so they overlapped better, and maybe those for the antibonding MO should be slightly smaller, to reduce antibonding interactions. These objectives could be achieved, for example, by making U a mixture of 2pp and 3pp AOs on each center and letting the nature of the mixture depend on j . This makes for a more involved variational calculation, so it is normally not included in 536 Chapter 15 Molecular Orbital Theory of Periodic Systems cases like this one where such effects are expected to be small. (3pp character should not mix in to a significant extent because the 3pp AO is considerably higher in energy than the 2pp AO.) However, there are many cases where a unit cell contains several AOs in the same symmetry class and of similar energy (e.g., 2s, 2ps ), and it is then quite important to allow the mix of these to vary with j . A given band, for example, could start out being mainly 2s at low energies and end up being mainly 2ps at the high-energy end. This means that the appropriate description of the function U must be redetermined for each choice of wavenumber j . Third, careful comparison of Eqs. (15-15) and (15-4) shows that they are not exactly the same. Equation (15-15) instructs us to find a periodic functionUj (f) and multiply it by exp(ijf) at every point in f. Think of the sine or cosine related to the exponential and imagine what this means as we multiply it times a 2pp on some carbon. Say the cosine is increasing in value as it sweeps clockwise past the carbon nucleus at 2:00 on a clock face. This produces a product of cosine and 2pp that is unbalanced—smaller toward 1:00 than toward 3:00, because the cosine wave modulates Uj (f) everywhere. But Eq. (15-4) is different. It instructs us to take the value of the cosine at 2:00 and simply multiply the 2pp AO on that atom by that number. The 2pp AO is not caused to become unbalanced. Only its size in the MO is determined by the cosine. Equation (15-4) is called a Bloch sum. Such sums are approximations to Bloch functions, but any errors inherent in this form are likely to be quite small if the basis functions and unit cell are sensibly chosen. (Using Bloch sums is similar in spirit to the familiar procedure of approximating a molecular wavefunction as a linear combination of basis functions.) Extending Bloch’s theorem to linear periodic structures requires identifying an appropriate substitute for the coordinate f and reconsidering the appropriate values for j (which we will call k in noncyclic systems). We require that these substitutions yield eigenfunctions and eigenvalues for the linear system that are the same as what results when it is treated as a cycle. Let us suppose that we have some very large number, n, of unit cells in the cycle, with a repeat distance of a. This means that the cycle has a Cn axis and also a circumference of na, so the linear coordinate (call it x) ranges from 0 to na as the angular coordinate f ranges from 0 to 2p. When we move s steps around the cyclic polymer, the exponential’s argument changes by a factor of 2pijs/n, with j an integer. The same number of steps should give us the same effect on the exponential for the linear polymer. If we choose our exponential’s argument to be of the form ikx, then s steps creates the factor iksa, so we require that iksa = 2pijs/n. This gives k=2pj/na, with j an integer. We see that, for extremely large values of n, k becomes an effectively continuous variable (points separated by 2p/na). Also, we see that k is uniformly distributed (j =0,±1,±2, etc.), which affects our DOS calculations, as was mentioned in Section 15-2. We can also ask about the range of k, which should correspond to the 0–2p range of f. We expect “empty wiggling” to occur when the sin or cos equivalent of exp(ikx) has a wavelength shorter than 2a. The largest nonredundant value of k should come when cos(kx) goes from cos(0) to cos(2p) as x goes from 0 to 2a. Therefore kx=k(2a)=2p, so k has a range of 2p/a. Since the convention is to center k about zero, k runs from -p/a to p/a. (Note that this range becomes infinite for the free particle in a constant potential, for which a is zero.) The range of k over which all unique wavefunctions for the periodic system are produced once and only once (-p/a -1, a>0  8 0 e-ax2 dx = 1 2p/a  8 0 xe-ax2 dx =1/2a  8 0 x2e-ax2 dx = 1 4 p/a3  8 0 x3e-ax2 dx =1/2a2  8 0 x2ne-ax2 dx = 1 · 3 · ··· · (2n-1) 2n+1  p a2n+1  8 0 x2n+1e-ax2 dx = n! 2an+1  8 1 e-axdx = e-a a  1 0 e-axdx =(1/a)(1-e-a)  8 1 xe-axdx =(e-a/a2)(1+a)  1 0 xe-axdx =(1/a2)[1-e-a(1+a)]  8 1 x2e-axdx =(2e-a/a3)(1+a +a2/2)  1 0 x2e-axdx =(2/a3)[1-e-a(1+a +a2/2)]  8 1 xne-axdx =(n!e-a/an+1) n  k=0 ak/k!=An(a) 582 Appendix 1 583  8 y xne-axdx =(n!e-ay/an+1) n  k=0 (ay)k/k!  +1 -1 e-axdx =(1/a)(ea -e-a)  +1 -1 xe-axdx =(1/a2)[ea -e-a -a(ea +e-a)]  +1 -1 xne-axdx =(-1)n+1An(-a)-An(a)  +1 -1 xndx = 0, n=1, 3, 5, . . . 2/(n+1), n=0, 2, 4, . . .  sinx dx=-cos x  cosx dx =sin x  sin2 x dx = x 2 - sin 2x 4  cos2 x dx = x 2 + sin 2x 4  x sinx dx =sin x -x cos x  x cosx dx =cos x +x sin x  x sin2 x dx = x2 4 - x sin 2x 4 - cos 2x 8  x cos2 x dx = x2 4 + x sin 2x 4 + cos 2x 8 Appendix 2 Determinants A determinant is a scalar calculated from an ordered set of elements according to a specific evaluation recipe. The elements are ordered in a square array of rows and columns, bounded at left and right by straight vertical lines. For instance, (A2-1) is a 2×2 determinant:  x -i 2 y2  (A2-1) The recipe for evaluating a 2×2 determinant is: From the product of the elements on the principal diagonal (upper left to lower right) subtract the product of the other two elements. Thus, (A2-1) has the value xy2 +2i. Larger determinants are evaluated by a process that reduces them step by step to a linear combination of smaller determinants until, finally, they are all 2×2’s, which are then evaluated as above. The process of reduction involves the concept of a cofactor. As our example, we use the 4×4 determinant (A2-2), symbolized |M|, whereM is the array of elements within the vertical bars: (A2-2) The elements are numbered so that the first index tells which row, and the second index which column, the element is in. The cofactor of element a11 is defined as the determinant obtained by removing the rowand column containing a11. We see in (A2-2) that striking out row 1 and column 1 gives us a 3×3 determinant (dashed outline) as cofactor of a11. Symbolize this cofactor as |A11|. To evaluate the determinant |M|, we expand in terms of cofactors. We begin by choosing any row or column of M. (We will choose row 1.) Then we write a linear combination containing every element in this row or column times its cofactor: |M|=a11|A11|-a12|A12|+a13|A13|-a14|A14| (A2-3) The sign of each term in the linear combination is determined as follows. lf the sum of row and column indices is even, the sign is plus. If the sum is odd, the sign is minus. Since the indices of a12 and a14 sum to odd numbers, they are minus in (A2-3). 584 Appendix 2 585 The method of expanding in cofactors is successively applied until a large determinant is reduced to 3×3’s or 2×2’s that can be evaluated directly (see Problem A2-1). Thus, a 5 × 5 is first expanded to five 4 × 4’s and each 4 × 4 is expanded to four 3×3’s giving a total of 20 3×3’s. This method becomes extremely clumsy for large determinants. Some useful properties of determinants, symbolized |M|, are stated below without proof. The reader should verify that these are true using 2×2 or 3×3 examples, or by examining Eq. (A2-3). 1. Multiplying every element in one row or one column of M by the constant c multiplies the value of |M| by c. 2. If every element in a row or column of M is zero, then |M|=0. 3. lnterchanging two rows or columns of M to produce M results in M =-|M|; i.e., it reverses the sign of |M|. 4. Adding to any row (column) of M the quantity c times any other row (column) of M does not affect the value of the determinant. 5. If two rows or columns of M differ only by a constant multiplier, then |M|=0. A2-1 Use of Determinants in Linear Homogeneous Equations Suppose that we seek a nontrivial solution for the following set of linear homogeneous equations: a1x +b1y +c1z = 0 (A2-4) a2x +b2y +c2z = 0 (A2-5) a3x +b3y +c3z = 0 (A2-6) Here, x,y, and z are unknown and the coefficients ai, bi, ci are given. Let us collect the coefficients into a determinant |M|, |M|=  a1 b1 c1 a2 b2 c2 a3 b3 c3  (A2-7) As before, let |A1| be the cofacter of a1, etc. Now, multiply Eq. (A2-4) by |A1| (since |A1| is a determinant, it is just a number, and so this is a scalar multiplication), Eq. (A2-5) by -|A2|, and Eq. (A2-6) by |A3| and add the results to get x(a1|A1|-a2|A2|+a3|A3|)+y(b1|A1|-b2|A2|+b3|A3|) +z(c1|A1|-c2|A2|+c3|A3|)=0 (A2-8) The coefficient of x is just |M|. The coefficients of y and z correspond to determinants having two identical rows and hence are zero. Therefore, |M|x =0 (A2-9) 586 Determinants In order for x to be nonzero (i.e., nontrivial) it is necessary that |M|=0. This is a result that is very useful. The condition that must be met by the coefficients of a set of linear homogeneous equations in order that nontrivial solutions exist is that their determinant vanish. A2-2 Problems A2-1. Expand the 3×3 determinant (Fig. PA2-1), by cofactors and showthat this result is equivalent to the direct evaluation of the 3×3 by summing the three products parallel to the main diagonal (solid arrows) and subtracting the three products parallel to the other diagonal (dashed arrows). Figure PA2-1  A2-2. Evaluate (a)  0 1 2 1  (b)  1 1 0 0 1 1 1 1 1  (c)  1 2 0 1 3 0 1 4 1 1 0 1 0 2 1 1  (d)  x 2 1 x  =0 for x A2-3. Verify that the coefficient of y in Eq. (A2-8) is zero. A2-4. Consider the following set of linear homogeneous equations: 4x +2y -z=0, 3x -y -2z=0, 2y +z=0 Do nontrivial roots exist? A2-5. Find a value for c that allows nontrivial solutions for the equations cx -2y +z=0, 4x +cy -2z=0, -8x +5y -cz=0 A2-6. Five properties of determinants have been listed in this appendix. (a) Prove statement (5) is true assuming statements (1)–(4) are true. (b) Demonstrate statements (1)–(4) using simple examples. Appendix 3 Evaluation of the Coulomb Repulsion Integral Over 1s AOs Evaluation of  1s(1)1s(2)(1/r12)1s(1)1s(2)dv(1)dv(2) (A3-1) where 1s(1)=. 3/p exp(-.r) (A3-2) may be carried out in two closely related ways.1 Each method is instructive and sheds light on the other, and so we will give both of them here. The first methodworks from a physical model and requires knowledge of two features of situations governed by the 1/r2 force law (e.g., electrostatics, gravitation). Suppose that there exists a spherical shell in which charge or mass is distributed uniformly, like the soap solution in a soap bubble. The first feature is that a point charge or mass outside the sphere has a potential due to attraction (or repulsion) by the sphere that is identical to the potential produced if the sphere collapsed to a point at its center (conserving mass or charge in the process). Thus, the electrostatic interaction between two separated spherical charge distributions may be calculated as though all the charge were concentrated at their centers. The second feature is that the potential is identical for all points inside the spherical shell; that is, if the core of the earth were hollow, a person would be weightless there. There would be no tendency for that person to drift toward a wall or toward the center. Armed with these facts, we can evaluate the integral. First, we remark that all the functions in the integrand commute. This enables us to write Eq. (A3-1) as  1s2(1)(1/r12)1s2(2)dv(1)dv(2) (A3-3) The functions 1s2(1) and 1s2(2) are just charge clouds for electrons 1 and 2, and the integral is evidently just the energy of repulsion between the clouds. Suppose (see Fig. 3-1) that, at some instant, electron 1 is at a distance r1 from the nucleus. What is its energy of repulsion with the charge cloud of electron 2? The charge cloud of electron 2 can be divided into two parts: the charge inside a sphere of radius r1 and the charge outside that sphere. From what we just said, electron 1 experiences a repulsion due to the cloud inside the sphere that is the same as the repulsion it would feel if that 1Other methods, not discussed here, also exist. See, for example, Margenau and Murphy [1, pp. 382–383]. 587 588 Evaluation of the Coulomb Repulsion Integral Over 1s AOs Charge cloud 1s2 (2) r1 1 Figure A3-1  Sketch of spherical charge cloud 1s2(2) with electron 1 at a distance r1 from the nucleus. part of the cloud were collapsed to the center. We can calculate the energy due to this repulsion (call it the “inner repulsion energy”) by dividing the product of charges by the distance between them: inner repulsion energy = (fraction of charge cloud 2 inside r1)×1/r1 = 4p  r1 0 1s2(2)r2 2 dr2 ×1/r1 (A3-4) where the factor 4p comes from integrating over .2,f2. Electron 1 also experiences repulsion from charge cloud 2 outside the sphere of radius r1. But, from what we said above, electron 1 would experience this same “outer repulsion” no matter where it was inside the inner sphere. Therefore we will calculate the energy due to this repulsion as though electron 1 were at the center, since this preserves spherical symmetry and simplifies the calculation. It follows that all the charge in a thin shell of radius r2 repels electron 1 through an effective distance of r2. Integrating over all such shells gives outer repulsion energy=4p  8 r1 (1/r2)1s2(2)r2 2 dr2 (A3-5) The total energy of repulsion between charge cloud 2 and electron 1 at r1 is the sum of inner and outer repulsive energies. But electron 1 is not always at r1. Therefore, we must finally integrate over all positions of electron 1, weighted by the frequency of their occurrence: repulsive energy=16p2  8 0 1s2(1) (1/r1)  r1 0 1s2(2)r2 2 dr2 +  8 r1 1s2(2)r2dr2r2 1 dr1 (A3-6) Appendix 3 589 Figure A3-2  The inner integrals contain but one variable, r2, and can be evaluated with the help of Appendix 1. After they are performed, the integrand depends only on r1 and this is also easily integrated yielding 5./8 as the result. A positive value is necessary since a net repulsion exists between two clouds of like charge. The second method of evaluation is more mathematical and more general. The function 1/r12 is expressible2 as a series of terms involving associated Legendre functions: 1 r12 = 8 l=0 +l  m=-l (l -|m|)! (l +|m|)! rl< rl+1 > P|m| l (cos .1)P |m| l (cos .2) exp[im(f1 -f2)] (A3-7) This infinite series will give the distance between particles 1 and 2 located at positions r1, .1,f1 and r2, .2,f2 (Fig. A3-2). All we need to do is pick the larger of r1 and r2 and call that r>, the other being r<, and substitute those numbers into the formula. We also need all the Legendre functions for cos .1 and cos .2, and values of exp[im(f1 -f2)] for all integral values of m. Putting all this together as indicated by Eq. (A3-7) would give a series of numbers whose sum would converge to the value of 1/r12. While this is an exceedingly cumbersome way to calculate the distance between two points, it turns out that use of the formal expression (A3-7) enables us to integrate Eq. (A3-1). This comes about because the Legendre functions satisfy the relation  p 0 P|m| l (cos .)P|m| l (cos .) sin. d. = 2 2l +1 (l +|m|)! (l -|m|)! dll . (A3-8) The first Legendre polynomial P0 is equal to unity. Therefore, 1s(i), which has no . dependence, can be written 1s(i)=(. 3/p)1/2 exp(-.ri)P0(cos .i) (A3-9) Thus, in the integral (A3-1) there will be an integration over .1 of the form  p 0 P0(cos .1)P |m| l (cos .1)P0(cos .1) sin .1 d.1 (A3-10) for each term in the sum (A3-7) (and a similar integral over .2). However since 12 =1, [P0(cos .1)]2 =P0(cos .1), and integral (A3-10) becomes [by Eq. (A3-8)]  p 0 P0(cos .1)P |m| l (cos .1) sin .1 d.1 =2d0l (A3-11) 2See Eyring et al. [2, Appendix 5]. 590 Evaluation of the Coulomb Repulsion Integral Over 1s AOs [m must equal zero here, otherwise the integral over f will vanish.] In other words, all terms of the sum over l and m vanish except the first term, for which l =m=0. This gives that the 1/r12 operator is equal to 1/r>. Hence, the repulsion integral is 16p2  8 0  8 0 1s2(1)1s2(2)r2 1 r2 2 r>  dr2dr1 (A3-12) where r> is the greater of r1, r2. Suppose that we integrate over r2 first. As r2 changes value, it is sometimes smaller, sometimes larger than a particular value of r1. When it is larger, r> is r2. When it is smaller, r> is r1. Putting this argument into mathematical form gives 16p2  8 0  r1 0 1s2(1)1s2(2)r2 1 r2 2 dr2 r1 + 8 r1 1s2(1)1s2(2)r2 1 r2 2 dr2 r2 dr1 (A3-13) Since the variable of integration in the two inner integrals is r2, the quantities 1s2(1), r2 1 , and r1 may be brought outside these inner integrals, giving us the same equation (A3-6) that we obtained by the first method. This second method is more generally useful because it can be used when repulsions involving p, d, etc. charge clouds are calculated. In these cases, terms involving l = 1, 2, etc., become nonvanishing, but the series generally truncates after a few terms. References [1] H. Margenau and D. M. Murphy, The Mathematics of Physics and Chemistry. Van Nostrand-Reinhold, Princeton, New Jersey, 1956. [2] H. Eyring, J. Walter, and G. E. Kimball, Quantum Chemistry. Wiley, New York, 1944. Appendix 4 Angular Momentum Rules A4-1 Introduction In Chapters 4, 5, and 7, we deal with the angular momentum of electrons due to their orbital motions and spins and also with angular momentum due to molecular rotation. The conclusions we arrive at are based partly on physical arguments related to experience with macroscopic bodies. While this is the most natural way to introduce such concepts, it is not the most rigorous. In this appendix we shall demonstrate how use of the postulates discussed in Chapter 6 leads to all of the relationships we have introduced earlier. The approach is based entirely on the mathematical properties of the relevant operators, making no appeal to physical models. A4-2 The Classical Expressions for Angular Momentum According to postulate II (Section 6-3), we must first find the classical physical expressions for the quantity of interest in terms of x, y, z,px,py,pz , and t . We know that the angular momentum for an object of mass m moving with velocity v in a circular orbit of radius r is L=mr×v=r×p (A4-1) where the × symbol means we are taking a cross-product of vectors. To examine this in detail, we must resolve r and p into x,y, and z components: r =ix +jy +kz (A4-2) p=mv=midx dt +mjdy dt +mkdz dt =ipx +jpy +kpz (A4-3) Here i, j,k are unit vectors pointing, respectively, along the x, y, z Cartesian axes. A cross product is taken by expanding the following determinant: L=r×p=  i j k x y z px py pz  (A4-4) to give L=i(ypz -zpy)+j(zpx -xpz)+k(xpy -ypx) (A4-5) 591 592 Angular Momentum Rules The coefficient for i is the magnitude of the x-component of angular momentum, etc. That is, L=Lxi+Lyj+Lzk (A4-6) where we now see that Lx =ypz -zpy (A4-7) Ly =zpx -xpz (A4-8) Lz =xpy -ypx (A4-9) (The simple cyclic x, y, z relationships among these formulas can be used as a memory aid.) We will also be interested in the square of the angular momentum, L2: L2 =L ·L=(Lxi+Lyj+Lzk)(Lxi+Lyj+Lzk)=L2x +L2y +L2z (A4-10) where we have used the orthonormality of the unit vectors (i · i=1, i · j=0, etc). Note that L2 is a magnitude, not a vector. This gives the classical expressions we need for the x,y, and z components of angular momentum and for the square of its magnitude. A4-3 The Quantum-Mechanical Operators Postulate II tells us what to do next: Replace px with ( ¯h/i)./.x, and similarly for py,pz . The resulting operators are: Lˆx =-ih¯ y . .z -z . .y  (A4-11) Lˆy =-ih¯ z . .x -x . .z  (A4-12) Lˆz =-ih¯ x . .y -y . .x  (A4-13) and Lˆ2 =Lˆ2x +Lˆ2y +Lˆ2z (A4-14) All of these operators are hermitian. Now that we have the quantum-mechanical operators, we are free to transform them to other coordinate systems. In spherical coordinates, they are1 Lˆx =ih¯ sin f . .. +cot . cosf . .f  (A4-15) Lˆy =-ih¯ cosf . .. -cot . sin f . .f  (A4-16) 1See Eyting et al. [1, p. 40.] Appendix 4 593 Lˆz =-ih¯ . .f (A4-17) Lˆ2=-¯h2  .2 ..2 +cot . . .. + 1 sin2 . .2 .f2  =-¯h2  1 sin . . .. sin . . .. + 1 sin2 . .2 .f2  (A4-18) In atomic units, the quantity ¯h becomes unity and does not appear. We will use atomic units henceforth in this appendix. A4-4 Commutation of Angular Momentum Operators with Hamiltonian Operators and with Each Other We have indicated (Chapter 4) that a rotating classical system experiencing torque maintains E,Lz , and |L| (or, equivalently, L2) as constants of motion, but not Lx or Ly. We might anticipate a similar situation in quantum mechanics. This would mean that a state function . would be an eigenfunction for Hˆ ,Lˆ2, and Lˆz but not for Lˆx or Lˆy. This in turn requires that Hˆ ,Lˆ2, and Lˆz commute with each other, but that Lˆx and Lˆy do not commute with all of them. We first consider Hˆ and Lˆ2. Note that Lˆ2 appears in .2 [compare Eq. (A4-18) with (4-7)]: .2 = 1 r2 . .r r2 . .r - 1 r2 ˆ L2 (A4-19) Also, since L2 does not contain the variable r,Lˆ2 commutes with any function depending only on r. Since Lˆ2 must also commute with itself, it follows that Lˆ2 and .2 commute, i.e., that Lˆ2,.2=0 (A4-20) If V in a hamiltonian operator is a function of r only, then Lˆ2,Hˆ =0 (A4-21) This proves that Hˆ for a hydrogenlike ion commutes with Lˆ2. From the expressions for Lˆ2 and Lˆz in spherical polar coordinates, it is obvious that Lˆz ,Lˆ2=0 (A4-22) Lˆz does not operate on functions of r, and so Lˆz commutes with .2. For a spherically symmetric system, V =V (r) and we have that Lˆz ,Hˆ =0 (A4-23) Thus, we have shown that Hˆ ,Lˆ2,Lˆz all commute in a system having a spherically symmetric potential. 594 Angular Momentum Rules Now let us test Lˆx and Lˆy with each other and also with Lˆ2. Lˆx,Lˆy=LˆxLˆy -LˆyLˆx (A4-24) We will examine this using the Cartesian system. We write out each operator product and subtract: LˆxLˆy =-y . .z -z . .y z . .x -x . .z  =-y . .x +yz .2 .z.x -z2 .2 .y.x -yx .2 .z2 +zx .2 .y.z  (A4-25) LˆyLˆx =-zy .2 .x.z -xy .2 .z2 -z2 .2 .x.y +x . .y +xz .2 .z.y  (A4-26) Subtracting (A4-26) from (A4-25): Lˆx,Lˆy=-y . .x -x . .y =x . .y -y . .x =iLˆz (A4-27) A similar approach to other operator combinations gives Lˆy,Lˆz=iLˆx (A4-28) Lˆz ,Lˆx=iLˆy (A4-29) (Notice the x, y, z cyclic relation.) These commutation relations do not depend on choice of coordinate system. Use of the r, ., f coordinate system would give the same results. Evidently, these operators do not commute with each other since their commutators are unequal to zero. From this point we will dispense with the carat symbol, since the context of the discussion makes it obvious that we are referring to operators. We still have not checked L2 with Lx and Ly. We will now show that Lx,L2=Lx, (L2x +L2y +L2z )=0 (A4-30) We proceed by finding the commutator of Lx with L2x ,L2y , and L2z individually. It is obvious that the first of these, Lx,L2x equals zero. The second can be evaluated as follows. LxLy -LyLx =iLz (from A4-27) (A4-31) We multiply (A4-31) from the left by Ly: LyLxLy -L2y Lx =iLyLz (A4-32) We multiply (A4-31) from the right by Ly: LxL2y -LyLxLy =iLzLy (A4-33) Appendix 4 595 We sum (A4-32) and (A4-33): LxL2y -L2y Lx =Lx,L2y =i(LyLz +LzLy) (A4-34) A similar strategy, starting with (A4-29), multiplying from left and right by Lz , and summing, yields L2z Lx -LxL2z =-Lx,L2z =i(LzLy +LyLz) (A4-35) Equation A4-34 minus A4-35 plus zero (from Lx,L2x ) is equal to (A4-30) and is easily seen to equal zero. A similar proof gives Ly,L2=0 (A4-36) It follows at once that, since Lx and Ly operate only on . and f and since .2 contains all . and f terms in the form of L2: Lx,.2=Ly,.2=0 (A4-37) If the potential V for a system is independent of . and f, then Lx,H=Ly,H=0 (A4-38) We have shown that H for an atom and L2 commute with each other and also with Lx,Ly, and Lz , but that the latter three operators do not commute with each other. Therefore, we can say that there exists a set of simultaneous eigenfunctions for H,L2, and one ofLx,Ly,Lz , but not the other two. We chooseLz to be the privileged operator. This means that an atom can exist in states having “sharp” values of energy, magnitude of angular momentum (hence square of L), and z component of angular momentum, but not x or y components. H and L2 will commute if V in H is independent of . and f (central field potential). H and Lz will commute if V is independent of f, even if it is dependent on .. This is the case for any linear system. Therefore, ML continues to be a sharp quantity for linear molecules like H2 or C2H2, but total angular momentum value does not, because L2 does not commute with H, so L is not a good quantum number. This is why the main term symbol for a linear molecule is based onML, whereas the main term symbol for an atom is based on L. A4-5 Determining Eigenvalues for L2 and Lz We will now make use of our operator commutation relations to determine the nature of the eigenvalues for L2 and Lz . We begin by defining two new operators: L+ =Lx +iLy (A4-39) L- =Lx -iLy (A4-40) These are called “step-up” and “step-down” operators, respectively, or “raising” and “lowering” operators. They correspond to no observable property and are not hermitian. 596 Angular Momentum Rules They have been devised solely because they are useful in formal analysis of the sort we are doing here. The reason for their names will become apparent soon. We now prove the following theorem for the step-up operator L+: LzL+ =L+(Lz +1) (A4-41) Expanding L+ gives LzL+ =Lz(Lx +iLy)=LzLx +iLzLy (A4-42) We now add zero in the form [see Eq. A4-29] -(LzLx -LxLz -iLy)=0 (A4-43) to obtain LzL+ =LxLz +iLy +iLzLy (A4-44) We again add zero, this time in the form [see Eq. A4-28] iLyLz -iLzLy +Lx =0 (A4-45) This results in LzL+ =LxLz +iLy +iLyLz +Lx (A4-46) Rearranging, LzL+ =(Lx +iLy)Lz +Lx +iLy =(Lx +iLy)(Lz +1)=L+(Lz +1) (A4-47) This proves Eq. A4-41. An analogous procedure proves the analogous relation for the step-down operator L-: LzL- =L-(Lz -1) (A4-48) Our ultimate goal is to discover the nature of the eigenvalues of L2 and Lz . Since these two operators commute, there must exist a common set of eigenfunctions for them. Let us symbolize these simultaneous eigenfunctions of L2 and Lz with the symbol Y . A given eigenfunction Y will be associated with an eigenvalue for L2 and a (possibly) different eigenvalue for Lz . To keep track of these, we use subscript labels l and m, defined in the following manner: L2Yl,m =klYl,m (A4-49) LzYl,m =kmYl,m (A4-50) Nowwe make use of our “ladder” operators. It is not difficult to show(ProblemA4-1) that LzL+Yl,m =(km +1)L+Yl,m (A4-51) This shows that operating on Yl,m with the step-up operator L+ produces a new function and that this new function is also an eigenfunction of Lz . Furthermore, the new Appendix 4 597 eigenfunction has an eigenvalue that is greater by one than the eigenvalue (km) of the original eigenfunction. This is the reason L+ is called a step-up operator. The analogous relation for L- is LzL-Yl,m =(km -1)L-Yl,m (A4-52) The import of Eqs. (A4-51) and (52) is that a set of eigenfunctions for Lz exists with eigenvalues separated by unity. We now use the ladder operators in another pair of useful operator relations (Problem A4-2): L2 =L+L- +L2z -Lz (A4-53) L2 =L-L+ +L2z +Lz (A4-54) It is also possible to show (Problem A4-3) that L+Yl,m =C+Yl,m+1 (A4-55) L-Yl,m =C-Yl,m-1 (A4-56) where C+ and C- are constants. Equations A4-55 and A4-56 tell us that the ladder operators change the functions Y in a manner such that their eigenvalues for L2 do not change. We will now show that kl =k2m (A4-57) We begin with the obvious relation (L2 -L2z )Yl,m =(kl -k2m )Yl,m (A4-58) But, from Eq. A4-14, L2 -L2z =L2x +L2y (A4-59) and so (L2x +L2y )Yl,m =(kl -k2m )Yl,m (A4-60) Equation A4-60 is remarkable because it indicates that the functions Yl,m (which are not eigenfunctions of Lx or Ly, since these do not commute with Lz ) are nevertheless eigenfunctions for the combination L2x +L2y . We will now show that the eigenvalues of Eq. A4-60 must be positive definite, which suffices to prove that kl =k2m . We proceed by recognizing that the operators Lx and Ly do possess eigenfunctions, though they are different from the functions Yl,m. Let us symbolize them f and g, respectively. Then Lxfi =xifi (A4-61) Lygi =yigi (A4-62) Because Lx and Ly are hermitian operators, x and y are real numbers and the function sets {f } and {g} are complete and can be assumed to be orthonormal. We can therefore expand the functions Y in terms of either set: Yl,m = i cl,m i fi = j dl,m j gj (A4-63) 598 Angular Momentum Rules Substituting Eqs. A4-63 into A4-60 and operating: (kl -k2m )Yl,m =(L2x +L2y )Yl,m =L2x Yl,m +L2y Yl,m =L2x  i cl,m i fi +Ly2 j dl,m j gj = i cl,m i x2 i fi + j dl,m j y2 j gj (A4-64) We can isolate kl -k2m by multiplying from the left by (Yl,m)* and integrating. On the left: (Yl,m)*(kl -k2m )Yl,mdv =(kl -k2m ) (Yl,m)*Yl,mdv =(kl -k2m ) (A4-65) On the right:  p (cl,m p fp)* i cl,m i x2 i fi dv +  q (dl,m q gq )* j dl,m j y2 j gjdv = p  i (cl,m p )*cl,m i x2 i (fp)*fi dv + q  j (dl,m q )*dl,m j y2 j (gq )*gj dv (A4-66) But {f } and {g} are orthonormal sets, and so Eq. (A4-65-66) becomes (kl -k2m )= i |cl,m i |2x2 i + j |dl,m j |2y2 i (A4-67) Since x and y are real, neither sum has negative terms, so (kl -k2m )=0. (If we resort to a physical argument, we see the reasonableness of this since L2x +L2y is the square of the angular momentum projection in the x,y plane.) Knowing that kl = k2m , we now consider the implications of applying the step-up operator many times to Yl,m and then operating with Lz : LzL+L+L+ ···L+Yl,m =(km +1+1+1 ···+1)Yl,m+1+1+1···+1 =kmYl,m (A4-68) Eventually we will get a km that is too large to satisfy k2 m =kl . At that point the series must terminate, which means that L+Yl,m = 0. Let us call the maximum-value km reached in this way kmax. Use of L- likewise gives us a lowest possible value of km which we call kmin. The corresponding eigenfunctions are labeled Yl,max and Yl,min. Since L+Yl,max =0 (A4-69) it follows that L-L+Yl,max =0 (A4-70) But, substituting for L+L- from Eq. A4-54, this can be written (L2 -L2z -Lz)Yl,max =0 (A4-71) Appendix 4 599 so k2 l -k2 max -kmax =0 (A4-72) or k2 l =k2 max +kmax (A4-73) A similar treatment on L-L+Yl,min yields k2 l =k2 min -kmin (A4-74) From these two expressions for kl, we have k2 min -k2 max =kmin +kmax (A4-75) This relation can be satisfied only if kmax=-kmin (A4-76) What has been shown is that, for functions Yl,?, the eigenvalue of L2, kl , determines the maximum value of km through kl =k2 max +kmax, that the minimum value of km is -kmax, and that intermediate values of km are given by kmax -1, kmax -2, etc. If we let the value of kmax be symbolized by l, then we have that kl =l2 +l =l(l +1), so that L2Yl,m =l(l +1)Yl,m (A4-77) If we let the value of km be symbolized by m, then LzYl,m =mYl,m (A4-78) For a given value of l,m can take on the values m=l, l -1, l -2, . . . ,-l +1,-l (A4-79) There are only two possible scenarios for such a set of numbers m. One set is the integers; e.g., if l = 3,m = 3, 2, 1, 0,-1,-2,-3. The other set is the half-integers; e.g., if l =3/2,m=3/2, 1/2,-1/2,-3/2. Either way, there are 2l +1 allowed values of m. All of the above relations have been derived from the commutation relations for the angular momentum operators. They hold for: Electron orbital angular momentum L2, Lz Electron spin angular momentum S2, Sz Resultant of orbital and spin momenta J 2, Jz Molecular rotational angular momentum J 2, Jz Nuclear spin angular momentum I 2, Iz If we seek analytical expressions for the functions Yl,m, we can start with the assumption that they are separable into products of two kinds of functions, one type depending only on ., the other only on f: Yl,m =(.)(f) (A4-80) 600 Angular Momentum Rules Since Lz =-i./.., since LzYl,m =mYl,m, and furthermore since Yl,m must be single valued, it follows that (f)=(1/v2p)exp(imf), with m=0,±1,±2, . . . . Note that m must be an integer. We have the curious result that, if Yl,m is separable into . and f parts, the eigenvalues for Lz cannot belong to the half-integer set mentioned above. The separable, analytic functions Yl,m with l an integer are the spherical harmonics described in Chapter 4 and long known to classical physics. Indeed, the symbol Yl,m has come to stand for those functions. For systems involving half-integer l and m values, no analytical functions of the usual sort can be written. Instead, matrices and vectors are used that manifest the correct relationships. Thus, for the spin of a single electron, Pauli used the following representation: a = 1 0 , ß= 0 1 , a|a= 1 0 1 0 =1 Sz = 1 2 1 0 0 -1 , Sx = 1 2 0 1 1 0 , Sy = 1 2 0 -i i 0 A4-5.1 Problems A4-1. Prove Equation A4-51. A4-2. Prove Equation A4-53. A4-3. Prove Equation A4-55. What does this equation imply about L2 and L+? A4-4. Evaluate Yl,m|Lx|Yl,m using only operator relations from this appendix. A4-5. Demonstrate that the Pauli spin matrices and vectors satisfy the following relations: a) a|ß=0 b) S+ß =a c) S+a =0 d) [Sx, Sy]=iSz A4-6. Use the Pauli spin matrices to evaluate S2a. Reference [1] H. Eyring, J. Walter, and G. E. Kimball, Quantum Chemistry. Wiley, New York, 1944. Appendix 5 The Pairing Theorem1 A5-1 Pairing of Roots and Relation Between Coefficients for Alternant Systems The HMO assumptions are Hij = ... .. a if i =j ß if i =j are bonded together 0 ifi =j are not bonded together (A5-1) Sij = dij (A5-2) fk = i cik.i (A5-3) where .i is a 2pp AO on carbon i. An alternant hydrocarbon can be labeled with asterisks to demonstrate the existence of two sets of carbon centers in the molecule such that no two atoms in the same set are nearest neighbors (see Section 8-9). The following discussion pertains to alternant systems. The simultaneous equations leading to H¨uckel energies and coefficients are of the form cix +cj +ck +cl +···=0 (A5-4) where x = (a -E)/ß. Atoms j, k, l must be bonded to atom i if cj, ck, cl are to be unequal to zero. Hence, atoms j, k, and l belong to one set of atoms, and atom i belongs to the other set. If we have already found a value of x and a set of coefficients satisfying the simultaneous equations (A5-4), it is easy to show that these equations will also be satisfied if we insert -x and also reverse the signs of the coefficients for one set of centers or the other. If we reverse the coefficient signs for the set j, k, l, we obtain, on the left-hand side ci(-x)-cj -ck -cl -··· (A5-5) which is the negative of Eq. A5-4 and hence still equals zero. If we reverse the sign of ci, we have -ci(-x)+cj +ck +cl +··· (A5-6) which is identical to Eq. A5-4. 1See Coulson and Rushbrooke [1]. 601 602 The Pairing Theorem This proves that each root of an alternant hydrocarbon at x(=0) has a mate at -x and that their associated coefficients differ only in sign between one or the other sets of atoms. Note that, if x =0, the ci term vanishes, leaving coefficients for only one set of centers. Reversing all signs in this case corresponds to multiplying the entire MO by -1, which does not generate a new (linearly independent) function. Thus, it is possible for an alternant system to have a single, unpaired root at x =0. It is necessary for odd alternants to have such a root. An even alternant may have a root at x =0, but, if it has one such root, it must have another since, in the end, there must be an even number of roots. A5-2 Demonstration That Electron Densities Are Unity in Ground States of Neutral Alternant Hydrocarbons From n AOs result n MOs. The AOs as well as the MOs can be assumed normalized and (in the HMO method) orthogonal. If each AO contains one electron, the electron density at eachAO is unity. If theseAOs are combined to form MOs, and each MO has one electron, each AO electron density would still be unity, since the set of all singly occupied MOs is just a unitary transformation of the set of all singly occupied AOs. (The matrix equivalent of these statements is C†C=1=CC† (A5-7) The left equality means that the sum of squares (absolute) of coefficients over all atoms in one MO is unity, so the MO is normalized. The right equality means that the sum of squares of coefficients over one atom in all MOs is unity.) For an alternant hydrocarbon, however, the squares of coefficients in an MO at E =a +kß are identical to those in the MO at E =a -kß. Therefore, no change in electron density will result if each electron in the upper half of ourMOenergy spectrum is shifted to its lower-energy mate. Thus, the resulting state, which is the neutral ground state, still has unit electron density at each AO. A5-3 A Simple Method for Generating Nonbonding MOs An immediate consequence of Eq. A5-4 is that the coefficients for any nonbondingMO, for which x =0 by definition, satisfy the following simple rule: The coefficients on all the atoms attached to any common atom sum to zero. Consider the nonbonding MOs below: 0 0 0 0 0 – – – – It is clear that, no matter which atom one chooses as reference, the sum of coefficients of attached atoms is zero. Appendix 5 603 It is possible to use this observation to generate nonbonding MOs without the aid of a computer or tabulation. For odd-alternant systems such an MO is guaranteed to exist, and the recipe is especially easy, so we start with these. The procedure is: 1. Divide the centers into asterisked and unasterisked sets, as described in Chapter 8. One set will have fewer centers. Set all of the coefficients in this set equal to zero. 2. Choose one of the nonzero sites and set its coefficient to be x. Then work around the molecule, setting other coefficients to values needed to satisfy the “sum to zero” rule. 3. When finished, evaluate x by requiring that the sum of squares of coefficients equal unity. For example, consider the naphthyl system: 8 7 5 6 4 3 2 9 1 10 11 * * * * * * The asterisked centers are more numerous, so we set the others to zero. 0 0 0 0 0 Now we set one of the nonzero sites equal to x. Let us use the one marked with an arrow. Then we move around the molecule, setting the other values. 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 x x –x –x –x –x –x –x x x x x 2x 2x –3x Now we set the sum of squares to unity: 17x2 =1;x =0.242. The final nonbonding MO is –0.242 –0.727 –0.242 0.242 0.242 0.484 0 0 0 0 0 If an even alternant has a nonbonding MO, then it must have a pair of them. Each one corresponds to setting a different subset of coefficients equal to zero. This simple recipe would be little more than a parlor trick were it not for the fact that the nonbonding MO is often very important in determining a molecule’s chemical or physical properties. For example, the neutral molecule used above as an example is alternant, hence has p-electron densities of one at every carbon. However the spin density is controlled by the nonbonding MO, since that is where the unpaired electron is, so the ESR splitting pattern should correspond to a series of coupling constants proportional to the squares of these nonbonding MO coefficients (if we ignore negative spin density). If the neutral radical is ionized to the cation, the electron is lost from the nonbonding MO, and so the deficiency of electronic density (i.e., the positive charge) appears on 604 The Pairing Theorem carbons with nonzero coefficients, in proportion to the squares of the coefficients. In the case of our previous example, the cation charge distribution is predicted to be +0.06 +0.23 +0.53 +0.06 +0.06 +0.06 0 0 0 0 0 If we were to modify the molecule chemically by attaching a methyl group (which donates p electron charge via hyperconjugation) or by substituting for carbon a more electronegative nitrogen atom, we could hope to influence the ease of ionization. But it is apparent that such modifications will have greatest effect if they occur at sites where the largest changes in electron density occur upon ionization, i.e., at positions 11 and 1. Modifying the molecule at a position where a zero coefficient exists in the nonbonding MO (i.e., positions 4, 5, or 7) should have little effect on the ease of carbocation formation. Reference [1] C. A. Coulson and G. S. Rushbrooke, Proc. Cambridge Phil. Soc. 36, 193 (1940). Appendix 6 H¨uckel Molecular Orbital Energies, Coefficients, Electron Densities, and Bond Orders for Some Simple Molecules Each molecule is labeled as alternant or nonalternant. For alternants, only the occupied MO data are tabulated since the remainder may be generated by use of the pairing theorem (see Appendix 5). Bond orders are tabulated for only one bond from each symmetry-equivalent set in a molecule. x HMO root =(a -E)/ß n number of electrons in MO when molecule is in neutral ground state ci LCAO-MO coefficient of AO at atom i qi p-electron density on atom i pij p-bond order between atoms i and j Ep total p energy of the molecule=sum of p-electron energies Molecules in this tabulation are grouped according to the number of centers in the conjugated system. (In all cases, it is assumed that the system is planar and undistorted which, in some cases, is not correct.) 605 606 H¨uckel Molecular Orbitals Index of System Tabulated Butadiene-2,3-bimethyl Benzene Fulvene Heptatrienyl Benzyl Cycloheptatrienyl Cyclooctatetraene Octatetraene Benz-cyclopentadienyl Azulene Hexatriene Naphthalene Cyclobutadienyl methyl Cyclopentadienyl Pentadienyl Methylene cyclopropene 2-Allylmethyl Cyclobutadiene Butadiene Cyclopropenyl Allyl Ethylene Appendix 6 607 Two Centers Ethylene (alternant) C2H2 all q =1.0,p12 =1.0, Ep =2a +2ß n x c1 c2 2 -1.000 0.7071 0.7071 Three Centers Allyl radical (alternant) C3H5 all q =1.0,p12 =0.707, Ep =3a +2.8284ß n x c1 c2 c3 2 -1.4142 0.5000 0.7071 0.5000 1 0.0000 0.7071 0.0000 -0.7071 Cyclopropenyl radical (nonalternant) C3H3 all q =1.0,p12 =0.5, Ep =3a +3.0000ß n x c1 c2 c3 2 -2.0000 0.5774 0.5774 0.5774 1 2 1.0000 -0.8165 0.4082 0.4082 1 2 1.0000 0.0000 0.7071 -0.7071 608 H¨uckel Molecular Orbitals Four Centers Butadiene (alternant) C4H6 all q =1.0,p12 =0.8944,p23 =0.4472, Ep =4a +4.4721ß n x c1 c2 c3 c4 2 -1.6180 0.3718 0.6015 0.6015 0.3718 2 -0.6180 0.6015 0.3718 -0.3718 -0.6015 Cyclobutadiene (alternant) C4H4 all q =1.0,p12 =0.5,Ep =4a +4.000ß n x c1 c2 c3 c4 2 -2.0000 0.5000 0.5000 0.5000 0.5000 1 0.0000 0.5000 0.5000 -0.5000 -0.5000 1 0.0000 0.5000 -0.5000 -0.5000 0.5000 2-Allylmethyl (alternant) C4H6 all q =1.0,p12 =0.5774,Ep =4a +3.4641ß n x c1 c2 c3 c4 2 -1.7320 0.7071 0.4082 0.4082 0.4082 1 0.0 0.0000 0.7071 -0.7071 0.0000 1 0.0 0.0000 0.4082 0.4082 -0.8165 Methylene cyclopropene C4H4 (nonalternant) Ep =4a +4.9624ß,p12 =0.4527, p23 =0.8176,p14 =0.7583 n x c1 c2 c3 c4 2 -2.1701 0.6116 0.5227 0.5227 0.2818 2 -0.3111 0.2536 -0.3682 -0.3682 0.8152 0 1.0000 0.0000 0.7071 -0.7071 0.0000 0 1.4812 0.7494 -0.3020 -0.3020 -0.5059 qi = 0.8768 0.8176 0.8176 1.4881 Appendix 6 609 Five Centers Pentadienyl radical (alternant) C5H7 all q =1.0,p12 =0.7887,p23 =0.5774, Ep =5a +5.4641ß n x c1 c2 c3 c4 c5 2 -1.7320 0.2887 0.5000 0.5774 0.5000 0.2887 2 -1.0000 0.5000 0.5000 0.0000 -0.5000 -0.5000 1 0.0000 0.5774 0.0000 -0.5774 0.0000 0.5774 Cyclopentadienyl radical (nonalternant) C5H5 all q =1.0,p12 =0.5854,Ep =5a +5.8541ß n x c1 c2 c3 c4 c5 2 -2.0000 0.4472 0.4472 0.4472 0.4472 0.4472 3/2 -0.6180 0.6325 0.1954 -0.5117 -0.5117 0.1954 3/2 -0.6180 0.0000 -0.6015 -0.3718 0.3718 0.6015 0 1.6180 0.6325 -0.5117 0.1954 0.1954 -0.5117 0 1.6180 0.0000 0.3718 -0.6015 0.6015 -0.3718 Cyclobutadienylmethyl radical (alternant) C5H5 all q =1.0,p12 =0.3574,p23 =0.6101 p15 =0.8628,Ep =5a +5.5959ß n x c1 c2 c3 c4 c5 2 -2.1358 0.5573 0.4647 0.4351 0.4647 0.2610 2 -0.6622 -0.4351 0.1845 0.5573 0.1845 -0.6572 1 0.0000 0.0000 -0.7071 0.0000 0.7071 0.0000 610 H¨uckel Molecular Orbitals Six Centers Hexatriene (alternant) C6H8 all q =1.0,p12 =0.8711,p23 =0.4834, p34 =0.7848,Ep =6a +6.9879ß n x c1 c2 c3 c4 c5 c6 2 -1.8019 0.2319 0.4179 0.5211 0.5211 0.4179 0.2319 2 -1.2470 0.4179 0.5211 0.2319 -0.2319 -0.5211 -0.4179 2 -0.4450 0.5211 0.2319 -0.4179 -0.4179 0.2319 0.5211 Butadiene-2,3-bimethyl (alternant) C6H8 all q =1.0,p12 =0.6667,p23 =0.3333,Ep =6a +6ß n x c1 c2 c3 c4 c5 c6 2 -2.0000 0.2887 0.5774 0.5774 0.2887 0.2887 0.2887 2 -1.0000 0.4082 0.4082 -0.4082 -0.4082 0.4082 -0.4082 2 0.0000 -0.5000 0.0000 0.0000 -0.5000 0.5000 0.5000 Benzene (alternant) C6H6 all q =1.0,p12 =0.6667,Ep =6a +8ß n x c1 c2 c3 c4 c5 c6 2 -2.0000 0.4082 0.4082 0.4082 0.4082 0.4082 0.4082 2 -1.0000 0.0000 0.5000 0.5000 0.0000 -0.5000 -0.5000 2 -1.0000 0.5774 0.2887 -0.2887 -0.5774 -0.2887 0.2887 Fulvene (nonalternant) C6H6 Ep =6a +7.4659ß,p12 =0.7779,p23 =0.5202,p45 =0.4491, p56 =0.7586 n x c1 c2 c3 c4 c5 c6 2 -2.1149 0.4294 0.3851 0.3851 0.4294 0.5230 0.2473 2 -1.0000 0.0000 0.5000 0.5000 0.0000 -0.5000 -0.5000 2 -0.6180 0.6015 0.3718 -0.3718 -0.6015 0.0000 0.0000 0 0.2541 -0.3505 0.2795 0.2795 -0.3505 -0.1904 0.7495 0 1.6180 -0.3718 0.6015 -0.6015 0.3718 0.0000 0.0000 0 1.8608 -0.4390 0.1535 0.1535 -0.4390 0.6635 -0.3566 qi = 1.0923 1.0730 1.0730 1.0923 1.0470 0.6223 Appendix 6 611 Seven Centers Heptatrienyl radical (alternant) C7H9 all q =1.0,Ep =7a +8.0547ß p12 =0.8155,p23 =0.5449,p34 =0.6533 n x c1 c2 c3 c4 c5 c6 c7 2 -1.8478 0.1913 0.3536 0.4619 0.5000 0.4619 0.3536 0.1913 2 -1.4142 0.3536 0.5000 0.3536 0.0000 -0.3536 -0.5000 -0.3536 2 -0.7654 0.4619 0.3536 -0.1913 -0.5000 -0.1913 0.3536 0.4619 1 0.0000 -0.5000 0.0000 0.5000 0.0000 -0.5000 0.0000 0.5000 Benzyl radical (alternant) C7H7 all q =1.0,p12 =0.5226,p23 =0.7050, p34 =0.6350,p17 =0.6350,Ep =7a +8.7206ß n x c1 c2 c3 c4 c5 c6 c7 2 -2.1010 0.5000 0.4063 0.3536 0.3366 0.3536 0.4063 0.2380 2 -1.2593 -0.5000 -0.1163 0.3536 0.5615 0.3536 -0.1163 -0.3970 2 -1.0000 0.0000 0.5000 0.5000 0.0000 -0.5000 -0.5000 0.0000 1 0.0000 0.0000 -0.3780 0.0000 0.3780 0.0000 -0.3780 0.7560 Cycloheptatrienyl radical (nonalternant) C7H7 all q =1.0,p12 =0.6102,Ep =7a +8.5429ß n x c1 c2 c3 c4 c5 c6 c7 2 -2.0000 0.3780 0.3780 0.3780 0.3780 0.3780 0.3780 0.3780 2 -1.2470 -0.5345 -0.3333 0.1189 0.4816 0.4816 0.1189 -0.3333 2 -1.2470 0.0000 -0.4179 -0.5211 -0.2319 0.2319 0.5211 0.4179 1 2 0.4450 0.5345 -0.1189 -0.4816 0.3333 0.3333 -0.4816 -0.1189 1 2 0.4450 0.0000 0.5211 -0.2319 -0.4180 0.4180 0.2319 -0.5211 0 1.8019 -0.5345 0.4816 -0.3333 0.1189 0.1189 -0.3333 0.4816 0 1.8019 0.0000 0.2319 -0.4179 0.5211 -0.5211 0.4179 -0.2319 612 H¨uckel Molecular Orbitals Eight Centers Octatetraene (alternant) C8H10 all q =1.0,p12 =0.8621,p23 =0.4948, p34 =0.7581,p45 =0.5288,Ep =8a +9.5175ß n x c1 c2 c3 c4 c5 c6 c7 c8 2 -1.8794 0.1612 0.3030 0.4082 0.4642 0.4642 0.4082 0.3030 0.1612 2 -1.5321 0.3030 0.4642 0.4082 0.1612 -0.1612 -0.4082 -0.4642 -0.3030 2 -1.0000 -0.4082 -0.4082 0.0000 0.4082 0.4082 0.0000 -0.4082 -0.4082 2 -0.3473 0.4642 0.1612 -0.4082 -0.3030 0.3030 0.4082 -0.1612 -0.4642 Cyclooctatetraene (alternant) C8H8 all q =1.0,p12 =0.6035,Ep =8a +9.6568ß n x c1 c2 c3 c4 c5 c6 c7 c8 2 -2.0000 0.3536 0.3536 0.3536 0.3536 0.3536 0.3536 0.3536 0.3536 2 -1.4142 0.3536 0.0000 -0.3536 -0.5000 -0.3536 0.0000 0.3536 0.5000 2 -1.4142 0.3536 0.5000 0.3536 0.0000 -0.3536 -0.5000 -0.3536 0.0000 1 0.0000 0.3536 0.3536 -0.3536 -0.3536 0.3536 0.3536 -0.3536 -0.3536 1 0.0000 0.3536 -0.3536 -0.3536 0.3536 0.3536 -0.3536 -0.3536 0.3536 Nine Centers Benzcyclopentadienyl radical (nonalternant) C9H7 p12 =0.6592,p49 =0.6071,p56 =0.6630, p18 =0.4790,p45 =0.6363,p89 =0.5118, Ep =9a +11.8757ß n x c1 c2 c3 c4 c5 c6 c7 c8 c9 2 -2.3226 0.3203 0.2758 0.3203 0.2988 0.2259 0.2259 0.2988 0.4681 0.4681 2 -1.5450 -0.3114 -0.4031 -0.3114 0.2689 0.4934 0.4934 0.2689 -0.0780 -0.0780 2 -1.1935 0.2992 0.0000 -0.2992 -0.4841 -0.2207 0.2207 0.4841 0.3571 -0.3571 2 -0.7293 -0.2054 -0.5634 -0.2054 0.0935 -0.3454 -0.3454 0.0935 0.4136 0.4136 1 -0.2950 0.5428 0.0000 -0.5428 0.3355 0.2591 -0.2591 -0.3355 0.1601 -0.1601 0 0.9016 0.1548 -0.3434 0.1548 -0.5424 0.2852 0.2852 -0.5424 0.2038 0.2038 0 1.2950 -0.2591 0.0000 0.2591 -0.1601 0.5428 -0.5428 0.1601 0.3355 -0.3355 0 1.6952 0.4840 -0.5711 0.4840 0.1884 -0.0699 -0.0699 0.1884 -0.2495 -0.2495 0 2.1935 -0.2207 0.0000 0.2207 0.3571 -0.2992 0.2992 -0.3571 0.4841 -0.4841 qi = 0.9571 1.1119 0.9571 0.9218 0.9920 0.9920 0.9218 1.0730 1.0730 Appendix 6 613 Ten Centers Azulene (nonalternant) C10H8 p12 =0.6560, p4,10 =0.5858, p56 =0.6389, p19 =0.5956, p45 =0.6640, p9,10 =0.4009, Ep =10a +13.3635ß n x c1 c2 c3 c4 c5 c6 c7 c8 c9 c10 2 -2.3103 0.3233 0.2799 0.3233 0.2886 0.1998 0.1730 0.1998 0.2886 0.4670 0.4670 2 -1.6516 -0.2678 -0.3243 -0.2678 0.1909 0.4333 0.5247 0.4333 0.1909 -0.1180 -0.1180 2 -1.3557 0.2207 0.0000 -0.2207 -0.4841 -0.3571 0.0000 0.3571 0.4841 0.2992 -0.2992 2 -0.8870 -0.2585 -0.5829 -0.2585 0.2186 -0.1598 -0.3603 -0.1598 0.2186 0.3536 0.3536 2 -0.4773 0.5428 0.0000 -0.5428 0.1601 0.3355 0.0000 -0.3355 -0.1601 0.2591 -0.2591 0 0.4004 -0.0632 0.3158 -0.0632 0.4699 0.1023 -0.5109 0.1023 0.4699 -0.2904 -0.2904 0 0.7376 -0.2992 0.0000 0.2992 -0.3571 0.4841 0.0000 -0.4841 0.3571 0.2207 -0.2207 0 1.5792 0.4364 -0.5527 0.4364 -0.0844 0.2697 -0.3416 0.2697 -0.0844 -0.1365 -0.1365 0 1.8692 -0.2500 0.2675 -0.2500 -0.3233 0.4045 -0.4328 0.4045 -0.3233 0.1998 0.1998 0 2.0953 -0.2591 0.0000 0.2591 0.3355 -0.1601 0.0000 0.1601 -0.3355 0.5428 -0.5428 qi = 1.1729 1.0466 1.1729 0.8550 0.9864 0.8700 0.9864 0.8550 1.0274 1.0274 Naphthalene (alternant) C10H8 all q =1.0, Ep =10a +13.6832ß p12 =0.7246, p23 =0.6032, p19 =0.5547, p9,10 =0.5182 n x c1 c2 c3 c4 c5 c6 c7 c8 c9 c10 2 -2.3028 0.3006 0.2307 0.2307 0.3006 0.3006 0.2307 0.2307 0.3006 0.4614 0.4614 2 -1.6180 0.2629 0.4253 0.4253 0.2629 -0.2629 -0.4253 -0.4253 -0.2629 0.0000 0.0000 2 -1.3028 0.3996 0.1735 -0.1735 -0.3996 -0.3996 -0.1735 0.1735 0.3996 0.3470 -0.3470 2 -1.0000 0.0000 -0.4082 -0.4082 0.0000 0.0000 -0.4082 -0.4082 0.0000 0.4082 0.4082 2 -0.6180 0.4253 0.2629 -0.2629 -0.4253 0.4253 0.2629 -0.2629 -0.4253 0.0000 0.0000 Appendix 7 Derivation of the Hartree–Fock Equation This appendix is divided into two parts. In the first section we develop the formula for the expectation value E¯ =.|H|. for the case in which . is a single determinantal wavefunction over MOs. In the second section we derive the Hartree–Fock equation by requiring E¯ to be stationary with respect to variations in .. A7-1 The Expansion of E¯ in Terms of Integrals over MOs We limit discussion to the case in which . is a single, closed-shell determinant. We will develop our arguments by referring to a four-electron example: .4 =(4!)-1/2  f1(1) ¯ f1(2)f2(3) ¯ f2(4) (A7-1) Recall that this is the shorthand formula for a Slater determinant. Each f is a normalized MO,theMOsare assumed to be orthogonal, and a bar signifies that an electron possesses ß spin. As we develop our arguments within the context of ., we will generalize them to apply to the general 2n electron closed-shell wavefunction .2n =[(2n!)]-1/2|f1(1)f¯1(2)f2(3)f¯2(4) ···fn(2n-1)f¯n(2n)| (A7-2) When .4 is expanded according to the rule for determinants (Appendix 2), we obtain 4! products. We note the following features of the expanded form. 1. There is one product, occurring with coefficient+1, which is identical to the product appearing in the shorthand form of Eq. (A7-1). We refer to this as the “leading term.” 2. An equivalent way of expressing a Slater determinant is via the expression (for .4) .4 =(4!)-1/2 P (-1)pP f1(1)f¯1(2)f2(3)f¯2(4) (A7-3) Here P stands for all the sequences of permutations of electron labels that lead to different products (i.e., P is a permutation operator), and p is the number of pairwise permutations in a given sequence. For .4 there are 4! sequences P, the simplest being “no permutations” (hence, p =0) which produces the leading term. Then there are single permutations, such as P1,2 (with p =1), which produces the term -f1(2)f¯1(1)f2(3)f¯2(4). There are also double permutations, etc. According to Eq. (A7-3), any term differing from the leading term by an odd number of 614 Appendix 7 615 permutations will appear with coefficient -1. We will be particularly concerned with products that differ from the leading term by a single permutation. 3. A single permutation may be made to occur between electrons in MO’s with the same spins or different spins. In the latter case, two electrons in the singly permuted product will disagree in spin with their counterparts in the leading term. In the former case, no such spin disagreement will exist. 4. Terms also appear in .4 corresponding to more than a single permutation of electron indices. It is useful to pick a representative example of each type of product mentioned above. For .4, we have the following: Leading term: f1(1)f¯1(2)f2(3)f¯2(4) Singly permuted term: f1(1)f¯1(4)f2(3)f¯2(2); Spin agreement: (P2,4) Singly permuted term: f1(2)f¯1(1)f2(3)f¯2(4); Spin disagreement: (P1,2) Doubly permuted term: f1(2)f¯1(4)f2(3)f¯2(1); (P1,2P1,4) Upon expanding .4|Hˆ |.4, we obtain a set of 4! products on both the left and righthand sides of Hˆ : E¯ = (4!)-1  {f1*(1)f¯1*(2)f2*(3)f¯2*(4)-f1*(1)f¯1*(4)f2*(3)f¯2*(2)-···} ×Hˆ (1, 2, 3, 4){f1(1)f¯1(2)f2(3)f¯2(4)-f1(1)f¯1(4)f2(3)f¯2(2)-···}dt (A7-4) This can be expanded into a set of integrals, one for each term on the left: E¯ = (4!)-1  f1*(1)f¯1*(2)f2*(3)f¯2*(4)Hˆ (1, 2, 3, 4) f1(1)f¯1(2)f2(3)f¯2(4) - f1(1)f¯1(4)f2(3)f¯2(2)-f1(2)f¯1(1)f2(3)f¯2(4)-··· dt - f1*(1)f¯1*(4)f2*(3)f¯2*(2)Hˆ (1, 2, 3, 4) f1(1)f¯1(2)f2(3)f¯2(4) - f1(1)f¯1(4)f2(3)f¯2(2)-f1(2)f¯1(1)f2(3)f¯2(4)-··· dt (A7-5) etc. In Eq. (A7-5), E¯ is a sum of 4! integrals, each containing one term from the set on the left of Hˆ and all 4! from the set on the right. At first, it might seem that we must evaluate all of the 4! integrals in Eq. (A7-5). But it can be shown that these integrals, times their +1 or -1 coefficients, are all equal to each other, enabling us to write E¯ as 4! times the first integral: E¯ =  f1*(1)f¯1*(2)f2*(3)f¯2*(4)Hˆ (1, 2, 3, 4)  P (-1)pP(f1(1)f¯1(2)f2(3)f¯2(4)) dt (A7-6) 616 Derivation of the Hartree–Fock Equation The demonstration that the various integrals in Eq. (A7-5), times their coefficients, are equal to each other is as follows. Consider the second integral in Eq. (A7-5). Note that, if we permute electrons 2 and 4 in that integral, we restore the term on the left of Hˆ to its original “leading term” order, thereby making that product identical to its counterpart in the first integral. Furthermore, if we carry out this permutation throughout the whole of the integrand of integral number 2 (i.e., in Hˆ , in all 4! products to the right of Hˆ , and in dt ), we will not affect the value of the integral. (Recall that, for example,  1 0 x2dx  2 1 y3dy = 1 0 y2dy  2 1 x3dx The result of this permutation P2,4 on the second integral in Eq. (A7-5) is  f1*(1)f¯1*(2)f2*(3)f¯2*(4)Hˆ (1, 4, 3, 2) f1(1)f¯1(4)f2(3)f¯2(2) -f1(1)f¯1(2)f2(3)f¯2(4)-··· dt (A7-7) Now Hˆ is invariant under exchange of electron indices, and the set of products to the right of Hˆ in Eq. (A7-7) is the same set we had before, but their order is changed, and the whole product evidently differs by a factor of -1 from the set in the first integral. Therefore, we can say that the first and second integrals of Eq. (A7-5) have the same absolute value but different signs. However, the fact that these integrals contribute to E¯ with opposite signs cancels the sign disagreement. In this way, every integral in Eq. (A7-5) can be compared to the leading integral and Eq. (A7-6) verified. This much simplified expression for E¯ is, for the 2n-electron case E¯ = f1*(1)f¯1*(2) ···f*n(2n-1)f¯*n(2n)Hˆ (1, 2, . . . , 2n)  P (-1)pP(f1(1)f¯1(2) ···fn(2n-1)f¯n(2n)) dt (A7-8) At this point we write out Hˆ more explicitly. It is, in atomic units, Hˆ (1, 2, . . . , 2n) = 2n  i=1 - 1 2.2 i - nuclei  µ Zµ/rµi + 2n-1  i=1 2n  j=i+1 1/rij (A7-9) = 2n  i=1 Hcore (i) + 1/rij (A7-10) Here,  is a shorthand symbol for the double sum in Eq. (A7-9). We see that Hˆ is composed of one-electron operators, Hcore (i) , which deal with the kinetic and nuclearelectron attraction energies for electron i, and two-electron operators for interelectronic repulsion. The internuclear repulsion is omitted since, for a given nuclear configuration, it is simply a constant that can be added to the electronic energy. Note for future reference that Hˆ has no dependence on electron spin coordinates. Appendix 7 617 If we insert the expression (A7-10) for Hˆ into Eq. (A7-6) for E¯, it is easy to to see that we can expand the result into separate sets of integrals over one-and two-electron operators. We consider first the one-electron integrals. For .4, these are  f1*(1)f¯1*(2)f2*(3)f¯2*(4) × 4  i=1 Hcore (i) f1(1)f¯1(2)f2(3)f¯2(4)-f1(1)f¯1(4)f2(3)f2(2)-··· dt (A7-11) Upon expanding this, the first set of integrals we obtain is  f1*(1)f¯1*(2)f2*(3)f¯2*(4) Hcore (1) +Hcore (2) +Hcore (3) +Hcore (4) f1(1)f¯1(2)f2(3)f¯2(4)dt (A7-12) This can be expanded again. The first integral contains the operator Hcore (1) , which does not act on electrons 2-4. This allows a separation into a product of integrals as follows:  f1*(1)f¯1*(2)f2*(3)f¯2*(4)Hcore (1) f1(1)f¯1(2)f2(3)f¯2(4) dt (A7-13) = f*1 (1)Hcore (1) f1(1)dt1  f¯1*(2)f¯1(2) dt2  f*2 (3)f2(3)dt3  f¯2*(4)f¯2(4) dt4 (A7-14) The last three integrals are overlap integrals and are all unity by virtue of normality of the MOs. The first integral, a “core integral,” is normally symbolized H11. Here the subscripts refer to the MO index, not the electron index: Hii = f*i (1)Hcore (1) fi(1)dt1 (A7-15) Therefore, Eq. (A7-13) equalsH11. By continued expansion, Eq. (A7-12) can be shown to be equal to H11 + H11 + H22 + H22 = 2(H11 + H22). In this case, we have been dealing with identical MO products on the two sides of the operator. As we continue evaluating the expansion of Eq. (A7-12), we next encounter an integral in which the products differ by a permutation, namely, - f1*(1)f¯1*(2)f2*(3)f¯2*(4)Hcore (1) +Hcore (2) +Hcore (3) +Hcore (4) f1(1)f¯1(4)f2(3)f¯2(2) dt (A7-16) Again, for Hcore(1), this may be written - f*1 (1)Hcore (1) f1(1)dt1  f¯1*(2)f¯2(2) dt2  f*2 (3)f2(3)dt3  f¯2*(4)f¯1(4) dt4 (A7-17) Orbital orthogonality will cause the second and fourth integrals to vanish. In general, if the two products differ by one or more permutations, they will have two or more sites of disagreement. Upon expansion, at least one disagreement will occur in an overlap 618 Derivation of the Hartree–Fock Equation integral, causing the integral to vanish. Thus, except for the integral involving identical products [Eq. (A7-12)], all the integrals obtained by expansion of Eq. (A7-11) vanish. Our result, generalized to the 2n-electron case, is (seeAppendix 11 for bra-ket notation)  .2n  2n  i=1 Hcore (1)  .2n  = n  i=1 2Hii (A7-18) We now turn to integrals containing two-electron operators. Consider, for example, an integral that has products differing in two places, -f1(1)f¯1(2)f2(3)f¯2(4) |1/r13|f1(3)f¯1(2)f2(1)f¯2(4) (A7-19) This can be partly separated into a product of integrals over different electron coordinates. (A7-19)=-f1(1)f2(3)|1/r13|f1(3)f2(1)f¯1(2)|f¯1(2)f¯2(4)|f¯2(4) (A7-20) Observe that the two disagreements are inside the two-electron integral, and that the overlap terms both show complete internal agreement and are therefore equal to unity. It is clear that, if our two products differed in more than two places, at least one such disagreement would appear in an overlap integral, causing the whole integral to vanish. Therefore, two-electron integrals need be considered only if they involve products differing by zero or one permutations. Let us consider these two possibilities separately. If there are no disagreements, we have for .4, f1(1) ¯ f1(2)f2(3) ¯ f2(4)  1/rtj  f1(1)f¯1(2)f2(3)f¯2(4) =f1(1)f¯1(2)|1/r12|f1(1)f¯1(2)+f1(1)f2(3)|1/r13|f1(1)f2(3) +f1(1)f¯2(4)|1/r14|f1(1)f¯2(4)+f¯1(2)f2(3)|1/r23|f¯1(2)f2(3) +f¯1(2)f¯2(4)|1/r24|f¯1(2)f¯2(4)+f2(3)f¯2(4)|1/r34|f2(3)f¯2(4) (A7-21) These integrals give the coulombic repulsion between electrons in MOs. They are symbolized Jij , where Jij =fi(1)fj (2) |1/r12|fi(1)fj.(2)=ij |ij  (A7-22) Here fi and fj may be associated with either spin. Because the operator and MOs commute, the integrand can be rearranged to give Jij = f*i (1)fi(1)(1/r12)f*j (2)fj (2)dt1dt2 =(ii|jj ) (A7-23) Some people prefer this form because it places the two mutually repelling charge clouds on the two sides of the operator. It is important to realize that the parenthetical expression (ii|jj ) and the bra-ket shorthand ij |ij  correspond to different conventions for electron index order, and are really the same integral. Returning to Eq. (A7-21), we see that it is equal to J11 +J12 +J12 +J12 +J12 +J22 = 2  i=1 . . Jii + j =i 2Jij . . (A7-24) Appendix 7 619 We must now consider the case where the products differ by a single permutation, hence in two places. Anexample has been provided in Eq. (A7-19). We noted that, when the operator 1/rij corresponds to electrons i and j in the positions of disagreement, the overlap integrals are all unity. Otherwise, at least one overlap integral vanishes. An integral like that in Eq. (A7-20) is called an exchange integral. Exchange integrals can occur only when the product on the right of the operator differs from the leading term by a single permutation. Hence, exchange integrals always enter with a coefficient of -1. We noted earlier that two classes of singly permuted products exist. One class involves permutations between electrons of like spin. In such a case, fi and fj appear throughout the integral Kij with spin agreement. For cases where electrons of different spin have been permuted, spin disagreement forces the exchange integral to vanish. (Since 1/rij is not a spin operator, the integration over spin coordinates factors out and produces a vanishing integral if spins disagree.) The result of all this is that each singly permuted product can give -Kij if the permutation is between electrons of like spin in MOs fi and fj., and zero otherwise. For .4, the acceptable permutations can be seen to be electron 1 with 3 and electron 2 with 4, both of these occurring between f1 and f2 space MOs. Hence, the contribution to E is -2K12. Combining this with J terms gives .4  1/rij  .4=J11 +4J12 -2K12 +J22 (A7-25) From the definitions of J and K, it is apparent that Jij =Jji, Kij =Kji, Kii =Jii This allows us to rewrite Eq. (A7-25) as 2J11 -K11 +2J12 -K12 +2J21 -K21 +2J22 -K22 = 2  i=1 2  j=1 (2Jij -Kij ) (A7-26) Generalizing to the 2n-electron, closed-shell case and adding in our one-electron contribution, E¯ =.2n|Hˆ |.2n=2 n  i=1 Hii + n  i=1 n  j=1 (2Jij -Kij ) (A7-27) This is the desired expression for E¯ in terms of integrals over MOs fi for a singledeterminantal, closed-shell wavefunction. A7-2 Derivation of the Hartree–Fock Equations To find the “best”MOs, we seek those that minimize E¯, that is, thoseMOs f for which E¯ is stationary to small variations df. But there is a restriction in the variations df. The MOs can only be varied in ways that do not destroy their orthonormality since this property was assumed in deriving Eq. (A7-27). This means that, for proper variations 620 Derivation of the Hartree–Fock Equation df at the minimum E¯, both E¯ and all the MO overlap integrals Sij = fi |fj  must remain constant. (Sij must equal unity when i =j , zero otherwise.) If E¯ and Sij are constant, any linear combination of them is constant too. Thus, we may write that, at the minimum E¯, c0E¯ + i  j cijSij =constant (A7-28) for our restricted type of df. This equation will hold for any set of coefficients c as long as df is of the proper restricted nature. However, it is possible to show that, for a particular set of coefficients, Eq. (A7-28) is satisfied at minimum E¯ for any small variations df. The particular coefficients are called Lagrangian multipliers. They are of undetermined value thus far, but their values will become known in the course of solving the problem. The technique, known as “Lagrange’s method of undetermined multipliers” is from the calculus of variations.1 The Lagrangian multipliers will ultimately turn out to be essentially the MO energies. For future convenience we write Eq. (A7-28) in the form E¯ -2 i  j ijSij =constant for df (A7-29) where we now understand df to be unrestricted and ij to be some unknown special set of constants. The stability of the quantity on the left-hand side of Eq. (A7-29) may be expressed as follows: dE -2d i  j ijSij =0 (A7-30) or, expanding E¯, 2 n  i=1 dHii + n  i=1 n  j=1 (2dJij -dKij )-2 n  i=1 n  j=1 ij dSij =0 (A7-31) The variations occur in the MOs f, and so dSij =  df*i (1)fj (1)dt1 + f*i (1) dfj (1)dt1 (A7-32) dHii =  df*i (1)Hcore (1) fi(1)dt1 + f*i (1)Hcore (1) dfi(1)dt1 (A7-33) dJij =  df*i (1)f*j (2)(1/r12)fi(1)fj (2)dt1dt2 +  f*i (1)df*j (2)(1/r12)fi(1)fj (2)dt1dt2 +complex conjugates (A7-34) It is convenient to define a coulomb operator Jˆi(1) as Jˆi(1)= f*i (2)(1/r12)fi(2)dt2 (A7-35) 1For an introduction to this topic, see Margenau and Murphy [1]. Appendix 7 621 Using this definition we can rewrite Eq. (A7-34) as dJij = dfi*(1)Jˆj (1)fi(1) dt1 + df*j (1)Jˆi(1)fj (1) dt1 +complex conjugates (A7-36) In the same spirit, we define an exchange operator Kˆi , which, because it involves an orbital exchange, must be written in the context of an orbital being operated on: Kˆi(1)fj (1)= f*i (2)(1/r12)fj (2)dt2fi(1) (A7-37) This enables us to write dKij dKij =  df*i (1)Kˆj (1)fi(1) dt1 + df*j (1)Kˆi(1)fj (1) dt1 + complex conjugates Employing the operators Jˆ and Kˆ , Eq. (A7-31) can be written as follows: 2 i  df*i (1)Hcore (1) fi(1)+ j (2Jˆj (1)-Kˆj (1))fi(1)- j ijfj (1)dt1 +2 i  dfi(1)Hcore* (1) f*i (1)+ j (2Jˆ* j (1)-Kˆ *j (1))f*i (1) - j *ijf*j (1)dt1 =0 (A7-38) Here we have made use of the hermitian properties of Hcore, Jˆ, and Kˆ , and also the relation ji  dfj (1)f*i (1)dt1=ij  dfi(1)f*j (1)dt1, which is merely an index interchange. Since the variations df*i and dfi are independent, each half of Eq. (A7-38) must independently equal zero. Hence, we can select either half for further development. We will select the first half. This equation indicates that the sum of integrals equals zero. Either the integrals are all individually equal to zero or else they are finite but cancel. However the latter possibility is ruled out because the variations df*i are arbitrary. By appropriately picking df*i , we could always spoil cancellation if the various integrals for different i were nonzero. But the equation states that the sum vanishes for every df*i . Therefore, we are forced to conclude that each integral vanishes. Continuing in the same spirit, we can conclude that the term in brackets in the integrand is zero. For the integral to vanish requires the integrand either to be identically zero or else to have equal positive and negative parts. If the latter were true for some choice of df*i , it would be possible to change df*i so as to unbalance the cancellation and produce a nonzero integral. Since the integral is zero for all df*i , it must be that the bracketed term vanishes identically. Thus, Hcore (1) + j (2Jˆj (1)-Kˆj (1))fi(1)= j ijfj (1) (A7-39) for all i =1 to n and for a certain set of constants ij . 622 Derivation of the Hartree–Fock Equation The original development of SCF equations was performed by Hartree for simple product wavefunctions. Fock later extended the approach to apply to antisymmetrized wavefunctions. For this reason, the collection of operators in brackets in Eq. (A7-39) is called the Fock operator, symbolized Fˆ, and Eq. (A7-39) becomes Fˆ(1)fi(1)= j ijfj (1) (A7-40) Equation A7-40 is a differential equation for each MO fi . But as it stands it is not an eigenvalue equation because, instead of regenerating fi , we obtain a sum of functions fj times the various unknown constants ij . However, there remains a degree of freedom in the problem that can be used to throw Eq. (A7-40) into eigenvalue form. It is pointed out in Appendix 2 that the value of a determinant is unchanged if any row or column, multiplied by a constant, is added to any other row or column. This means that a Slater determinant of “best” MOs is unaffected by such internal rearrangements. In other words, if we were to solve Eq. (A7-40) for a set of “best” MOs, fb i , we could form various new orthonormal MOs, (e.g., fb i +.ikfb k,fb k -.kifb i ) by mixing them together, and our wavefunction ., and all values of observables predicted from ., including E¯, would be precisely the same. A transformation that mixes the MOs f without affecting the property of orthonormality is called a unitary transformation (see Chapter 9). Letting U stand for such a transformation, we have that a transformed set of f’s, called f, is given by fi = j Ujifj, i=1, . . . ,n (A7-41) In matrix notation, this is ˜  =˜ U (A7-42) where ˜  and ˜ are row vectors, viz. ˜  =(12 ···n), (A7-43) and U is an n×n matrix, with UU† =U†U=1 (A7-44) In terms of these matrices, Eq. (A7-40) is Fˆ˜ =˜ E (A7-45) where E is an n×n matrix. If we multiply this from the right by U, we obtain Fˆ˜ U=˜ EU (A7-46) Inserting 1 (in the form UU†) between ˜ and E gives Fˆ˜ U=˜ UU†EU (A7-47) or Fˆ˜  =˜ U†EU (A7-48) Appendix 7 623 We can now require that the matrix U be such that U†EU is a diagonal matrix E. (This requires that E be a hermitian matrix, which can be shown to be the case.)2 This requirement defines U, and we have Fˆ˜  =˜ E (A7-49) which corresponds to Fˆfi =ifi, i=1, 2, . . . ,n (A7-50) This equation has the desired eigenvalue form, and is commonly referred to as the Hartree–Fock equation. It is discussed at length in Chapter 11. It is important to bear in mind that our transformation U is for mathematical convenience and has no physical effect. We may imagine that our original basis set spans a certain function space, and that solution of Eq. (A7-50) produces a set of occupied MOs fi that span a “best” subspace. Transformations by unitary matrices produce new sets of MOs f but these still span the same subspace as f. However, they are generally not eigenfunctions of Fˆ, and satisfy the less convenient Eq. (A7-40). Nonetheless, there are occasions when it is useful to use some set of MOs other than f, and we can always do this without having to worry about introducing physical changes as long as our converted MO’s are related to f by a unitary transformation. The Hartree–Fock equation is ordinarily used in quantum chemistry in connection with a basis set of AOs, and it is possible to carry through a derivation of the Hartree– Fock equation for this type of basis. Detailed treatments of this derivation may be found in the paper by Roothaan [2] and in the book by Pople and Beveridge [3]. References [1] H. Margenau and G. M. Murphy, The Mathematics of Physics and Chemistry. Van Nostrand-Reinhold, Princeton, New Jersey, 1956. [2] C. C. J. Roothaan, Rev. Mod. Phys. 23, 69 (1951). [3] J. A. Pople and D. L. Beveridge, Approximate Molecular Orbital Theory. McGraw-Hill, NewYork, 1970. 2See Roothaan [2]. Appendix 8 The Virial Theorem for Atoms and Diatomic Molecules A8-1 Atoms In Chapter 3 itwas shownthat, for the ground state of the quantum-mechanical harmonic oscillator, the average value of the kinetic energy is equal to the average value of the potential energy. We now consider how the average electronic kinetic and potential energies are related in an atom. We begin by deriving a rather general expression, and then we discuss how it applies to different levels of calculation. As our first step, we examine the effects of coordinate scaling on average values. In order to follow this discussion, it is useful to recall that one can manipulate variables and limits in an integral as follows:  b a f (x)dx = b a f (y)dy = .x=b .x=a f (.x)d(.x)=.  x=b/. x=a/. f (.x)dx (A8-1) Let .(r1, r2, . . . , rn) be a normalized function of the space coordinates of n electrons. Let T¯ = .|Tˆ|. (A8-2) V¯ = .|Vˆ |. (A8-3) where Tˆ and Vˆ are, respectively, the kinetic and potential energy operators for some system, and are independent of spin. We introduce a scale factor . into .. This factor affects the lengths of the vectors ri but not their directions. That is, .. =.(.r1,.r2, . . . ,.rn) (A8-4) lf .>1,.n is more contracted in 3n-dimensional space than .. For .<1,.n is more diffuse. We must check to see if our scaled function .n is normalized. We know that 1 =  .*(r1, . . . , rn).(r1, . . . , rn)dv =  .*(.r1, . . . ,.rn).(.r1, . . . ,.rn)d(.v) (A8-5) because we have simply relabeled all variables r by .r, including the volume element, just as in Eq. A8-1. We now factor . out of the volume element and divide the limits of 624 Appendix 8 625 integration by ., just as in Eq. A8-1. However, the limits are zero and infinity, so they are unaffected. The volume element d(.v) is given by d(.v)=(.r1)2 sin .1d(.r1)d.1df1(.r2)2 sin .2d(.r2)d.2df2 ··· (A8-6) and so .3 appears for each electron. Thus, we are led to 1=.3n  .*... dv (A8-7) Therefore, our normalization constant for .. is .3n/2, and our normalized, scaled function is .. =.3n/2.(.r1,.r2, . . . ,.rn) (A8-8) [Compare this with the specific example encountered in Eq. (7-7).] We now inquire as to the values of V¯. and T¯., where T¯. = ..|Tˆ|.. (A8-9) V¯. = ..|Vˆ |.. (A8-10) For an n-electron atom, Tˆ = - 1 2 n  i=1 .2 i (A8-11) Vˆ = - n  i=1 (Z/ri)+ n-1  i=1 n  j=i+1 1/rij (A8-12) Therefore V¯. =.3n  .*(.r1, . . . ) . . n  i=1 -Z ri + n-1  i=1 n  j=i+1 1 rij . . .(.r1, . . . ) dv (A8-13) We couldmake the integral equal to V¯ ifwe could get the scale factor into all the r terms in the operator and also into dv. The volume element dv requires .3n, which is already present in Eq. A8-13 from the normalization constants. To get . into the operator, we need to multiply the operator by .-1. Multiplying Eq. A8-13 by ..-1 gives, then V¯. =.  .*(.r1, . . . ) . . n  t=1 -Z .ri + n-1  t=1 n  j=i+1 1 .rij . . .(.r1, . . . ) d(.v) (A8-14) or V¯. =.V¯ (A8-15) The same approach to T¯. gives T¯. =.2 T¯ (A8-16) 626 The Virial Theorem for Atoms and Diatomic Molecules This arises from the fact that .2 = 1 r2 . .r r2 . .r + 1 r2 sin . . .. sin . . .. + 1 r2 sin2 . .2 .f2 (A8-17) and scaling the r terms here requires multiplying by .-2. Hence, the integral is multiplied by .2.-2 in the final step. The general result is that, for any quantum-mechanical system where Vˆ =f (r-v) (A8-18) scaling results in T¯. =.2 T¯, V¯. =.v V¯ (A8-19) For atoms, Vˆ contains r as r-1. For all systems, Tˆ involves .2, which contains r to the net power of -2. We can now write the expression for the total energy of the atom as given by the scaled function E¯. =T¯. +V¯. =.2 T¯ +.V¯ (A8-20) Now we can seek the best value of the scale factor. We do this by minimizing E¯. with respect to variations in .: .E¯./..=2.T¯ +V¯ =0 (A8-21) (T¯ and V¯ are independent of ..) We are now in a position to make some statements about the average values of Tˆ and Vˆ for certain wavefunctions. Let us consider first the exact values of T¯ and V¯ . We know that, if . were an exact eigenfunction, no further energy lowering would result from rescaling. That is, . equals unity in Eq. A8-21. As a result, 2T¯ +V¯ =0 (A8-22) or V¯ =-2T¯ (A8-23) or, since T¯ +V¯ =E¯, E =-T¯ = 1 2 ¯ V (A8-24) Thus, for an atom, we know that the exact nonrelativistic energy is equal to minus the exact average kinetic energy and is equal to one half the exact potential energy. Knowing that the exact energy of the ground-state neon atom is -128.925 a.u. enables us to say that T¯ =+128.925 a.u. and V¯ =-257.850 a.u. without actually knowing .. Moreover, the same relation holds for any stable state of an atom. This same argument holds, not only for exact solutions, but for any trial function that has already been energy-optimized with respect to a scale factor, for then a new scaling parameter . gives no improvement, . = 1, and all is as above. Thus, any nonlinear variation scheme consistent with uniform scaling should ultimately lead to the relations Appendix 8 627 [Eqs. A8-22–A8-24]. Satisfying these relations is frequently referred to as satisfying the virial relation. Completely optimized single-. and double-. functions satisfy the virial relation. It follows that Hartree–Fock atomic wavefunctions must satisfy the virial relation. Such solutions are, by definition, the best (lowest energy) attainable in a single determinantal form. “Best” includes all conceivable variation, linear or nonlinear, so all improvements achievable by scale factor variation are already present at the Hartree– Fock level, and .=1. In the event that E¯ can be lowered by scaling, it is possible to evaluate the optimum. from Eq. A8-21, which gives .=-V¯ /2T¯ (A8-25) One of the useful applications of the virial theorem is as an indicator of closeness of approach to the Hartree–Fock solution for an atom. If the calculation involves nonlinear variation (uniformly applied to all r coordinates), then the resulting wavefunction will satisfy the virial relations no matter how deficient it is as an approximation to the true eigenfunction. However, if the calculation involves only linear variation, as for example, when a linear combination of gaussian functions is used to approximate an AO, then there is no guarantee that the virial relation will be satisfied. If the basis set is extensive enough, however, the Hartree–Fock limit will be approached, and V¯ /T¯ will approach -2. Strictly speaking, a linear variation calculation on an atom that gives V¯ /T¯ =-2 is simply one that cannot be improved by uniform scaling. Therefore, approach to-2 is not a guarantee of approach to the Hartree-Fock limit. It is a necessary but not a sufficient condition. A8-2 Diatomic Molecules The treatment here is very similar to that for atoms. We make the Born–Oppenheimer approximation by assuming that . depends parametrically on the internuclear separation R: . =.(r1, r2, . . . , rn,R) (A8-26) When we scale ri , we scale R as well: .. =.(.r1,.r2, . . . ,.rn,.R) (A8-27) Henceforth, we let .R = .. Performing the same variable manipulations as in Section A8-1, we find T¯. = T¯(., .)=.2 T¯(1, .) (A8-28) V¯. = V¯ (., .)=.V¯ (1, .) (A8-29) Here, Vˆ may or may not include the internuclear repulsion term. This gives, for the total energy, E¯. =.2 T¯(1, .)+.V¯ (1, .) (A8-30) 628 The Virial Theorem for Atoms and Diatomic Molecules Upon taking the derivative with respect to ., we obtain .E¯n .. =2.T¯(1, .)+V¯ (1, .)+.2 .T¯(1, .) .. +. .V¯ (1, .) .. =0 (A8-31) This differs fromthe atomic case in that V¯ (1, .) and T¯(1, .) depend on . through ..But . .. = . .. .. .. = . .. R (A8-32) and so Eq. A8-31 becomes .E¯. .. =0=2.T¯(1, .)+.V¯ (1, .)+.2R .T¯(1, .) .. +.R .V¯ (1, .) .. (A8-33) If we assume that . is the exact eigenfunction, then .=1, and 2T¯ +V¯ +R .E¯ .R =0 (A8-34) Indeed, this relation holds for any case in which all improvement in the nature of a scale factor variation has been made, such as, for example, the Hartree-Fock limit. Note that, if V¯ contains internuclear repulsion, E¯ is the total energy. If not, E¯ is the electronic energy. A8-2.1 Problems A8-1. Use the methods outlined in this appendix to show that V¯ =T¯ for any stationary state of the quantum mechanical harmonic oscillator. A8-2. Evaluate V¯ and T¯ with . = 1/vp exp(-r) for the Li2+ ion. From these, establish the optimum scale factor . and write down the expression for the normalized optimized .. and the optimized energy E.. Compare these results with the eigenfunction for Li2+. Appendix 9 Bra-ket Notation Bra-ket, or Dirac, notation is frequently used in the literature because of its economical form. Perhaps the best way to learn how this notation is used is by studying its use in a few familiar relations and proofs. Accordingly, we have outlined a few of these uses. The applications and subtleties of this notation go considerably beyond the treatment summarized here.1 “Usual” notation Dirac notation  f*m fndt = fm bra | fn ket =m|n (A9-1)  f*m Afndt = fm|A|fn=m|A|n=Amn (A9-2)  f*m fndt* = f*nfmdt fm|fn* =fn|fm or (A9-3) m|n* =n|m For hermitian operator A:  f*m Afndt =  fn(Afm)*dt m|A|n=n|A|m* =  f*nAfmdT * (A9-4) Any function. can be written as a sum of a complete set of orthonormal functions f: . =m cmfm |. =m cm|fm=m cm|m  f*n.dt =m cm  f*nfmdt =cn n|. =m cmn|m=cn cn = f*n.dt cn =n|. . =m cmfm =m  f*m .dtfm |. =mm|.|m =m |mm|. 1Strictly speaking, for instance, fm|fn and m|n are not identical in meaning. The former refers to specific functions, fm and fn, which represent state vectors in a specific representation. The latter refers to the state vectors in any representation and hence is a more general expression. Distinctions such as this will not be necessary at the level of this text. 629 630 Bra-ket Notation Example of Use: Proof that eigenvalues of A (hermitian) are real. A|m = am|m (A9-5) m|A|m = amm|m    =0,=8 , (A9-6) m|A|m* = a*m m|m (A9-7) Combining Eqs. A9-4, A9-6, and A9-7, we have (am -a*m )=0 Appendix 10 Values of Some Useful Constants and Conversion Factors 631 Values of Some Useful Constants a Value Quantity Symbol and/or formulac Atomic units SI units Other units Uncertainty (ppm) a) Fundamental constants Planck’s constant h 2p 6.626176×10-34 J sec 4.135701×10-15 eV sec 5.4 Planck’s constant h/2p ¯h 1 1.0545887×10-34 J sec 6.582173×10-16 eV sec 5.4 Rest mass of electron me 1 9.109534×10-31 kg 9.109534×10-28 gm 5.1 Charge of electron -e -1 -1.602189×10-19 C -4.803242×10-10 esu 2.9 Rest mass of proton mp 1.83615×103 1.6726485×10-27 kg 1.6726485×10-24 gm 5.1 Rest mass of neutron mn 1.83868×103 1.6749543×10-27 kg 1.6749543×10-24 gm 5.1 Speed of light in vacuum c 137.039 2.99792458×108m sec-1 2.99792458×1010 cm sec-1 0.004 Avogadro’s numberb NA — 6.0220943×1023 mol-1 — 1.05 Bohr magneton ße 1 2 9.274078×10-24 JT-1 Electron g value ge — 2.00232 b) Derived quantities Bohr radius a0 1 5.2917706×10-11m 0.52917706Å 0.82 Vacuum permittivity 4p0 1 1.11265×10-10 J-1 C2 m-1 Twice the ionization potential of the hydrogen atom with infinite nuclear mass Ea = e2/4p0a0 1 4.359814×10-18 J 27.21161 eV; 2 rydbergs 6.6 Electric field strength one bohr radius from proton e/4p0a2 0 1 5.1423×1011V m-1 1.715270×1010 esu cm-2 4.4 632 Polarizability (of a molecule) a =e2a0 2/Ea 1 1.648776×10-41 C2 m2 J-1 1.481846×10-25 esu2 cm2 erg-1 1.4 Bohr magneton µB =e ¯h/2me 1 2 9.274078×10-24 J T-1 5.788378×10-9 eV G-1 3.9 Nuclear magneton µN =e ¯h/2mp 2.723087×10-4 5.050824×10-27 J T-1 3.1524515×10-12 eV G-1 3.9 Electric dipole moment of electron–proton separated by one Bohr radius ea0 1 8.478418×10-30C m 2.541765×10-20 esu m =2.541765 debyes 3.8 Time for 1s electron in hydrogen atom to travel one bohr radius t = ¯h/Ea 1 2.41888×10-17 sec — 12 Atomic unit of velocity a0/t 1 2.18769×106m sec-1 2.18767×108 cm sec-1 13 Atomic unit of volume a0 3 1 1.481846×10-31m3 0.14818 Å3 2.0 Atomic unit of probability density a0-3 1 6.748340×1030 m-3 6.748340 Å-3 3.0 aFrom Cohen and Taylor [1]. C=coulomb, J=joule, V=volt, T=tesla, G=gauss, Å=angstrom. bSee Ref. [2]. cFormula appropriate for atomic units. 633 Energy Conversion Factorsa eV jouleb kcal/mole Hz m-1 .K a.u. eV 1 1.6021892×10-19 23.060362 2.4179696×1014 8.065479×105 1.160450×104 3.674901×10-2 jouleb 6.2414601×1018 1 1.4393033×1020 1.5091661×1033 5.034037×1024 7.242902×1022 2.293675×1017 kcal/mole 4.336445×10-2 6.947805×10-21 1 1.0485393×1013 3.497551×104 5.032223×102 1.593601×10-3 Hz 4.1357012×10-15 6.6261759×10-34 9.5370770×10-14 1 3.3356412×10-9 4.799274×10-11 1.519829×10-16 m-1 1.239852×10-6 1.986477×10-25 2.859144×10-5 2.9979243×108 1 1.438786×10-2 4.556333×10-8 .K 8.617347×10-5 1.380662×10-23 1.987191×10-3 2.083648×1010 6.950303×101 1 3.166790×10-6 a.u. 27.21161 4.359816×10-18 6.275098×102 6.579686×1015 2.194747×107 3.157772×105 1 aTo convert from units in the left hand column to units in the top row, multiply by the factor in the row-column position, e.g., 1 kcal/mole= 1.0485393×1013 Hz. b1 joule=107 erg. 634 Appendix 10 635 References [1] E. R. Cohen and B. N. Taylor, J. Phys. Chem. Ref. Data 2, No. 4, 663 (1973). [2] A. L. Robinson, Science, 185, 1037 (1974). See also P. J. Mohr and B. N. Taylor, Rev. Mod. Phys. 72 (2000) for revised values. Appendix 11 Group Theoretical Charts and Tables A11-1 Flow Scheme for Group Symbols Is the molecule linear? No No Is there a reflection plane perpendicular to the axis? T, T h, T d, O, or O h (Examine character tables to choose among these) Is there a reflection plane? Is there a point of conversion? C 1 C I C s I or I h C v D h Are there one or more proper rotation axes? Does the molecule possess six C5 axes? Does the molecule possess four C3 axes? Yes No No No No No Yes Yes Yes Yes Yes Yes 636 Appendix 11 637 Selection of reference axis Cn. (Defines vertical direction.) Is there an axis is having higher order than any other proper axis? No No No Yes Yes Yes Yes Yes Yes Yes Yes No No No No No Of the several axes having highest order, is there one which is geometrically unique? Choose any of the equivalent axes as Cn (Cn and n now defined) Select this as reference axis Cn Is there an S2n axis collinear with Cn? S2n Cn h Cn v Cn Dnh Dnd Dn Are there n twofold axes perpendicular to Cn? Are there any symmetry elements present other than Cn, S2n, or i ? Is there a reflection plane (s h) perpendicular to Cn ? Is there a set of n reflection planes (sv) containing Cn ? Is there a set of n reflection planes containing Cn and bisecting the n twofold axes? Is there a reflection plane (sh) perpendicular to Cn? Yes (D branch) No (C branch) A11-2 Meaning of Labels for Representations Symbol Interpretation Main Symbol A One-dimensional representation symmetric for rotation by 2p/n about principal axis. (For c1, cs, ci , which have no principal axis, this symbol merely means a one-dimensional representation.) B One-dimensional representation but antisymmetric for rotation by 2p/n about principal axis E Two-dimensional representation T (or F) Three-dimensional representation G Four-dimensional representation Subscripts 1 Symmetric for perpendicular C2 rotations (or else sv or sd reflections) 2 Antisymmetric for perpendicular C2 rotations (or else sv or sd reflections) 638 Group Theoretical Charts and Tables g Symmetric for inversion u Antisymmetric for inversion Superscripts  Symmetric for sh reflection  Antisymmetric for sh reflection A11-3 Character Tables A11-3.A Special High-Symmetry Groups: C8v,D8h,I,Ih,T,Th, Td,O,Oh C8v E 2C8  ··· 8sv A1 =+ 1 1 ··· 1 z x2 +y2, z2 A2 =- 1 1 ··· -1 Rz E1 = 2 2 cos ··· 0 (x, y);(Rx,Ry) (xz, yz) E2 = 2 2 cos 2 ··· 0 (x2 -y2, xy) E3 = 2 2 cos 3 ··· 0 ··· ··· ··· ··· ··· D8h E 2C8  ··· 8sv i 2S8  ··· 8C2 g+ 1 1 ··· 1 1 1 ··· 1 x2 +y2, z2 g- 1 1 ··· -1 1 1 ··· -1 Rz g 2 2 cos ··· 0 2 -2 cos ··· 0 (Rx,Ry) (xz, yz) g 2 2 cos 2 ··· 0 2 2 cos 2 ··· 0 (x2 -y2, xy) ··· ··· ··· ··· ··· ··· ··· ··· ··· u+ 1 1 ··· 1 -1 -1 ··· -1 z u- 1 1 ··· -1 -1 -1 ··· 1 u 2 2 cos ··· 0 -2 2 cos ··· 0 (x, y) u 2 2 cos 2 ··· 0 -2 -2 cos 2 ··· 0 ··· ··· ··· ··· ··· ··· ··· ··· ··· I E 12C5 12C5 2 20C3 15C2 A 1 1 1 1 1 x2 +y2 +z2 T1 3 1 2 (1+v5) 1 2 (1-v5) 0 -1 (x, y, z);(Rx,Ry,Rz) T2 3 1 2 (1-v5) 1 2 (1+v5) 0 -1 G 4 -1 -1 1 0 H 5 0 0 -1 1 (2z2 -x2 -y2, x2 -y2,xy, yz, zx) Ih E 12C5 12C5 2 20C3 15C2 i 12S10 12S10 3 20S6 15s Ag 1 1 1 1 1 1 1 1 1 1 x2 +y2 +z2 T1g 3 1 2 (1+v5) 1 2 (1-v5) 0 -1 3 1 2 (1-v5) 1 2 (1+v5) 0 -1 (Rx,Ry,Rz) T2g 3 1 2 (1-v5) 1 2 (1+v5) 0 -1 3 1 2 (1+v5) 1 2 (1-v5) 0 -1 Gg 4 -1 -1 1 0 4 -1 -1 1 0 Hg 5 0 0 -1 1 5 0 0 -1 1 (2z2 -x2 -y2, x2 -y2, xy, yz, zx) Au 1 1 1 1 1 -1 -1 -1 -1 -1 T1u 3 1 2 (1+v5) 1 2 (1-v5) 0 -1 -3 -1 2 (1-v5) -1 2 (1+v5) 0 1 (x, y, z) T2u 3 1 2 (1-v5) 1 2 (1+v5) 0 -1 -3 -1 2 (1+v5) -1 2 (1-v5) 0 1 Gu 4 -1 -1 1 0 -4 1 1 -1 0 Hu 5 0 0 -1 1 -5 0 0 1 -1 639 640 Group Theoretical Charts and Tables T E 4C3 4C3 2 3C2  =exp(2pi/3) A 1 1 1 1 x2 +y2 +z2 E 1  * 1 (2z2 -x2 -y2, x2 -y2 1 ) *  1 T 3 0 0 -1 (Rx,Ry,Rz);(x, y, z) (xy, xz, yz) Th E 4C3 4C3 2 3C2 i 4S6 4S6 5 3sh  =exp(2pi/3) Ag 1 1 1 1 1 1 1 1 x2 +y2 +z2 Au 1 1 1 1 -1 -1 -1 -1 Eg 1  * 1 1  * 1 (2z2 -x2 -y2,  1 *  1 1 *  1 x2 -y2) Eu 1  * 1 -1 - -* -1 1 *  1 -1 -* - -1 Tg 3 0 0 -1 1 0 0 -1 (Rx,Ry,Rz) (xz, yz,xy) Tu 3 0 0 -1 -1 0 0 1 (x, y, z) Td E 8C3 3C2 6S4 6sd A1 1 1 1 1 1 x2 +y2 +z2 A2 1 1 1 -1 -1 E 2 -1 2 0 0 (2z2 -x2 -y2, x2 -y2) T1 3 0 -1 1 -1 (Rx,Ry,Rz) T2 3 0 -1 -1 1 (x, y, z) (xy, xz, yz) O E 6C4 3C2(=C4 2) 8C3 6C2 A1 1 1 1 1 1 x2 +y2 +z2 A2 1 -1 1 1 -1 E 2 0 2 -1 0 (2z2 -x2 -y2, x2 -y2) T1 3 1 -1 0 -1 (Rx,Ry,Rz); (x, y, z) T2 3 -1 -1 0 1 (xy, xz, yz) Appendix 11 641 Oh E 8C3 6C2 6C4 3C2 (=C4 2) i 6S4 8S6 3sh 6sd A1g 1 1 1 1 1 1 1 1 1 1 x2 +y2 +z2 A2g 1 1 -1 -1 1 1 -1 1 1 -1 Eg 2 -1 0 0 2 2 0 -1 2 0 (2z2 -x2 -y2, x2 -y2) T1g 3 0 -1 1 -1 3 1 0 -1 -1 (Rx,Ry,Rz) T2g 3 0 1 -1 -1 3 -1 0 -1 1 (xz, yz,xy) A1u 1 1 1 1 1 -1 -1 -1 -1 -1 A2u 1 1 -1 -1 1 -1 1 -1 -1 1 Eu 2 -1 0 0 2 -2 0 1 -2 0 T1u 3 0 -1 1 -1 -3 -1 0 1 1 (x, y, z) T2u 3 0 1 -1 -1 -3 1 0 1 -1 A11-3.B Groups with No Axis of Symmetry: C1,Ci,Cs C1 E A 1 Cs E sh A 1 1 x, y,Rz x2, y2, z2, xy A 1 -1 z,Rx,Ry yz, xz Ci E i Ag 1 1 Rx,Ry,Rz x2, y2, z2, xy, xz, yz Au 1 -1 x, y, z A11-3.C The S2n Groups S4 E S4 C2 S4 3 A 1 1 1 1 Rz x2 +y2, z2 B 1 -1 1 -1 z x2 -y2, xy E 1 i -1 -i 1 -i -1 i (x, y);(Rx,Ry) (xz, yz) S6 E C3 C3 2 i S6 5 S6  =exp(2pi/3) Ag 1 1 1 1 1 1 Rz x2 +y2, z2 Eg 1  * 1  * 1 *  1 *  (Rx,Ry) (x2 -y2, xy); (xz, yz) Au 1 1 1 -1 -1 -1 z Eu 1  * -1 - -* 1 *  -1 -* - (x, y) 642 Group Theoretical Charts and Tables S8 E S8 C4 S8 3 C2 S8 5 C4 3 S8 7  =exp(2pi/8) A 1 1 1 1 1 1 1 1 Rz x2 +y2, z2 B 1 -1 1 -1 1 -1 1 -1 z E1 1  i -* -1 - -i * 1 * -i - -1 -* i  (x, y); (Rx,Ry) E2 1 i -1 -i 1 i -1 -i  (x2-y2 1 - , xy) i -1 i 1 -i -1 i E3 11 --* -ii * --11 * -ii --* (xz, yz) A11-3.D The Cn Groups C2 E C2 A 1 1 z,Rz x2, y2, z2, xy B 1 -1 x, y,Rx,Ry yz, xz C3 E C3 C3 2  =exp(2pi/3) A 1 1 1 z,Rz x2 +y2, z2 E 1  * 1 *  (x, y);(Rx,Ry) (x2-y2, xy);(yz, xz) C4 E C1 C2 C4 3 A 1 1 1 1 z,Rz x2 +y2, z2 B 1 -1 1 -1 x2 -y2, xy E 1 i -1 -i 1 -i -1 i (x, y);(Rx,Ry) (yz, xz) C5 E C5 C5 2 C5 3 C5 4  =exp(2pi/5) A 1 1 1 1 1 z,Rz x2 +y2, z2 E1 1  2 2* *  1 * 2* 2  (x, y);(Rx,Ry) (yz, xz) E2 1 2 *  2* 1 2*  * 2 (x2 -y2, xy) Appendix 11 643 C6 E C6 C3 C2 C3 2 C6 5  =exp(2pi/6) A 1 1 1 1 1 1 z,Rz x2 +y2, z2 B 1 -1 1 -1 1 -1 E1 1  -* -1 - *  1 * - -1 -*  (x, y) (Rx,Ry) (xz, yz) E2 1 -* - 1 -* -  1 - -* 1 - -* (x2 -y2, xy) C7 E C7 C7 2 C7 3 C7 4 C7 5 C7 6  =exp(2pi/7) A 1 1 1 1 1 1 1 z,Rz x2 +y2, z2 E1 1  2 3 3* 2* *  1 * 2* 3* 3 2  (x, y) (Rx,Ry) (xz, yz) E2 1 2 3* *  3 2* 1 2* 3  * 3* 2 (x2 -y2, xy) E3 1 3 * 2 2*  3* 1 3*  2* 2 * 3 C8 E C8 C4 C2 C4 3 C8 3 C8 5 C8 7  =exp(2pi/8) A 1 1 1 1 1 1 1 1 z,Rz x2 +y2, z2 B 1 -1 1 1 1 -1 -1 1 E1 1  i -1 -i -* - * 1 * -i -1 i - -*  (x, y);(Rx,Ry) (xz, yz) E2 1 i -1 1 -1 -i i -i  1 -i -1 1 -1 i -i i (x2 -y2, xy) E3 1 - i -1 -i *  -* 1 -* -i -1 i  * - A11-3.E The Cnv Groups C2v E C2 sv(xz) sv(yz) A1 1 1 1 1 z x2, y2, z2 A2 1 1 -1 -1 Rz xy B1 1 -1 1 -1 x,Ry xz B2 1 -1 -1 1 y,Rx yz C3v E 3sv 2C3 A1 1 1 1 z x2 +y2, z2 A2 1 -1 1 Rz E 2 0 -1 (x, y);(Rx,Ry) (x2 -y2, xy);(xz, yz) 644 Group Theoretical Charts and Tables C4v E 2C4 C2 2sv 2sd A1 1 1 1 1 1 z x2 +y2, z2 A2 1 1 1 -1 -1 Rz B1 1 -1 1 1 -1 x2 -y2 B2 1 -1 1 -1 1 xy E 2 0 -2 0 0 (x, y);(Rx,Ry) (xz, yz) C5v E 2C5 2C5 2 5sv A1 1 1 1 1 z x2 +y2, z2 A2 1 1 1 -1 Rz E1 2 2 cos 72. 2 cos 144. 0 (x, y);(Rx,Ry) (xz, yz) E2 2 2 cos 144. 2 cos 72. 0 (x2 -y2, xy) C6v E 2C6 2C3 C2 3sv 3sd A1 1 1 1 1 1 1 z x2 +y2, z2 A2 1 1 1 1 -1 -1 Rz B1 1 -1 1 -1 1 -1 B2 1 -1 1 -1 -1 1 E1 2 1 -1 -2 0 0 (x, y);(Rx,Ry) (xz, yz) E2 2 -1 -1 2 0 0 (x2 -y2, xy) A11-3.F The Cnh Groups C2h E C2 i sh Ag 1 1 1 1 Rz x2, y2, z2, xy Bg 1 -1 1 -1 Rx,Ry xz, yz Au 1 1 -1 -1 z Bu 1 -1 -1 1 x,y C3h E C3 C3 2 sh S3 S3 5  =exp(2pi/3) A 1 1 1 1 1 1 Rz x2 +y2, z2 E 1  * 1  * 1 *  1 *  (x, y) (x2 -y2, xy) A 1 1 1 -1 -1 -1 z E 1  * -1 - -* (Rx,Ry) (xz, yz) 1 *  -1 -* - Appendix 11 645 C4h E C4 C2 C4 3 i S4 3 sh S4 Ag 1 1 1 1 1 1 1 1 Rz x2 +y2, z2 Bg 1 -1 1 -1 1 -1 1 -1 x2 -y2, xy Eg 1 i -1 -i 1 i -1 -i 1 -i -1 i 1 -i -1 i (Rx,Ry) (xz, yz) Au 1 1 1 1 -1 -1 -1 -1 z Bu 1 -1 1 -1 -1 1 -1 1 Eu 1 i -1 -i -1 -i 1 i 1 - (x, y) i -1 i -1 i 1 -i C5h E C5 C5 2 C5 3 C5 4 sh S5 S5 7 S5 3 S5 9  =exp(2pi/5) A 1 1 1 1 1 1 1 1 1 1 Rz x2 +y2, z2 E1  1  2 2* * 1  2 2* *  1 * 2* 2  1 * 2* 2  (x, y) E2 1 2 *  2* 1 2 *  2* 1 2*  * 2 1 2*  * 2 (x2 -y2, xy) A 1 1 1 1 1 -1 -1 -1 -1 -1 z E1 1  2 2* * -1 - -2 -2* -*  1 * 2* 2  -1 -* -2* -2 - (Rx,Ry ) (xz, yz) E2 1 2 *  2* -1 -2 -* - -2* 1 2*  * 2 -1 -2* - -* -2 C6h E C6 C3 C2 C3 2 C6 5 i S3 5 S6 5 sh S6 S3  =exp(2pi/6) Ag 1 1 1 1 1 1 1 1 1 1 1 1 Rz x2 +y2, z2 Bg 1 -1 1 -1 1 -1 1 -1 1 -1 1 -1 E1g 1  -*-1 - * 1  -* -1 - * 1 *- -1 -*  1 * - -1 -*  (Rx,Ry ) (xz, yz) E2g 1 -*- 1 -* - 1 -* - 1 -* -  1 - -* 1 - -* 1 - -* 1 - -* (x2 -y2, xy) Au 1 1 1 1 1 1 -1 -1 -1 -1 -1 -1 z Bu 1 -1 1 -1 1 -1 -1 1 -1 1 -1 1 E1u 1  -*-1 - * -1 - * 1  -* 1 *- -1 -*  -1 -*  1 * - (x, y) E2u 1 -*- 1 -* - -1 *  -1 *   1 - -* 1 - -* -1  * -1  * 646 Group Theoretical Charts and Tables A11-3.G The Dn Groups D2 E C2(z) C2(y) C2(x) A 1 1 1 1 x2, y2, z2 B1 1 1 -1 -1 z,Rz xy B2 1 -1 1 -1 y,Ry xz B3 1 -1 -1 1 x,Rx yz D3 E 2C3 3C2 A1 1 1 1 x2 +y2, z2 A2 1 1 -1 z,Rz E 2 -1 0 (x, y);(Rx,Ry) (x2 -y2, xy);(xz, yz) D4 E 2C4 C2(=C4 2) 2C2 2C2 A1 1 1 1 1 1 x2 +y2, z2 A2 1 1 1 -1 -1 z,Rz B1 1 -1 1 1 -1 x2 -y2 B2 1 -1 1 -1 1 xy E 2 0 -2 0 0 (x, y);(Rx,Ry) (xz, yz) D5 E 2C5 2C5 2 5C2 A1 1 1 1 1 x2 +y2, z2 A2 1 1 1 -1 z,Rz E1 2 2 cos 72. 2 cos 144. 0 (x, y);(Rx,Ry) (xz, yz) E2 2 2 cos 144. 2 cos 72. 0 (x2 -y2, xy) D6 E 2C6 2C3 C2 3C2 3C2 A1 1 1 1 1 1 1 x2 +y2, z2 A2 1 1 1 1 -1 -1 z,Rz B1 1 -1 1 -1 1 -1 B2 1 -1 1 -1 -1 1 E1 2 1 -1 -2 0 0 (x, y);(Rx,Ry) (xz, yz) E2 2 -1 -1 2 0 0 (x2 -y2, xy) Appendix 11 647 A11-3.H The Dnd Groups D2d E 2S4 C2 2C2 2sd A1 1 1 1 1 1 x2 +y2, z2 A2 1 1 1 -1 -1 Rz B1 1 -1 1 1 -1 x2 -y2 B2 1 -1 1 -1 1 z xy E 2 0 -2 0 0 (x, y);(Rx,Ry) (xz, yz) D3d E 2C3 3C2 i 2S6 3sd A1g 1 1 1 1 1 1 x2 +y2, z2 A2g 1 1 -1 1 1 -1 Rz Eg 2 -1 0 2 -1 0 (Rx,Ry) (x2 -y2, xy), (xz, yz) A1u 1 1 1 -1 -1 -1 A2u 1 1 -1 -1 -1 1 z Eu 2 -1 0 -2 1 0 (x, y) D4d E 2S8 2C4 2S8 3 C2 4C2 4sd A1 1 1 1 1 1 1 1 x2 +y2, z2 A2 1 1 1 1 1 -1 -1 Rz B1 1 -1 1 -1 1 1 -1 B2 1 -1 1 -1 1 -1 1 z E1 2 v2 0 -v2 -2 0 0 (x, y) E2 2 0 -2 0 2 0 0 (x2 -y2, xy) E3 2 -v2 0 v2 -2 0 0 (Rx,Ry) (xz, yz) D5d E 2C5 2C5 2 5C2 i 2S3 10 2S10 5sd A1g 1 1 1 1 1 1 1 1 x2 +y2, z2 A2g 1 1 1 -1 1 1 1 -1 Rz E1g 2 2 cos 72. 2 cos 144. 0 2 2 cos 72. 2 cos 144. 0 (Rx,Ry ) (xz, yz) E2g 2 2 cos 144. 2 cos 72. 0 2 2 cos 144. 2 cos 72. 0 (x2 -y2, xy) A1u 1 1 1 1 -1 -1 -1 -1 A2u 1 1 1 -1 -1 -1 -1 1 z E1u 2 2 cos 72. 2 cos 144. 0 -2 -2 cos 72. -2 cos 144. 0 (x, y) E2u 2 2 cos 144. 2 cos 72. 0 -2 -2 cos 144.-2 cos 72. 0 648 Group Theoretical Charts and Tables D6d E 2S12 2C6 2S4 2C3 2S5 12 C2 6C2 6sd A1 1 1 1 1 1 1 1 1 1 x2 +y2, z2 A2 1 1 1 1 1 1 1 -1 -1 Rz B1 1 -1 1 -1 1 -1 1 1 -1 B2 1 -1 1 -1 1 -1 1 -1 1 z E1 2 v3 1 0 -1 -v3 -2 0 0 (x, y) E2 2 1 -1 -2 -1 1 2 0 0 (x2 -y2, xy) E3 2 0 -2 0 2 0 -2 0 0 E4 2 -1 -1 2 -1 -1 2 0 0 E5 2 -v3 1 0 -1 v3 -2 0 0 (Rx,Ry) (xz, yz) A11-3.I The Dnh Groups D2h E C2(z) C2(y) C2(x) i s(xy) s(xz) s(yz) Ag 1 1 1 1 1 1 1 1 x2, y2, z2 B1g 1 1 -1 -1 1 1 -1 -1 Rz xy B2g 1 -1 1 -1 1 -1 1 -1 Ry xz B3g 1 -1 -1 1 1 -1 -1 1 Rx yz Au 1 1 1 1 -1 -1 -1 -1 B1u 1 1 -1 -1 -1 -1 1 1 z B2u 1 -1 1 -1 -1 1 -1 1 y B3u 1 -1 -1 1 -1 1 1 -1 x D3h E 2C3 3C2 sh 2S3 3sv A1 1 1 1 1 1 1 x2 +y2, z2 A2 1 1 -1 1 1 -1 Rz E 2 -1 0 2 -1 0 (x, y) (x2 -y2, xy) A1 1 1 1 -1 -1 -1 A2 1 1 -1 -1 -1 1 z E 2 -1 0 -2 1 0 (Rx,Ry) (xz, yz) Appendix 11 649 D4h E 2C4 C2 2C2 2C2 i 2S4 sh 2sv 2sd A1g 1 1 1 1 1 1 1 1 1 1 x2 +y2, z2 A2g 1 1 1 -1 -1 1 1 1 -1 -1 Rz B1g 1 -1 1 1 -1 1 -1 1 1 -1 x2 -y2 B2g 1 -1 1 -1 1 1 -1 1 -1 1 xy Eg 2 0 -2 0 0 2 0 -2 0 0 (Rx,Ry) (xz, yz) A1u 1 1 1 1 1 -1 -1 -1 -1 -1 A2u 1 1 1 -1 -1 -1 -1 -1 1 1 z B1u 1 -1 1 1 -1 -1 1 -1 -1 1 B2u 1 -1 1 -1 1 -1 1 -1 1 -1 Eu 2 0 -2 0 0 -2 0 2 0 0 (x, y) D5h E 2C5 2C5 2 5C2 sh 2S5 2S5 3 5sv A1 1 1 1 1 1 1 1 1 x2 +y2, z2 A2 1 1 1 -1 1 1 1 -1 Rz E1 2 2 cos 72. 2 cos 144. 0 2 2 cos 72. 2 cos 144. 0 (x, y) E2 2 2 cos 144. 2 cos 72. 0 2 2 cos 144. 2 cos 72. 0 (x2 -y2, xy) A1 1 1 1 1 -1 -1 -1 -1 A2 1 1 1 -1 -1 -1 -1 1 z E1 2 2 cos 72. 2 cos 144. 0 -2 -2 cos 72. -2 cos 144. 0 (Rx,Ry ) (xz, yz) E2 2 2 cos 144. 2 cos 72. 0 -2 -2 cos 144. -2 cos 72. 0 D6h E 2C6 2C3 C2 3C2 3C2 i 2S3 2S6 sh 3sd 3sv A1g 1 1 1 1 1 1 1 1 1 1 1 1 x2 +y2, z2 A2g 1 1 1 1 -1 -1 1 1 1 1 -1 -1 Rz B1g 1 -1 1 -1 1 -1 1 -1 1 -1 1 -1 B2g 1 -1 1 -1 -1 1 1 -1 1 -1 -1 1 E1g 2 1 -1 -2 0 0 2 1 -1 -2 0 0 (Rx,Ry ) (xz, yz) E2g 2 -1 -1 2 0 0 2 -1 -1 2 0 0 (x2 -y2, xy) A1u 1 1 1 1 1 1 -1 -1 -1 -1 -1 -1 A2u 1 1 1 1 -1 -1 -1 -1 -1 -1 1 1 z B1u 1 -1 1 -1 1 -1 -1 1 -1 1 -1 1 B2u 1 -1 1 -1 -1 1 -1 1 -1 1 1 -1 E1u 2 1 -1 -2 0 0 -2 -1 1 2 0 0 (x, y) E2u 2 -1 -1 2 0 0 -2 1 1 -2 0 0 D8h E 2C8 2C8 3 2C4 C2 4C2 4C2 i 2S8 2S3 8 2S4 sh 4sd 4sv A1g 1 1 1 1 1 1 1 1 1 1 1 1 1 1 x2 +y2, z2 A2g 1 1 1 1 1 -1 -1 1 1 1 1 1 -1 -1 Rz B1g 1 -1 -1 1 1 1 -1 1 -1 -1 1 1 1 -1 B2g 1 -1 -1 1 1 -1 1 1 -1 -1 1 1 -1 1 E1g 2 v2 -v2 0 -2 0 0 2 v2 -v2 0 -2 0 0 (Rx,Ry) (xz, yz) E2g 2 0 0 -2 2 0 0 2 0 0 -2 2 0 0 (x2 -y2, xy) E3g 2 -v2 v2 0 -2 0 0 2 -v2 v2 0 -2 0 0 A1u 1 1 1 1 1 1 1 -1 -1 -1 -1 -1 -1 -1 A2u 1 1 1 1 1 -1 -1 -1 -1 -1 -1 -1 1 1 z B1u 1 -1 -1 1 1 1 -1 -1 1 1 -1 -1 -1 1 B2u 1 -1 -1 1 1 -1 1 -1 1 1 -1 -1 1 -1 E1u 2 v2 -v2 0 -2 0 0 -2 -v2 v2 0 2 0 0 (x, y) E2u 2 0 0 -2 2 0 0 -2 0 0 2 -2 0 0 E3u 2 -v2 v2 0 -2 0 0 -2 v2 -v2 0 2 0 0 650 Appendix 12 Hints for Solving Selected Problems Chapter 1 1-1. Use Eq. (1-25). 1-7. P.E.(t)=- .(x,t) 0 m.2(x, t)/.t2 d(x, t)= 1 2 m.22(x, t ). Next integrate P.E.(t) over one complete cycle (0-t ). Chapter 2 2-9. sin x sin y = 1 2 [cos(x -y)-cos(x +y)]. 2-10. What kind of function has .II=8? When could such a function join smoothly onto a sine function in region I? Chapter 3 3-1. Imagine an auto runs from A to B at 30 mph and from B to C at 60 mph. Sketch the distribution function for the auto. Then reason howyou arrived at this function and apply similar reasoning to the harmonic oscillator. 3-6. a) Seek points where H2(y) equals zero. b) Seek places where d.2/dy=0; then evaluate .2 at these points and compare. 3-15.  8 1 exp(-y2)dy ~0.10 exp(-1.052)+exp(-1.152)+exp(-1.252)+··· to convergence. 3-19. Consider howthe solutions for the harmonic oscillatorwould meet the conditions imposed by this new potential. 651 652 Hints for Solving Selected Problems Chapter 4 4-3. The classical turning point occurs when the total energy equals the potential energy. 4-11. E =V (r) when r equals classical turning point. 4-12. Ignore all but the . and f dependences in Eq. (4-30). Do not forget to square these dependences, and do not forget to include the 31/2 term of 3d2 z . 4-17. x =r sin . cosf. 4-18. a) Do not forget to include . dependence of dv. 4-26. Refer to Problem 2-11. 4-37. µ should be in units of kg/molecule, and you have masses in a.m.u. These can be taken directly as g/mole and then converted. Chapter 5 5-4. Square . and integrate, using the fact that 1s, 2s are orthonormal. Chapter 6 6-9. b) * must be same at t =0 and t =1/.. 6-11. Use the Schmidt orthogonalization method. 6-13. a) Means: is . an eigenfunction of the momentum operator? 6-18. Wavefunction is not normalized. 6-24. Use that exp(ikx)=cos(kx)+i sin(kx). Chapter 7 7-4. a) Note that, for symmetric .p ,  .fdx = 2  L/2 0 .nf dx. For antisymmetric .n, the integral can be evaluated by inspection. A useful integral is  x sinx dx =sin x -x cos x. c) Use the fact that E = nc2n En. The series can be estimated with a small calculator (tedious) or else by summing the first few terms and integrating over a function that envelopes the higher terms. 7-11. Use the fact that f = ici.i and E = ic*i ciEi . 7-13. b) Take limit as F .0 rather than simply evaluating at F = 0.1. Note that (1+x)m =1+mx +[m(m-1)/2!] x2+···. 7-16. Do not forget that overlap between fa and fb must enter normality condition: c2 a +c2 b +2cacbS =1 7-17.  f2(-1/ra)dv=-1+(. +1) exp(-2.) using the method of Appendix 3. Appendix 12 653 7-19. Note that .+ and .- are degenerate at R=8. 7-20. In the limit of R.0,HAA is indeterminate. Use l’Hospital’s rule (i.e., take d/dR on the numerator and denominator and evaluate at R=0). Alternatively, you can expand exp(-2R) in powers of R and evaluate at R =0. 7-27. Note that (f) and (g) have bothAOs on center a. When Hˆ is present in the integral, you are restricted to considering symmetry operations that do not affectHˆ . 7-34. Use the Schmidt orthogonalization procedure (Chapter 6) to construct 2s. Chapter 8 8-15. b) Simply note where HOMO is bonding, antibonding, nonbonding, and recognize that some of the effect of this MO will be lost upon ionization. Chapter 12 12-2. Recall that perturbation should be greatest for states with .2 largest in region of perturbation. 12-7. To evaluate the first-order correction to the energy, you can recognize that 1s| - 1/r|1s is identical to the potential energy of the H atom. The virial theorem tells you the value of this quantity at once. 12-16. Use the fact that Ep =a MOs i ni +2ß MOs i neighbors k1, there are more wiggles, but the value of the integral over sin2 still equals L/2. 2-4. 0.6090, 0.1955, 13 , 13 . 2-5. a) Use height times width of this narrow rectangle in .2. Height = (v2/L) sin(0.5pL/L) 2=(2/L) sin2(p/2)=2/L. Width=0.01L. Area= 0.02. This is 2% of the probability density in 1% of the box width. That is twice the classical probability, which is uniform in the box. b) (2/L) sin2(p/3) times 0.01L=0.015. 1.5 times classical. 2-6. a) 1/5. (There are five equal hills in .2 5 , and 0=x =L/5 contains one of them. b) Smaller. .2 1 (Fig. 2-5) is small in this region (less than 1/L in value) and integrates to less than 1/5. 2-7. a) S. b) -A. c) SS. d) AA. e) -AS. f) AASASSA. g) -AASASAA. Rule: Product antisymmetric when odd number of antisymmetric functions is present. 2-8. All are zero by symmetry except d), e), h). i) is integral of 2 sin2 x cos x which is SA. j) is integral of sin2 x cos x which is SS from -p to p, but SA in each half of the range (i.e., like i). If each half must give zero, so must the whole. 2-9. [See Hint.] (2/L)  L 0 sin(npx) sin(mpx)dx = (using Hint) (1/p)  p 0 {cos[(n- m)y]-cos[(n+m)y]}dy =0, for n and m integers and n =m. 2-10. [See Hint.] .II=8, and so .II is a constant. A constant has a zero derivative, and .1 must arrive at x =L with zero derivative if successful junction is to be made. This requires that an odd number of quarter-waves fit between 0 and L (so wave arrives with either a peak or a valley at x = L). This requires that [(2n + 1)/4].I = L; .I = 4L/(2n + 1) = h/v2mU; U = (2n + 1)2h2/32mL2 is the relation between U and L that is required for a state to exist at E = U. (Strictly speaking, one can only approach .=8as a limit, and this problem is physically meaningless. However, it makes a good exercise.) 2-11. Given that H.1 = E1.1,H.2 = E2.2,E1 = E2 = E and f = c1.1 + c2.2. Then Hf =c1H.1 +c2H.2 =c1E1.1 +c2E2.2 =E(c1.1 +c2.2)=Ef. Q.E.D. 2-12. a) . should oscillate on right with same . as on left. b) . should be symmetric or antisymmetric (and . should be same in each side). c) . should be smooth (i.e., have no cusp) at finite barrier. d) . should be a decaying exponential at right. e) Same as d). . should not become infinite. 2-13. a) .4 is two sine waves. b) E4 is the same as E2 in a half-box of width L:E4=22h2/8mL2=h2/8mL2. Or, one can also calculate it as the n=4 solution with width =2L. 656 Answers to Problems 2-14. For exponential eigenfunctions, .*. = exp(ijf)]*[exp(ijf) = exp(-ijf) exp(ijf)=exp(0)=1. This gives a “rectangle” of height 1 and width 2p, hence area 2p and normalizing constant 1/v2p. For sine or cosine functions, we get a squared function that oscillates from 0 to 1, gives us half the area, namely p, and yields normalizing constant 1/vp. 2-15. exp(iv2f) does not join onto itself when f.f +2p. 2-16. a)H =-(h2/8p2I)d2/df2. b)H.=(9h2/8p2I).. c)[(h/2pi)d/df]. = constant times .. Not a constant of motion. 2-17. The barrier forces solutions to vanish at f = 0. Of our four choices sin(kf), cos(kf), exp(±ikf) only the sin(kf) set has this property. Therefore a) E=0 is lost. b) All degenerate levels become nondegenerate. c) No, only sine solutions exist. d) No, sine solutions are not eigenfunctions of the angular momentum operator. 2-18. a) Enx,ny =(h2/8m)(n2x /L2x +n2y /L2y )]=(n2x +4n2y )h2/8mL2x . b) Zero point energy =E1,1 =5h2/8mL2x c) nx 1 2 3 1 4 2 3 5 4 ny 1 1 1 2 1 2 2 1 2 E/(h2/8mL2x ) 5 8 13 17 20 20   25 29 32 degeneracy 1 1 1 1 2 1 1 1 d) e) i) All eigenvalues increase by 10J . (ii) No effect on eigenfunctions. 2-19. Both 3h2/4mL2x . Accidental degeneracy results when nodes that are not equivalent by symmetry nevertheless give the same energy. 2-20. a) Yes. The two eigenfunctions being mixed are degenerate (52 + 12 + 12 = 32 +32 +32), so the linear combination remains an eigenfunction. (See Problem 2-11.) b) Instead of integrating, we can take the value of .2 V because V is small enough to make .2 essentially constant in it. .2 = (v2/L)6 sin2(p/2) sin2(p/2) sin2(p/2) = 8/L3; V = 0.001L3;.2 V = 0.008. The classical value (uniform distribution) is 0.001, so the quantum-mechanical probability for finding the particle at the center is 8 times greater than classical. (Not coincidentally, this is the cube of the answer for the one-dimensional analog in Problem 2-5.) Appendix 13 657 2-21. E =(6+n)2 -(5+n)2h2/8mL2 . = (8mcl2/h)(2n+10)2/(2n+11)=637(2n+10)2/(2n+11)Å n : 0 1 2 3 . : 5791 7056 8323 9592 2-22. Momentum is a constant of motion if [(h/2pi)d/dx]. = constant times .. This applies for b) and d), and the momentum equals 3h/2p and -3h/2p, respectively. Kinetic energy is equal to k2h2/8p2m, or 9h2/8p2m for all four cases. 2-23. The z component of angular momentum is a constant of motion if [(h/2pi)d/df]. = constant times .. This applies for b) and d), and the angular momentum equals -3h/2p and 3h/2p, respectively. Kinetic energy equals 9h2/8p2l in all four cases. 2-24. . and d./dx are real at x = a. (. = .*)x=a;Aexp(ika) + B exp(-ika) = A* exp(-ika)+B* exp(ika); so Aexp(ika)-A* exp(-ika)=B* exp(ika)- B exp(-ika). (d./dx)=(d./dx)*x=a; Aexp(ika) + A* exp(-ika) = B* exp(ika) + B exp(-ika); adding these two equations gives Aexp(ika)=B* exp(ika), so A = B*. Subtracting the two equations gives A* exp(-ika)=B exp(-ika), soA*=B. (Or we could take the complex conjugate of the previous equation.) Then, |A|2/|B|2 =A*A/B*B =A*A/A*A=1, so |A|=|B|. 2-25. Matching values at x =0 gives A+B =C. Matching slopes gives kA-kB = kC. Using the first of these to eliminate C from the second gives the first part of (2-73). Using the first to eliminate B from the second gives the second part of (2-73). 2-26. NowAis zero, B =C +D,-kB =kC -kD,C/D=(k-k)/(k+k),B/D= 2k/(k +k). 2-27. a) |C|2/|A|2 gives the relative spatial densities of transmitted to impinging particles, but we need relative fluxes, which depend on both particle densities and velocities. The relative velocities are the same as the relative momenta (since masses are the same), and these are in turn the same as the relative k values. So k/k gives the relative velocities of impinging and transmitting particles. For reflecting particles, the analogous ratio is k/k, so it does not appear. b) k|C|2/k|A|2 + |B|2/|A|2 = 4kk/(k + k)2 + (k - k)2/(k + k)2 = (k +k)2/(k +k)2 =1. 2-28. 100% transmission occurs when an integral number of de Broglie half-waves fit in x =d;n./2=d; also, .=h/p=h/v2T =h/v2m(E -U). Equating these expressions for . : 4d2/n2 = h2/ [2m(E -U)] ;(E/U)-1 = n2h2/(8d2mU). Substituting for U on the right: (E/U)-1 = n2p2/16. When n = 1,E/U = 1.617. When n=2,E/U =3.467. (Compare with Fig. 2-18b.) 2-29. E1 0.805cm-1, E2 36cm-1. 658 Answers to Problems MC: c d e a e b d b d a c d d b a Chapter 3 3-1. [See Hint.] P(x) is proportional to 1/v(x), which is [dx(t)/dt ]-1, which is -vk/mLsin(vk/mt)-1. This is proportional to sin2( k/mt)-1/2 =1-cos2( k/mt)-1/2 =1-x2/L2-1/2 The normalized probability distribution function is (pvL2 -x2)-1. 3-2. a) x(t)=0.100 cos(v2t), t in seconds, x in meters. b) -0.0453m. c) 1.00× 10-2 J. d) 2.00 × 10-3 J. (e, f) 5.00 × 10-3 J. g) 0.126ms-1. h) ±0.100m. i) 0.225 s-1 3-3. x=±(n+1/2)h/p)/vkm 1/2 . 3-4. a) (vß/p)1/(2nn!) 1/2 is the normalizing factor. It keeps the total probability density equal to one. Hn(y) is a Hermite polynomial. It provides the nodes in the wavefunction. Exp(-y2/2) forces . to decay at large values of x. It gives the correct asymptotic behavior. b) .0(y) = 4 ß/p exp(-y2/2) .1(y) = 4 ß/p(1/v2)2y exp(-y2/2) .2(y) = 4 ß/p(1/v8)(4y2 -2) exp(-y2/2) 3-5. H.0 = -h2/(8p2m) d2/dx2 +kx2/2 (ß/p)1/4 exp(-ßx2/2) = h2ß/(8p2m)-ß2h2x2/(8p2m)+kx2/2.0 = (h/4p) k/m-kx2/2+kx2/2.0 = (h/2)(1/2p) k/m).0 =(hv/2).0. 3-6. [See Hint.] a) y=±v1/2. b) y=±v5/2. 3-7. The first function is asymmetric; the second becomes infinite in both limits of y. 3-8. a) Approaches zero. Decaying exponential overwhelms polynomial. b) Antisymmetric. Polynomial is antisymmetric and exponential is symmetric. c) Value =0, slope =120. 3-9. a) c0 =7. b) c1 =0. 3-10. a) Not, because missing y3 term should cause the polynomial to terminate, not permit y5 term. b) Not, because of mixed symmetry. (Powers 5, 3, 1, 0.) c) OK. 3-11. 3-12. a) Zero (integrand antisymmetric). b) vp/2 (symmetric). Appendix 13 659 3-13. Zero (integrand antisymmetric). 3-14. a) xtp =±3h/(2pvmk) 1/2 =±v3/ß. b) .2 is maximum at y =±1, or x = ±1/vß = ±h/(2pvmk). c) 1/2. d) 0.0144 [from (2/e)vß/p times 0.02v3h/(2pvkm)1/2]. 3-15. [See Hint.] 2  8 a vß/p exp(-ßx2)dx = (2/vp) 8 1 exp(-y2)dy ~ 0.156, where a =1/vß. 3-16, 17. H2(y)=4y2 -2. 3-18. Zero point energy = 3 2h.; 10; 3. 3-19. [See Hint.] The barrier requires . = 0 at x = 0 but does not affect Hˆ at x > 0. Therefore, antisymmetric solutions of harmonic oscillator are still good. Results are .x>0 =v2.harm,osc,n=1, 3, 5, . . . , and E =(n+ 1 2 )h., n= 1, 3, 5, . . . .x=0 =0. 3-20. a) 40. b) -1. 3-21. V 5 =E5/2=(11/4)h. =T 5. 3-22. For each unit of energy going into vibration, half goes into kinetic energy, which registers as a rise in temperature, and half is “hidden” as potential energy. None is hidden as potential energy in rotation or translation. 3-23. Using nominal nuclear masses gives k values of 958, 512, 408, 311Nm-1, respectively. Decreasing bond stiffness implies decreasing bond strengths. (Observed D0 values are, in kJ mol-1, 564, 428, 363, 295, respectively.) MC: e e d, c c Chapter 4 4-1. E2 - E1 = 2.46737 × 1015 Hz using me. Using µ gives 0.999455 times this value, a difference of 545 ppm. 4-2. The radial distribution function vanishes at r =0 because 4pr2 is zero there and again at r =8 because the decaying exponential of .2 overwhelms the finite power term 4pr2. 4-3. [See Hint.] E1s=-1 2 a.u. V (r)=-1/r a.u. Equal when r =2 a.u. 8 2 .2dv= 0.238, and so 23.8%. 4-4. a) 1/Z a.u. b) 3/2Z a.u. c) 0 (by inspection of .2). d) -Z2 a.u. Differs from -Z times the reciprocal of b) because average of 1/r =1/(average of r). 660 Answers to Problems d) is lower because 1/r blows up at small r, contributing large (negative) amounts of potential energy. 4-5. (1/v2) 8 0 exp(-r)(2-r) exp(-r/2)r2dr =. . . 0. (Do not forget r2 from dv.) 4-6. p-1/2 is the normalizing factor. 4-7. ¯x = +8 -8 .2x dx. .2 is symmetric, x is antisymmetric, and ¯x =0. 4-8. a)  L 0 .n(x - L/2)2.n dx. b) Expect it to increase as n increases, approaching a limiting value. This because .2 favors the box center for lower states and approaches classical (uniform) distribution as n increases. c) L2(1/12 - 1/2n2p2), n=1 : 0.03267L2, n=2 : 0.070668L2. Approaching L2/12, which is the classical value achieved if .2 in a) is replaced by 1/L. 4-9. For reflection in the xy plane, 2pz is antisymmetric, and 3dxy is symmetric. 4-10. a) 4/Z a.u. b) 5/Z a.u. c) 2/Z a.u. d) -Z2/4 a.u. 4-11. [See Hint.] rtp =2n2/Z. 4-12. [See Hint.] The sum of squares of angular dependencies equals 4 3 . Use trigonometric identities to remove angle dependence. 4-13. a) -1/2 a.u. b) -2 a.u., -1/2 a.u. c) 25 d) 2, 1 e) -2 a.u. 4-14. a) l =m=0. (No angular dependence: must be s.) b) 2 (quadratic in r). c) -1/18 a.u. (2 nodes means it is 3s, so n=3.) d) 18 a.u. 4-15. a) Yes, at r =6 a.u. b) 3py(r sin . sin f =y, so it is py, and there is one radial node). 4-16. a) Looks OK. b) No. Blows up at large r. c) No. Lacks exponential decay function in r. 4-17. [See Hint.]  1s2r sin . cosf dv = 0 (because cosf is antisymmetric in each subrange 0–p, p–2p). The average value of x should be zero because the electron is equally likely to be found at equal ±x positions due to the spherical symmetry of .2. 4-18. [See Hint.] a) .mp = 35.15, 144.45. b) Angular nodes come where (3 cos2 . -1)=0. . =54.74., 125.26.. 4-19. xy =(r sin . cosf)(r sin . sin f)=r2 sin2 . cosf sin f =(r2 sin2 . sin 2f)/2. 4-20. Equation (4-45) predicts 2/3 and 0 for these integrals. Actual integration over x2 and over 5x4/2-3x2/2 gives 2/3 and 0. Integral over all space for 3p03d-1 involves integral from 0 to p of P0 1 (cos .)P-2 2 (cos .) sin. d., which is same as integral from 1 to -1 of P0 1 (x)P -1 2 (x) dx, so the integral vanishes. 4-21. 15/(2v10p (1-cos2 .) cos . exp(-2if), f-2. 4-22. LˆzYl,m(.,f) = (h¯/i)(d/df) l,m(.) exp(imf) = mh¯ l.m(.) exp(imf) = mh¯ Yl,m(.,f). Appendix 13 661 4-23. ˆ Lx2p0 = ˆ LxR(r) cos . =-iR(r) sin . sin f =constant ×2p0 ˆ Ly2p0 = iR(r) sin . cosf =constant ×2p0 Lˆx1s = 0=Lˆy1s=Lˆz1s=Lˆ2ls ···=0×ls Since the vector has zero length, its x, y, z components must also have zero length. The question of vector orientation becomes meaningless. 4-24. [-1/(sin .)(d/d.) sin .d/d.]R(r) cos . =2R(r) cos ., l(l +1)=2, l =1. 4-25. Same for 6f 4-26. [See Hint.] Will be eigenfunctions in cases b) and e). Only these correspond to mixing degenerate eigenfunctions. 4-27. a) v6¯h or v6 a.u. b) d-2. 4-28. a) 0. b) -3¯h or -3 a.u. 4-29. a) 0. b) 2¯h2 or 2 a.u. c) 0 (same as if all x’s were z’s). d) 0 (by symmetry). e) 0. 4-30. a) 6.3,2,1. b) 2.2px. c) (-1/18).3px. d) -.2p-1 . 4-31. a)Yes, -1/18. e)Yes, 0 (same as Lˆz.3pz ). f)Yes, 2. All others, No. 4-32. 1.44×10-3 a.u. 4-33. 2.80×1010 Hz, 11.4 ppm. 4-34. a) -1/32. b) 16. c) 0, v2,v6,v12. d) 7. e) 4. 4-35. µr2 =?=m1r2 1 +m2r2 2 . Chug, chug, Bingo. 4-36. Using Eq. (4-56), we write Eq. (4-68) as (Lˆ2/h¯2). =(-2IE/h¯2).; (Lˆ2/2I )= E.; since there is no potential term, E is all kinetic, so this completes the demonstration. 4-37. [See Hint.] First find µ in a.m.u., then divide by 6.0221 × 1026 to convert to kg/molecule. µ=1.6529×10-27 kg. r =vI/µ=1.4144×10-10 m. 4-38. 10.976cm-1, 21.953cm-1, 32.929cm-1. 4-39. 1.129×10-10m(µ=1.1385×10-26 kg, I =1.4504×10-46 kg m2). 4-40. J =3, so mJ =3, 2, 1, 0,-1,-2,-3. The states split into seven equally spaced levels. 4-41. v3/2, or 0.866 a.u. 662 Answers to Problems MC: e b b e d e b c c Chapter 5 5-1. -1 2 (.2 1 +.2 2 +.2 3 )-3/r1 -3/r2 -3/r3 +1/r12 +1/r23 +1/r13. 5-2. ¯r1s =3/2Z =(for He+) 3 4 a.u., ¯r2s =6/Z =(for He+) 3 a.u. 5-3. E = 2.343 × 105 eV compared with an IE of 13.6 eV. It shows that . small enough to locate the electron with useful precision involves photons with energy sufficient to excite the electron completely out of the system. 5-4. [See Hint.]  .2dv = 1 2  [1s(1)22s(2)2 + 2(1s(1)2s(1)2s(2)1s(2)) + 2s(1)21s(2)2]dv(1)dv(2)= 1 2 (1 · 1+2 · 0 · 0+1 · 1)=1. 5-5. .a(2.1)=(1/v2)[1s(2)2s(1)-2s(2)1s(1)]=-.a. 5-6. Upon substitution and expansion, complete cancellation occurs. 5-7. (1/v6)[1s2p1s(aßß -ßßa)+1s1s2p(ßaß -aßß)+2p1s1s(ßßa -ßaß)]. 5-8.  1s*2sdv  1s*1sdv  2s*1sdv  a*ad. ß*ad. a*ßd. = 0 · 1 · 0 · 1 · 0 · 0=0. 5-9. For r1 =1, r2 =2, r3 =0 get .(1, 2, 0) = (1/v6)[1¯s(r =1)2pz(r =2)1s(r =0)+2pz(r =1)1s(r =2)1¯s(r =0) -2pz(r = 1)1¯s(r =2)1s(r =0)-1s(r =1)2pz(r =2)1¯s(r =0)] The other cases are the same except for factor of -1. Thus, .2 is identical for all three cases, and no physical distinction exists. 5-10. Replace each a in the lowest row with ß. 5-11. a) aa, ßß, .., aß +ßa, a. +.a, ß. +.ß b) aß -ßa, a. -.a, ß. -.ß 5-12. a) Yes. Antisymmetric for exchange of any two electrons. b) -1/2(.2 1 +.2 2 +.2 3 )-3/r1 -3/r2 -3/r3 +1/r1,2 +1/r2,3 +1/r1,3. c) No. It is a product of one-electron orbitals, hence an independent-electron solution, but H is not separable. d) (-9/2)(1+1/4+1/9)=-6.125 a.u. e) -3/2 a.u. 5-13. Equation (5-41) with U1 =1s, U2 =1¯s, U3 =2s, U4 =2¯s. 5-14. F, 1s22s22p5, Z =9, .1s =8.7, .2s =.2p =2.6. 5-15. LetAˆf=af and  f*fdt =1. (Aˆ)av = f*Aˆfdt = f*af dt =a f*f dt =a. 5-16. Li2+ is a hydrogenlike ion, and hence should have all states of same n degenerate. Li differs in that potential seen by electron is not of form -Z/r, due to screening of nucleus by other electrons. Hence, degeneracy is lost. The 2s AO of Li is lower than the 2p due to the fact that the 2s electron spends a larger fraction of time near nucleus where it experiences full nuclear charge. 5-17. 1 2 1 2 +1=3 4 a.u. Appendix 13 663 5-18. n electrons of a spin give one state. Each time one a is changed to a ß we get a different state. There are na spins available to change. 5-19. Symmetric combination gives e2 +8e either way. Antisymmetric gives e2 -8e one way and -(e2 -8e) the other. 5-20. a) 2s1/2( 2 states). b) 2p3/2,2p3/2,2s1/2( 8 states). 5-21. a) Not satisfactory since electron 1 is identified with the 1s AO, etc. [1s(1)3d2(2)-3d2(1)1s(2)]a(1)a(2). b) 3D3. 5-22. 120. 5-23. a) 2P3/2, 2p1/2. b) 2. c) 4, 2. 5-24. a) a(1)ß(2) - ß(1)a(2). b) -22.5 a.u. c) -22.5 a.u. + J1s,2P1 + K1s,2P1 . d) Eigenvalue for S2 is 0. 5-25. All in a.u.: a) 6. b) 2. c) 12. d) 2, 1, 0,-1,-2. e) 1, 0,-1. f) 3, 2, 1, 0,-1, -2,-3. 5-26. a) In both cases, maximum net z components are: spin =1, orbital=1. b) 3P0 below 3P1 below 3P2 for p2 case, reverse for p4. c) Each pairing reduces multiplicity to m+1, where m is number of unpaired electrons, which equals the number of holes. Each pair’s contribution to ML due to their orbital’s ml value is either totally canceled by a pair at -ml or else half-canceled by an unpaired electron at -ml . Therefore, the uncanceled portions reflect the presence of unpaired electrons, hence holes. Therefore, the amount of uncanceled ml is equal to minus that which we could assign to the holes. 5-27. a) 2S1/2. b) 4S3/2. c) 1S0. d) 3F2. 5-28. a) 12. b) 3P2, 3P1, 3P0, 1P1. 5-29. a) 2P1/2 below 2P3/2. b) 4S3/2 below 2D3/2, 2D5/2 below 2P1/2, 2P3/2. (Rules do not allow us to sort by J in this case since shell is exactly half filled.) 5-30. Striving for maximum multiplicity means avoiding pairing electrons. 5-31. a) 15. b) 35. 5-32. No. Impossible for doublet and singlet combinations to arise from same number of electrons. Each pairing reduces number of unpaired electrons by two, so allowed multiplicities are all even or all odd. 5-33. a) 20. b) 12. c) 6. d) 60. e) 100. 5-34. a) 28. b) 4F9/2, 4F7/2, 4F5/2, 4F3/2. 5-35. 3D3. Also 3D2, 3D1. 5-36. 2g2 -g, 15, 45. 5-37. Splittings equal 9.274×10-24 J times g, with g equal to 4/3, 7/6, 1/2, 1, respectively. 664 Answers to Problems MC: d e e d Chapter 6 6-1.  8 -8 .*(d2/dx2)fdv ?=  8 -8 f(d2/dx2).*dv. Use 8 -8 vdu = uv|8-8 -  8 -8 udv. On the left, let v=.*, u=df/dx, du=d2f/dx2, dv=d.*/dx. On the right, let v =f, u=d.*/dx, etc. The uv term vanishes since .* and f each vanish at limits. The remaining integrals are identical. 6-2. Each equals -4v8/27. 6-3.  .*.dv =1,  .*i .jdv =di,j , . =ici.i . Then  .*.dv =1= ic*i .*i j cj.jdv =ij c*i cj dij =ic*i ci . Q.E.D. 6-4. . =iciµi ,  µ*i µjdv =di.j , want ck:  µ*k .dv = µ*k iciµidv =ici  µ*k µidv =icidk,i =ck 6-5. a) (1/p)  2p 0 cos 2f(¯h/i)(d/df) cos 2fdf = -(2/¯hpi)  2p 0 cos 2f sin 2fdf . 2p 0 sym · antisym = 0 b) . =(1/v2) (1/v2p)exp(2if)+(1/v2) (1/v2p)exp(-2if) Terms in [] are normalized eigenfunctions of Lz with eigenvalues of +2¯h and -2¯h. So (Lz)av =(1/v2)2(2¯h)+(1/v2)2(-2¯h)=0. 6-6. [x,px] = [x(¯h/i)(d/dx) - ( ¯h/i)(d/dx)x]f (x)=( ¯h/i) xf  -f -xf =-( ¯h/i)f x · px = 1 2   .*(-¯h/i).dt =|-¯h/2i|= ¯h/2 6-7. f must be identical to the eigenfunction .0. 6-8. No. The existence of some real eigenvalues does not guarantee that the operator satisfies the definition of hermiticity: (d/dr) exp(-ar)=-a exp(-ar) but  8 0 exp(-ar)(d/dr) exp(-br)r2dr =  8 0 exp(-br)(d/dr) exp(-ar)r2dr if a = b 6-9. a) Each side equals -(1/2v2).1s exp(it/2)-(1/8v2).2p0 exp(it/8). b) [See Hint.] (*)t=0 = (1/2)(1s2 + 21s2p0 + 2p20 );(*)t=1/. = (1/2)(1s2 + 2 cos(3/8.)1s2p0 + 2p20 ). Equal when 3/8. =2p;. =3/16p. E = (1/2)(1 - 1/4) = 3/8 a.u.; E = h. = 2ph./2p = 2p. ¯h = 2p. in a.u. . = E/2p =(3/8)/2p =3/16p. Appendix 13 665 6-10. (1-S2)-1/2. 6-11. [See Hint.] S =v3/2, and so f =(2/v3p)(r - 3 2 ) exp(-r). 6-12. a) 8v8/27=0.838. b) 0 (by symmetry). 6-13. [See Hint.] a) ˆ px.0 =-(ß ¯hx/i).0 =constant times .0. Momentum is hence not a constant of motion. b)  .0*pˆx.0dx=-(ßh¯/i)  .0*x.0dx= (sym)(anti)(sym)=0 (Must be zero since otherwise motion in one direction would involve greater momentum than motion in the other.) 6-14. First pair: No. x2(d2/dx2) + x3(d3/dx3) = 2x(d/dx) + 4x2(d2/dx2) + x3(d3/dx3). Second pair: Yes. Both arrangements give 2x2(d2/dx2) + x3(d3/dx3). 6-15. a) 6 a.u. (since l =2). b) 0 (since real form of . involves equal mix of ±ml ). 6-16. a) Because these operators commute, yes, there must be a set of simultaneous eigenfunctions. b) Mixing degenerate-energy cases gives functions that remain eigenfunctions for the energy operator but not for the momentum operator. (These are the sine and cosine versions.) c) Exactly knowable because knowledge of momentum gives knowledge of an eigenfunction which in turn gives knowledge of energy. 6-17. E =(1/3)(-1/2-1/8-1/18)=-0.2268 a.u. 6-18. [See Hint]. Normalized . = 0.26726[1s + 2(2p1) + 3(3d2)]. Lˆz = 0.267262[0+4(1)+9(2)]=1.571 a.u. 6-19. Proofs are in Sections a) 6-8, b) 6-9, c) 6-11. 6-20. ||2 =(1/2) |.1|2 +|.2|2 +2|.1.2|cos[(E2 -E1)t/¯h] . 6-21. Cycle time =mL2/h=mL2/2p a.u. 6-22. cn =(2v2/L)  L/2 0 sin(2px/L) sin(npx/L)dx. a) v2.2 and  are same function in range 0=x =L/2, guaranteeing c2 =v2. No other c can be larger or the resulting function will give a total probability density greater than 1. b) .4 and  have opposite symmetry for 0=x =L/2. c) As .i becomes more oscillatory, the positive and negative portions of its product with  will cancel more effectively. 6-23. a) c1 =0.838, c2 =0.2048. b) Whereas only s-type AOs appear in Eq. (6-41), explicit account of changing potentialwould yield a nonspherically symmetric potential so that p-typeAOs would enter too. 6-24. [See Hint.] ck is small if there is effective cancellation between positive and negative portions of the product of exp(-ax2) and cos(kx). Larger k makes cos(kx) more oscillatory and makes cancellation more effective. The broader exp(-ax2) is, the more effective is this cancellation for a given value of k. 666 Answers to Problems 6-25. a) Real V means realHˆ . Hˆ(x, y, z, t)=-(h¯/i)(./.t)(x, y, z, t). Complex conjugate of equation gives Hˆ*(x, y, z, t) = (h¯/i)(./.t)*(x, y, z, t). Transform t .-t throughout (does not affect equality), then recognize ./.(-t) = -./.t , so Hˆ*(x, y, z,-t) = -(h¯/i)(./.t)*(x, y, z,-t). Q.E.D. b) (x, y, z, t) becomes .(x, y, z) exp(-iEt/¯h). *(x, y, z,-t) becomes .*(x, y, z) exp[iE(-t)/¯h] = .*(x, y, z) exp(-iEt/¯h). Carrying these through Eqs. (6-3,4,5) shows that Hˆ .* =E.*. c) If . and .* are independent, we have two solutions with the same energy, i.e., degeneracy. If E is not degenerate, . and .* are not different, so . =.* and is real. d) 2p-1 becomes -2p+1. No change for 2p0. e) No. While all complex eigenfunctions must be degenerate, real eigenfunctions can be degenerate too. Cases we have seen are (1) accidental degeneracies such as .3,3,3 and .5,1,1 in the 3-dimensional cubic box and (2) real eigenfunctions constructed as linear combinations of complex ones, as 2px and 2py from 2p+1 and 2p-1. MC: c a d Chapter 7 7-1. a) Yes. c1 = v2/L L 0 f (x) sin(px/L)dx, c2 = v2/L L 0 f (x) sin(2px/L) dx. c1 =0 by symmetry. c2 is positive. b) No. Both a) and b) are continuous, smooth, single-valued functions, but b) does not go to zero at x =0,L as do all the box eigenfunctions. 7-2. a) b) 1.3h2/8mL2. 7-3. c1 =v0.4=0.632, c3 =0 by symmetry. 7-4. a) [See Hint.] For n=odd, cn =2  L/2 0 f.dx=± 4v6 n2p2 +for n = 1, 5, 9, 13, . . . -for n = 3, 7, 11, 15, . . . For n=even, cn=0 (by symmetry) (i.e., f is symmetric and so contains only symmetric .n). b) fapprox(x =L/2) = m  n=1 cn.n(x =L/2) = (4v6/p2) 2/L m  n=1(odd) (1/n2) Appendix 13 667 m: 1 3 5 7 9 . . . 135 vLfapprox(x =L/2): 1.40395 1.55994 1.61609 1.64475 1.66208 1.72689 vLf(x =L/2)=v3=1.73205. c) [See Hint.] E¯ =  odd n (4v6/n2p2)2(n2h2/8mL2)=(12h2/p4mL2) odd n (1/n2) = (12h2/p4mL2)  m  odd n=1 (1/n2)+ 1 2  8 m+1 (1/x2)dx  = [(h2/8mL2)(96/p4)(1.23386)]m=9 = [1.21602(h2/8mL2)]m=9;[1.21432(h2/8mL2)]m=135 7-5. Normalizing factor = (2a/p)3/4; E¯ = (3a/2) - 2v2a/vp; a = 8/9p; E¯(min)=-4/3p =-0.4244 a.u.; r¯ =1.5 a.u.; rmp =v9p/4=1.329 a.u. (For .exact, E=-0.5 a.u., ¯r =1.5 a.u., rmp =1.0 a.u.) 7-6. a) and b) See text and Eqs. (7-16)–(7-20). c) a = 53 ,E¯ =-0.370 a.u. 7-7. a) 1/v8. b) -0.292 a.u. 7-8. a) Zero, because the function is symmetric for xy reflection whereas 2pz is antisymmetric. b) -0.4215 a.u., (by assuming c3s is v0.05 and all higher terms vanish). 7-9. a) . =5/3,E¯ =-0.2777 a.u. b) c1 =0.897. c) . =2.26(f -0.897.1s). d) No. . cannot have E¯ lower than the n=2 value, which is the lowest-energy case orthogonal to n=1 and which has E=-0.125 a.u. 7-10. -0.75 a.u. 7-11. [See Hint.] The energy E¯ =-0.375 a.u.=(0.9775)2(-1 2 a.u.)+higher-energy contributions. But this leading term equals -0.478 a.u., and so the net value of the higher energy terms must be positive. Therefore, at least one of them must correspond to a state with positive energy—a continuum state. 7-12. Slater’s rules give . = 1.7, whereas the variation method gives . = 27/16 = 1.6875. 7-13. a) S11 = 1, S12 = 0, S22 = 1, H11 =-1 2 , H12 = -F, H22 = 0. E¯ = -1 4 - 1 4 v1+16F2; for F = 0.1,E¯ = -0.51926 a.u. This trial form is superior because z · .1s is more contracted than .2pz , closer in size to .1s, hence interferes constructively and destructively with .1s more effectively. b) [See Hint.] lim(F .0) of v1+16F2 =1+8F2, in lim, E¯ =-1 2 -2F2, -1 2aF2=-2F2; a =4. E(e2/a0).aF2 (a units) ·(e/a2 0 )2, a units =a3 0. (See Appendix 10.) 7-14. Since 2s is isoenergetic with 2p states, these should mix freely in response to field. Hence, 2s is more polarizable. 668 Answers to Problems 7-15. E¯(min)=-12/5=-2.4 a.u. . =v2/5(fa +fb). 7-16. [See Hint.] E¯(lowest)=-2.030 a.u. . =1.045fa -0.179fb. 7-17. [See Hint.] E¯elec = (. 2/2) - 2 + 2(. + 1) exp(-2. ); .best = 0.9118, E¯elec = -0.9668 a.u.; E¯elec + 1/R = E¯tot = -0.4668 a.u. Since this energy exceeds that of H +H+ (-0.5 a.u. at R=8), this function does not demonstrate the existence of a bound state. 7-18. 1su becomes a 2p AO of He+, so E=-0.5 a.u. 7-19. [See Hint.] Since they are degenerate, .+ and .- may be mixed. The sum gives 1sa, describing the case in which the electron is at a. The difference gives 1sb. 7-20. [See Hint.] Both equal -3 2 a.u. This is higher than the lowest He+ eigenvalue because these are hydrogen atom 1s functions instead of He+ functions. 7-21. k must be greater than S. 7-22. a) Hˆ =-1 2.2 -(1/rH)-2/rHe. b) Separated atoms: lowest energy for H+ +He+(1s)=-2 a.u. For the united atom; Li2+(1s)=-4.5 a.u. 7-23. a) pu bonding. b) su antibonding. c) sg bonding. d) dg bonding. e) pu bonding. 7-24. 1ssg 2pzsu 3pypu 3dxydg 2ssg 2pxpu 3dz2sg 3dxzpg 7-25. a) Antibonding. b) Bonding. c) Bonding. 7-26. 2px, 2py, 3px, 3py, 3dxy, 3dyz. 7-27. [See Hint.] The integrals that vanish by symmetry are b), c), e), and f); g) does not vanish. The AOs are orthogonal due to different symmetry for reflection in the xy plane at a. But Hˆ is not invariant to this reflection. In f), the relevant reflection is through the yz plane; Hˆ is invariant to this one. 7-28. Sketches show that several planes qualify, but symmetries are opposite no matter which is chosen. For instance, if the xz plane is selected, dx2-y2 is symmetric, dxy is antisymmetric, pxz is symmetric, pyz antisymmetric. 7-29. 0. (It is a s MO.) 7-30. a) su antibonding. b) pu bonding. c) pg antibonding. d) dg bonding. 7-31. a) 4. b) 3 (a triplet). c) (1) increase, (2) decrease. d) 2 (a doublet). 7-32. a) 1s2 g 1s2 u 2s2 g 2s2 u 3s2 g 1p4 u 1p1 g. b) 5. c) O+2 has larger D0. d) 2g. e) All s MOs. 7-33. For He2, the second MO (su1s) correlates with third united atomAO (2ps ). For LiH, the second MO (s ) correlates with second united atomAO (2s). Thus, this MO is less antibonding in heteronuclear case. 7-34. [See Hint.] 2s = 1.0295 2s - 0.2447 1s; 2sg = 0.0136 1sA - 0.6523 2sA - 0.0854 2ps,A and similarly for B. 7-35. Eelec = Esepatoms - Vnn - De; De =-Eelec - Vnn + Esepatoms = (1.1026 - 0.500-0.500)a.u.=0.1026 a.u. Appendix 13 669 MC: a a d b d Chapter 8 8-1. For .prod, E=E1+E2+E3. For .det, E= 1 6(E1+E2+E3)six times. Energies of products in . are E1 +E2 +E3 and E1 +E2 +E4. These can be factored out to give Hˆ . =E. only if E3 =E4. 8-2. a) b) c)  x 1 1 x  (only 2 unsaturated carbons, so the same as ethylene). d) Same as c). Same as two ethylenes since the two p systems are noninteracting due to spatial separation. e) Same as c). Same as two ethylenes since the two p systems are orthogonal and noninteracting. 8-3.  x 1 1 1 1 x 0 0 1 0 x 0 1 0 0 x  =0, x4 -3x2 =0 See Appendix 6 for results. 8-4. 13 , since only .3 is common to both MOs. 8-5. S =1/v2, 1/v1-S2 =v2, f2(normalized)=v2(f2 -f1/v2). This results in all |c| = 1/2, with negative values on same side. That is, the node for f2 is vertical, while that for f1 is horizontal. 8-6. a) a -2ß (using octagon in circle). b) (since totally antibonding). 8-7. a) E =a +2ß(v2/3v3)+0+0+0)=a +0.544ß. b) p12 =0.272. All others zero. 670 Answers to Problems c) E = a + ß. (It is an ethylene pi bond, distributed over three locations, or three-thirds of a double bond. Or one can use a =1/v6 and equations for p and E.) 8-8. Highest two levels have same magnitudes as lowest two, with coefficients multiplied by -1 on alternating atoms (here taken to be second and fourth). 8-9. a) No. This is an odd alternant, so expect paired energy levels with an MO at E = a. The unpaired electron is therefore not in a degenerate level. b) Yes. Pentagon in circle forces degeneracies. Second level has three electrons. c)Yes. Square in circle shows that second level is degenerate. It contains one electron. d)Yes. The degenerate level of benzene now has an odd number of electrons. 8-12. Bond orders: CH2–CH, 0.8944.0.6708; CH–CH, 0.4472.0.5854. Bond lengths in A (Eq. 8-61): CH2–CH, 1.354.1.392; CH–CH, 1.436.1.408; CH2–CH=+0.038A, CH–CH=-0.028Å. 8-13. For benzene, cµi should be taken as 1/v6 since all carbons are equivalent. 8-14. Only fluoranthene deviates markedly because it is nonalternant. Hence, its LUMO and HOMO energies are not symmetrically disposed about E =a. 8-15. a) Oxidation potential ~ 0.97 V, reduction potential ~ 1.41 V. b) [See Hint]. To shorten: 4–10, 9–10, 8–9; to lengthen: 3–10, 1–9, 4–5, 7–8; otherwise no change. 8-16. E =18a +21.877ß. Error =0.0015ß per p electron. 8-17. a) Naphthalene; Ep (from Table 8-2) = 10a +13.128ß, from HMO = 10a + 13.6832ß. The difference = 0.055ß per p electron, aromatic. Perylene; Ep (Table 8-2) =20a +27.2796ß, Ep (HMO)=20a +28.2453ß; Ep = 0.048ß per p electron, aromatic. b) RE for perylene is slightly less than double that for naphthalene. The central ring does not appear to be contributing. c) These two bonds are single in all formal (nonpolar) structures. d) The calculated length =1.433Å. The HMO length is too short. The actual length is more consistent with these being “truly” single bonds. 8-18. The fourth molecule in Fig. 8-24 should strive for six electrons in each ring. This would make the left side (i.e., the seven-membered ring) net positive. The other Appendix 13 671 two molecules become net negative on the left. Electron densities corroborate this. For the fourth molecule, charge densities exceed unity in the five-membered ring and are less elsewhere. The fifth molecule has only one p electron density less than unity, and this is for the methylene carbon. 8-19. a) 9. b) 10. c) 4. d) 6. e) 10. 8-20. Since the formal structure always shows C–O single bonds, C1=C2, and C3=C4 double bonds, and C2–C3 as single, we can use the single-, double-bond distinctions of Table 8-3. These give  x 1.1 0 0 0.8 0 0 0 1.1 x 0.9 0 0 0.3 0 0 0 0.9 x -0.1 1.1 0 0 0.8 0 0 0 1.1 x 0.8 0 0 0 0.8 0 0 0.8 x +2.0 0 0 0 0 0.3 0 0 0 x +1.5 0 0 0 0 0.8 0 0 0 x -0.1 3.0 0 0 0 0 0 0 3.0 x -0.5  Otherwise, the positions with 1.1 and 0.9 become 1.0. 8-21. a) Left. b) Left. c) Right. 8-22. qr = 1 at all centers, and so it does not predict some centers best for nucleophilic, and hence worst for electrophilic substitution, so it doesn’t apply to this question. Since the HOMO and LUMO have identical absolute coefficients (by the pairing theorem), the same site is most favored for both nucleophilic and electrophilic substitution. Lr must be identical for nucleophilic, radical, or electrophilic substitution because an interrupted even alternant produces an odd alternant, for which cationic, neutral, and anionic p energies (ß part) are the same. Thus, HOMO and LUMO indices are consistent with coincidence of active sites for nucleophilic and electrophilic substitutions, and Lr is consistent with the coincidence of these with active sites for radical addition. 8-23. No. Both types should prefer the most polarizable site, since that is the site to which charge is most easily attracted or from which it is most easily repelled. 8-24. a) F1 =0.0684, F2 =0.4618, F4 =0.9737. b) Index Values Preferred site qr q2 =0.818 q4 =1.488 4 HOMO c2 2 =0.1356 c2 4 =0.6646 4 L+r L+2 =2.134ß L+4 := 0.962ß 4 prr p22=-0.4340 p44=-0.4019 2 c) Only protons on C2 and C3 will produce ESR splitting in simplest theory, since the singly occupied MO of the radical anion is zero elsewhere. d) Net 672 Answers to Problems bonding, because energy is below E=a and this happens only when bonding interactions dominate. 8-25. The correlation is fairly good except for styrene, which is way off. But styrene is the only member of the set where addition is not occurring at a ring position. Because the geometric constraints are so different, the relation between free valence and transition-state energy is presumably rather different for styrene. 8-26. The five-membered ring because, as it strives for six electrons in order to satisfy the 4n+2 rule, it becomes negatively charged. (The seven-membered ring also strives for six electrons, becoming positive.) 8-27. Fulvene a) should experience greater change. It is not an alternant hydrocarbon, whereas benzyl b) is. TheMOinto which the electron goes in benzyl is necessarily nonbonding, with zero coefficients at positions 1, 3, and 5. (See Appendix 5 for discussion, Appendix 6 for coefficient values.) 8-28. a) 11. b) 2 and 4. c) 3, 6, and 8 (1, 9, and 10 have no attached hydrogen). 8-29. a) is easier to ionize because the HOMO is zero at the nitrogen position (in the all-carbon analog). This means ionization does not remove electronic charge from the more electronegative atom in case a) but does in case b). MC: c b Chapter 9 9-1. a) 212 b) 6a 6b 6c 7a 7b 7c  c)  25 13 i -7 18  d) 1 0 0 1  e) . .. 3i +16 2i +28 31 51 -12 -21 . .. f) product not defined g) 1 9-2. H is defined to be hermitian if Hij = Hj*i . Hj i =  .j*Hˆ .idt , and so H*ji =  .jHˆ *.*i dt . But if Hˆ is hermitian, this must equal  .*i Hˆ .j dt =Hij . Therefore, H*ji =Hij and H is hermitian. 9-4. AC = " CA. But " CA = A˜C˜ , so AC = A˜C˜ . This must be true in this example because A and C are symmetric. That is, A=A˜ , C=C˜ . 9-5. a) |B - .i1| = |T-1AT - .i1| = |T-1AT - .iT-11T| = |T-1(A - .i1)T| = |T-1||A-.i1||T| = |TT-1 (A-.i1)| = |A-.i1| b) For diagonal B, value of |B-.11| is product of diagonal elements. For this to vanish, at least one such element must vanish. This will occur whenever .i equals a diagonal element of B. Therefore, the latent roots are the diagonal values. 9-6. If a latent root is zero, then the product of latent roots is zero. But this product is the value of the determinant of the matrix. If the determinant of the matrix is zero, there is no inverse. Appendix 13 673 9-7. a) tr(ABC) = n  i=1 (ABC)ii = i  j  k aij bjkcki = i  j  k ckiaij bjk which is  k (CAB)kk =tr(CAB) = i  j  k bjkckiaij which is  j (BCA)jj =tr(BCA) = i  j  k (ckibjkaij ) which is not i (CBA)ii , hence =tr(CBA). b) tr(T-1AT)=tr(TT-1A)=tr(A). 9-8. (norm ˜TAT)2 = i,j (#˜TAT)ij ( ˜TAT)ji = i,j (T˜A˜ T)ij (T˜AT)j i =  i,j,k,l (T˜)ik(A˜ )kl(T)lj (T˜)j l(A)lk(T)ki = k,l . ..... (A˜ )kl(A)lk i ( ˜T)ik(T)ki  1  j (T)lj ( ˜T)jl  1 . ..... = k,l (A˜ )kl(A)lk =(norm A)2 9-9. a) tr=0, det=2, norm=v6; therefore, a+b+c=0, abc=2, a2+b2+c2=6; solutions: 2,-1,-1. b) Solutions 1, 1,-1. c) 0, 1+v3, 1-v3. 9-10. Both vectors transform to  3 cos . -3 sin . . Hence, reversal is not possible and transformation is singular. This is verified by fact that the determinant vanishes. 9-11. The matrix is already diagonalized. This means the eigenvector matrix is the 3×3 unit matrix. 9-12. Let T-1AT=DA (diagonal) and T-1BT=DB (diagonal). Then DADB =DBDA (diagonal matrices commute); T-1ATT-1BT = T-1BTT-1AT; T-1ABT = T-1BAT;TT-1ABTT-1 =TT-1BATT-1;AB=BA. 9-13. In the second case, C is not unitary, since C†SC=1. The ordinary procedures for diagonalizing H have built in the requirement that C†C = 1. The problem would be to find a matrix C that simultaneously diagonalizes H and satisfies C†SC=1.  a†ß d. . (1 0) 0 1  =0;  a†ad..(1 0) 1 0  =1 Sˆza = 1 2 1 0 0 -1 1 0  = 1 2 1 0  = 1 2 a 674 Answers to Problems Chapter 10 10-1. 10-2. AO no. Atom AO type AO no. Atom AO type 1 H1 1s 6 C3 2py 2 H2 1s 7 O4 2s 3 C3 2s 8 O4 2px 4 C3 2px 9 O4 2px 5 C3 2px 10 O4 2py 10-3. E=-0.756 a.u.; MO 9 f9=-0.27 1s1 -0.27 1s2 -0.49 2sC +0.22 2pxC +0.31 2s0 +0.33 2px0 A s MO, mainly C–H2 bonding and lone pair (nonbonding) on oxygen. Shows some C–O antibonding character [see I). E=-0.611 a.u.; MO 8 f8=-0.21 1s1 +0.21 1s2 -0.32 2pyC -0.76 2pyO A s MO, CH2 and C–O bonding [see (II)]. E=-0.597 a.u.; MO no. 7 f7 =0.24 2pzC +0.92 2pzO Appendix 13 675 A p MO, mostly on oxygen, but somewhat delocalized to give some C–O bonding [see (III)]. 10-4. The p MOs are 4 and 7. All others are s. 10-5. C and O 2pp AOs are 4 and 8. The 4, 8 overlap population is seen from the data to be 0.1936. Since MO 7 is C–O bonding, loss of an electron should cause the C–O bond to lengthen. 10-6. E7 = (0.2456)2(-10.67eV)+(0.9181)2(-15.85eV) +2(0.2456)(0.9181)(1.75)(0.2146)(-10.67eV-15.85eV)/2 = -16.25eV=-0.5972 a.u. 10-7. The sum of the elements in the upper triangle =12. (Use of all elements would count overlap populations twice.) 10-8. If column 7 is the gross populations of MO no. 7, then it should turn out that 0.2175 = 2 c2 47 +(0.5)(2)c47c87S48: 2(0.2456)2 +(0.2456)(0.9181) (0.2146)]=0.2175. 10-9. These must beAOs because MO charges must be 0, 1, or 2. AO 4, for example, gets its charge from MO 7. We have just seen (previous problem) that this is 0.2175. (AO 4 also appears in MO 4, and the “charge matrix” gives a value of 1.7825 for this. But this does not appear in the gross population because MO no. 4 is unoccupied in the ground state configuration.) 10-10. The net charges are theAO charges plus the nuclear charges (after cancellation of some nuclear charge by inner-shell electrons). These results indicate high polarity, with oxygen being the negative end of the dipole. The predicted polarity is unrealistically high becauseEHMOneglects interelectronic repulsion which would tend to counteract such extreme charge imbalance. 10-11. Number MOs=number AOs=1 on each H and 4 (valence) on each C=22. Chapter 11 11-1. Koopmans SCF Experiment 2s-. 2p 1.0800 1.0830 0.989 1s-. 2p 31.9220 31.1921 31.19 11-2. Koopmans–SCF (eV) SCF – observed (eV) (electron relaxation) (electron correlation) 2B1 2.71 -1.54 2A1 2.52 -1.40 2B2 1.86 -0.90 11-3. For electron affinities, these errors should reinforce, rather than cancel, because adding an electron should increase electron correlation. 11-4. ad -cb+.(af -be)=ad +.af -bc-.be. 676 Answers to Problems 11-5. Neither .1 nor .2 is already the best function in our function space. Hence, we cannot argue that mixing will bring no improvement. 11-6. This must be true to enable a1 to be factored from the expanded form of Aˆ.. 11-7. For a given choice of basis functions, there are two integrals: .a(1).b(2)|.c(1).d (2) and .a(1).b(2)|.d (1).c(2) There are five ways to choose a function for each position. Thus, the number of integrals is 2×54 =1, 250. 11-8. a) and c) would be prevented from contributing. 11-9. Hˆ =- 1 2 10  i=1 .2 i - 10  i=1 ( 1 ri,H1 + 1 ri,H2 + 8 ri,0 )+ 9  i=1 10  j=i+1 1 rij 11-10. a) (1/v2)|sg(1)sg(2)|. b) Eelec = -1.804 a.u. c) Etot = -1.090 a.u. d) De =0.090 a.u. e) IE (Koopmans)=0.619 a.u. f) KE+Vne=-1.185 a.u. Chapter 12 12-1. E0 +W(1) 0 =.|H0|.+.|H|.=.|H|.=W0. 12-2. [See Hint.] E(1) 1 >E(1) 3 >E(1) 2 . 12-3. a) d. b) d. c) -d/2. d) d/2. e) -d/2. f) 0. 12-4. a) W(1) 2 =3d/4. (b-1) Expect c(1) 21 to cause .1 to shift to right. Since .2 is positive on left of box, negative on right, c(1) 21 should be negative. Since .2 is above .1, it should cause energy to be depressed, and c(1) 21 leads to a negative contribution to W(2) 1 . (b-2) Because c(1) ij =-c(1) ji , c(1) 12 must be positive, giving a f(1) 2 that causes charge to shift left and a contribution to W(2) 2 that is positive. 12-5. c(1) 41 = .1|H|.4 E1 -E4 = (2/L)  L 0 sin(px/L)(Ux/L) sin(4px/L)dx -15p2/2L2 = 64UL2 153p4 c(1) 21 = 32UL2 27p4 , c(1) 41 c(1) 21 = 2 125 =1.6% 12-6. The effect is zero, to first order, because .2 is symmetric for every state and the perturbation is antisymmetric. 12-7. [See Hint.] E=E0+E1=-1 2 a.u.+1s|-1/r|1s=-3 2 a.u. Correction seeks to make . less diffuse, yet still spherical. c(1) 2s,1s should be positive (augments hydrogen 1s at small r and cancels at large r) and c2P0 , 1s(1) should be zero (wrong symmetry). 12-8. W(0) +W(1) =-2 a.u.+2 a.u. = 0. Reasonable, since must be above -0.5 a.u. (See Problem 12-1.) Appendix 13 677 12-9. a) Zero, because the perturbation is antisymmetric for x, y reflection, while the 1s function is symmetric. b) c(1) 2s,1s vanishes. The relevant integral suffers the same symmetry disagreement as in part a). Or, in another view, the 2s AO will affect only the diffuseness of ., whereas the perturbation affects polarity. c(1) 2pz,1s should be positive. The perturbation lowers the potential for positive z, the wavefunction should skew that way, so 2pz should enter with its positive lobe reinforcing 1s at positive z. c) It is negative, reflecting the lower energy of the electron distribution as a result of first-order polarization. 12-10. Ep =4a +4.9624ß +(0.1)(q4); q4 =1.4881; Ep =4a +5.1112ß. 12-11. The effect is least atC6 since q6 is smallest. First-order result: E=6a+7.777ß. Computed result: E =6a +7.8546ß. 12-12. kk = 4*m j=1*n i=m+1 c2 kj c2 ki/(Ej - Ei). This must be negative. .qk must be negative if .ak is positive. This means that making atom k less attractive causes electron density to decrease there. This makes sense. 12-13. Since the total charge must be conserved, the sum of all atom-atom polarizabilities, including p1,1 must be zero. Since the sum over the polarizabilities in Example 12-5 is 0.44277, p1,1=-0.44277. 12-14. a) W(1) =hß (because all q =1). b) The lowest-energy one, which is the MO where the central atoms find the largest |c|. 12-15. a) Ep =9a +12.1118ß. b) Atom 1 seems likely to have the greater self-atom polarizability because, in the cation, the product of squares of HOMO, LUMO coefficients vanishes at atom 2. 12-16. [See Hint.] For butadiene: Ep =4ß[(0.3718)2 -(0.6015)2]=-0.894ß. For hexatriene: Ep =4ß[(0.2319)2-(0.4179)2+(0.5211)2]=0.603ß. The energy of hexatriene is lowered, that of butadiene is raised, and so hexatriene benefits. For cyclobutadiene: E0 +E(1) =4a +3.5778ß, E (H¨uckel) =4a +4.000ß. For benzene: E0 +E(1) =6a +7.591ß, E (H¨uckel) =6a +8.000ß. 12-17. a) b) f(1) 1 = .2|H|.1 E1 -E2 .2 + .3|H|.1 E1 -E3 .3 Since H depends only on density at C2, it comes out of the integral. [See (IV).] f(1) 1 = 0+(-1 2 cß/2v2ß).3=-0.1768c.3, f(1) 2 =0 f(1) 3 =+0.1768c.1 678 Answers to Problems Thus, .1 +f(1) 1 has more density at C2 than did .1, .2 +f(1) 2 is identical to .2, and .3 +f(1) 3 has lost density at C2. 12-18. p1,2=-0.1768ß-1; p1,3=-0.265ß-1. Both atoms 2 and 3 lose charge, but atom 3 loses more than atom 2. 12-19. a) Nondegenerate MO, f1 = (1/v3)(.1 + .2 + .3) is already correct. The correct zeroth-order degenerate MOs must give zero interaction element with H. For this H, this means that the overlap between these MOs must be zero at C2. Thus, one MO must have a node at C2 f2 =(1/v2)(.1 -.3) and the other must be orthogonal to it f3 =(1/v6)(2.2 -.1 -.3). If one uses data from Appendix 6, one obtains, for degenerate MOs, .1 = -0.8165.1 + 0.4082.2 + 0.4082.3, and .2 = 0.7071.2 - 0.7071.3. These give H 11 = 0.1666cß, H 22 = 0.5cß, H 12 = 0.2886cß. Since H 12 = 0, one knows .1 and .2 are not correct zeroth-order wavefunctions. Solving the determinantal equation gives E1 = 0, E2 = 0.6666cß. These are the firstorder corrections to the energies of the two upper levels. Solving for coefficients gives, for E = 0, c1 =-0.866, c2 = 0.500, and so the proper zerothorder wavefunction having a zero first-order correction is f(0) 1 =-0.866.1 + 0.500.2 =0.7071.1 -0.7071.3. For E =0.6666cß, c1 =0.500, c2 =0.866, and sof(0) 2 =-0.4082.1+0.8165.2-0.4082.3. These are the same functions arrived at intuitively above. b) For the above zeroth-order MOs, the densities at C2 are respectively 13, 0, 2 3 . Therefore, the energy of the lowest level drops by cß/3, that for one of the originally degenerate levels drops by 2cß/3, and the other level is unaffected (to first order). 12-20. a +2.1ß, a +0.818ß, a +0.618ß, a -1.418ß, a -1.618ß. 12-21. Answers to part a) and b) follow answers for parts c) and d). c)E=a±2ß(0.5)2(2)=a±ß. These agree exactly with benzene orbital energies, which is reasonable since the unperturbed fragment nonbonding orbitals combine to give exactly the benzene MOs. d) a + v2ß + 2ß(0.353)2(2) = a + 1.913ß compared to a + 2ß for benzene. Appendix 13 679 The allyl coefficients are renormalized by dividing by v2. 12-22. a) Only p3.p2 is dipole allowed. b) It is polarized along the outer C–C bonds. If we pretend the molecule is linear, this is the line connecting the carbons. If we adopt a more realistic structure, it depends on whether we choose a cis or trans structure: 680 Answers to Problems 12-23. f(0) ± |-z|f(0) ± =±2s|r cos .|2pZ=±3 a.u. For f =cos(a)2s+sin(a)2pZ, maximum dipole occurs when cosa = 1/v2, sin a =±1/v2, that is, when f = f(0) ± . This is reasonable since the mixing of degenerate states like these requires no energy “expense” in terms of the unperturbed hamiltonian. As soon as the slightest external field appears, the mixing will occur to the above extent to produce maximum dipoles. 12-24. Since, for MO 4, c3 =c7, this MO is symmetric for reflection through the yz plane. The operator x is antisymmetric for this reflection. Therefore, if integral f4|x|f? is to be nonzero, f? must be antisymmetric for this reflection. Of the empty MOs shown, only f5 satisfies this requirement. There is no other symmetry operation present that will independently cause the integral to vanish, and f4.f5 is likely to be the observed transition. 12-25. a) The HOMO is antisymmetric for reflection through the yz plane. (Assume that the coordinate origin is in the center of the 5–10 bond.) The x operator is also antisymmetric. Therefore, the allowed transition should be to an MO that is symmetric for this reflection. There are three such MOs, at energies of +1.000, +1.303, +2.303 in units of -ß. But the middle of these disagrees in symmetry with theHOMOfor reflection through the xz plane. Therefore, only the other two transitions are allowed. b) Here, the integrals containing y will vanish by symmetry except for transitions to +0.618 and +1.303, but the latter state disagrees in symmetry with the HOMO for the yz plane reflection. Hence, only -0.618.+0.618 is y allowed. c) No p–p transition is z allowed. d) Transitions to 1.303 and 1.618 are not allowed for any polarization. 12-26. The field polarizes the atom, which means that the 2s state acquires 2p character. But the 2p.ls transition is allowed, and so the atom now relaxes to the 1s state. 12-27. 4.7 allowed, polarized along C–O axis. 3.7 forbidden. 2.7 and 1.7 allowed, polarized perpendicular to molecular plane. 12-28. a) a, a.a, a +0.5ß. b) a, a.a+0.25ß, a+0.25ß. If we viewthis as a sequence of substitutions, then case a) is clearly a case of cooperative perturbations since the proper zeroth-order orbital that is strongly lowered by the first substitution is lowered again by the second. For case b), the second substitution affects the other one of the proper zeroth-order orbital energies. To this order, a) should be more stable since the lowering of energy due to the two electrons in this level is ß, whereas the most that can come from b) is ß/2. [Analysis of character tables (Chapter 13) enables us to see that the degeneracy for case b) must become split at higher levels of perturbation because aC2v molecule has only one-dimensional representations, hence cannot have degenerate orbitals.] 12-29. Lowest energy rises by 0.0667ß. One of the next pair rises by 0.1ß and the other drops by 0.0333ß. 12-30. x,y components =0. z component = 0.293 Z a.u.= 0.745 Z D= 2.485×10-30 Z Cm. Appendix 13 681 MC: c a c b Chapter 13 13-1. No. “Come about 180.” is needed for closure. 13-2. It means that for each operation there is a right inverse as well as a left inverse. If AB =E, then A is the left inverse of B and B is the right inverse of A. 13-3. a) C4. b) 4. c) Probably only three, since one tends to think of left face and right face as being in same class. However, in the C4 group they are not, because there is no operation in the group to interchange them (such as reflection through a plane containing the z axis). 13-4. There are many possible arrangements counted in 4! that are not physically achievable through operations in the group. For example, 1 — 2 | | 4 — 3 . 1 — 4 | | 3 — 2 For C3v, all possible arrangements are accessible. 13-5. a) C2v. b) D2h. c) D4h. d) Td. 13-6. a) U†U=UU† =1 (upon explicit multiplication). b) Upon explicit multiplication, U†AU= 1 0 0 -1 . 13-7. a) D6h (yes). b) C2v (no). c) D2h (no). d) D3d (yes). e) C3v (yes). 13-8. a) D3h. b) (1) a 2 ; (2)e; (3)a2 ; (4)a1 . 13-9. [See Hint.] D2d, MO number : 1 2 3 4 5 6  7 8  9 10   11 12 13   14 15 16 MO symmetry : b2 a1 a1 b2 e e e b2 e a1 b2 a1 The highest occupied MO is 9,10; the lowest empty MO is 8,7, and so the transition is e.e. f9|x or y|f8 =  e.e.e= e.e.e.e=0 f9|z|f8 =  e.b2 .e= a1 .a2 .b1 .b2 =0 Transition is allowed for (group theory) z polarized light. This means x polarized for the coordinate system shown. 13-10. a) and b) can be checked against C2v character table. c) Hydrogens generate characters 2 0 2 0, which is a1 .b1. d) The unnormalized symmetry combinations are: a1, 1sA + 1sB;b1, 1sA - 1sB. 682 Answers to Problems 13-11. a) and b) can be checked against the C4v character table. c) Hydrogens generate characters 4 0 0 2 0 or 4 0 0 0 2, depending on howsv and sd are selected. Assuming that sv contains corner ammonia molecules gives the former set. This resolves to a1 .b1 .e. (The other choice gives b2 instead of b1, but reversal of choice of sv and sd has the effect of interchanging the symbols b1 and b2, and there is no real difference involved in this choice.) For a situation where the nitrogens are numbered as shown in the figure, the unnormalized symmetry orbitals are: a1, 2s1 + 2s2 + 2s3 + 2s4; b1, 2s1 - 2s2 + 2s3 - 2s4; e, 2s1 - 2s3 and 2s2 - 2s4; or 2s1 + 2s2 - 2s3 - 2s4 and 2s1 - 2s2 - 2s3 + 2s4. (Other combinations are also possible, but these two sets are the most convenient.) 13-12. a) Four operations means order 4. Four classes means four representations. Hence, each representation must be one dimensional, therefore having character +1 or -1 for every operation. One of these must be +1 everywhere (A1). The others must all have +1 in the first column and -1 in two of the other three columns. The result is as given in Appendix 13 for C2v. b) Twelve operations gives order 12. Six classes means six representations. This must mean two 2×2 and four 1×1. There is but one unique set of orthonormal character vectors that fit this framework. Check against the D3h character table. 13-13. Two two-dimensional representations and two one-dimensional representations. 13-14. Check against C2v. 13-15. Order =12. There are six classes. 13-16. 1 2 . .... 1 1 -1 1 1 -1 1 1 1 -1 -1 -1 1 1 1 -1 . .... . .... s px py pz . .... =sp3 set . .... 1/v3 0 v2/v3 0 1/v3 1/v2 -1/v6 0 1/v3 -1/v2 -1/v6 0 0 0 0 1 . .... . .... s px py pz . .... =sp2 set For each matrix T,T†T=1. 13-17. Yes. Since various hybridized sets are equivalent, the final result of the calculation is independent of one’s choice. 13-18. a) A2 .2B1 .E. Yes. b) Ag .2B3u. No. c) A1g .A2g .Eg .A1u .A2u .Eu. No. 13-19. a) e.b2 .e=a1 .a2 .b1 .b2. Need not vanish. b) e.e.e=e.e.e.e. Must vanish. c) a1 .b2 .b2 =a1. Need not vanish. Appendix 13 683 d) a1 .b2 .a1 =b2. Must vanish. e) a1 .e.b2 =e. Must vanish. 13-20. eg.a2u, eu are x,y allowed. eg.eu is z allowed. 13-21. The symmetry of the molecule is C2v. All p MOs are antisymmetric for C2 so must be bases for b1 or b2 representations. The function y (for the molecule) is z (in the table), and so it is a basis for the a1 representation. For the product f4yfn to contain a1 symmetry, it is necessary, then, that fn have the same symmetry as f4. f6 and f7 do have the same symmetry; therefore, f4.f6,f7 are y allowed (where y is coincident with the symmetry axis). 13-22. a) A nonspecial point above the top face has seven equivalent positions above that face (into which it can be sent by various symmetry operations, so there are eight equivalent positions above that face. Because the two sides of the square are identical, there are another eight equivalent positions below. Therefore, we expect a group of order 16. (See D4h.) b) Like the square, each face of the cube has eight equivalent points. The inside of a face is not equivalent to the outside, so each face is limited to eight equivalent points, not 16. Six faces times eight points per face yields a predicted group order of 48. (Oh) MC: d c a e d a d a b Chapter 14 14-1. 1sA|Hˆhyd - 1/rB|1sA = -1 2 - (1/R) + [(1/R)+1] exp(-2R) where R is distance between nuclei. For R =2 a.u., Eelec=-0.9725 a.u., Etot=-0.4725 a.u. For R =1 a.u., Eelec=-1.2293 a.u., Etot =-0.2293 a.u. For R =3 a.u., Eelec=-0.8300 a.u., Etot =-0.4967 a.u. 14-2. Expected QMOT behavior is reversed for very low HAA. The “antibonding” MO is lower than the “bonding” for HAA =-20. This happens because the loss of energy involved in removing charge from the atoms is not compensated by that gained by putting the charge into the bond region. 14-3. The molecular electron affinities tend to be greater than for the atoms when the extra molecular electron goes into a bonding MO, smaller if into an antibonding MO. QMOT leads us to expect the MO energy to be lowered (raised) from separated atom levels if the interaction is bonding (antibonding). Koopmans’ theorem then leads to predictions in qualitative agreement with these data. 14-4. The observed ground-state angles are more consistent with the triplet-state configuration. The first excited state should then be a singlet and have a smaller angle. (Accurate calculations give 103..) 14-5. It should become more bent. (Also, the O–H bonds should lengthen, although this is not the kind of geometric change that the figure explicitly treats.) 14-6. The computed results will largely agree with QMOT ideas. However, it is possible that the 1sg - 1a1 level will rise where Walsh’s rules predict it will fall. [Whether this occurs depends upon details of EHMO parameter choices.] Upon analysis, this turns out to result from a situation similar to that examined 684 Answers to Problems in Problem 14-2. That is, the increase in H–H overlap population comes at the expense of population elsewhere. If the energy associated with this “population elsewhere” is low (i.e., if the original MO energy is low enough) the energy cost is greater than the gain due to increased H–H overlap population. This inversion of behavior for low levels is ignored in theWalsh approach. However, it does not occur for a given MO until it is fairly deeply buried under higher filled MOs. Also, since the higher MOs dominate the behavior of the molecule anyway, the inversion does not affect our prediction. 14-7. Since the exact answer depends on details of your EHMO program, allow us to take this opportunity to toast your good health. 14-8. If S = S0 cos ., dS/d. =-S0 sin .. S for . = 0 to . = 30. equals -(1 - 0.866)S0=-0.134S0; for 90.60., S=-S0(0-0.5)=0.5S0. 14-9. For a diagram and discussion, see Gimarc [1]. Those with six valence electrons (the first three in the list) are planar. Those with eight electrons (the last four) are pyramidal. 14-10. Sketches and discussions of AB2 molecules may be found in the literature [2,3]. Molecules with 16 or fewer valence electrons (BeCl2,C3,CO2,N-3 ) should be linear. Those with more than 16 should be bent (NO2,O3,F2O). 14-11. Bending should increase end-to-end antibonding. These MOs favor the linear form [see (V)]. Bending should increase end-to-end bonding. These MOs favor the bent form [see (VI)]. Therefore, pg.pu should make molecule more bent. 14-12. For allyl, draw the three p MOs. For the cyclic molecule, draw a C–C bond (lowest in energy), a single p-p AOon the negative carbon (intermediate energy) and the C–C antibond (high energy). Conrotary motion preserves a C2 axis, disrotatory preserves a reflection plane. For the C2 axis, the allyl MO symmetries are (in order of increasing energy) A, S, A. For the cyclopropenyl anion, they are S, A, A. For the reflection plane, the same MOs have symmetries S, A, S; and S, S, A. The resulting predictions for the four-electron anion are that thermal closure goes conrotatory, photochemical goes disrotatory. 14-13. [See Hint.] The diagram is given in (VII). s and p refer to symmetry (S or A) for reflection through the molecular plane. The second symmetry symbol refers to reflections through a plane between the two acetylenes. The third symbol Appendix 13 685 refers to a reflection orthogonal to the first two (i.e., bisecting both acetylenes at their bond midpoints). The reaction does not appear favorable for a thermal mode. Photochemically it is less unfavorable, but still corresponds to a rather highly excited product. Asquare planar intermediate is unlikely for either mode. 14-14. No states can differ in symmetry in this case, and no crossings should occur. 14-15. SeeWoodward and Hoffmann [4, pp. 23, 24]. 14-16. SeeWoodward and Hoffmann [4, pp. 70, 117]. Chapter 15 15-1. (h/2p)k =px =v2mE =v2mT =h(1/.). 15-2. [See Hint.] dk/dE = (dE/dk)-1 = 4p2m/kh2 . 1/k . 1/vE so plot E vs 1/vE. 15-3. [See Hint.] a) For example, f2=(1/v6) pp1 +exp(2pi/3)pp2 -exp(pi/3)pp3 +pp4+ exp(2pi/3)pp5 -exp(pi/3)pp6 .E2 =a -ß. b) j = 0 gives exp(0) = l at every atom, j = 6 gives exp(2(n - 1)pi) = 1 at every atom. 15-4. [See Hint.] a) DOS, “normalized” by dividing byN, and in units of |ß|-1, is1/(p sin(hp)), where h=j/N and 0=h=1/2. E =a-2|ß|cos(hp). Therefore, DOS = 1/{p sin(arc cos [(E -a)/2|ß|])}. b) 2N states spread uniformly over an energy range of 4ß gives a “normalized” DOS of 0.5|ß|-1. 15-5. For example, j =-1.(1/v6) pp1 +exp(-pi/3)pp2 +exp(-2pi/3)pp3- pp4 -exp(-pi/3)pp5 -exp(-2pi/3)pp6. Sum of j =±1 functions divided by v2 gives one real function, and difference divided by iv2 gives the other. 686 Answers to Problems 15-6. Requires that exp(-2piqnN/n) equal 1. This requires that the argument equal pi times an even number, which is the case if qN is an integer. Since q and N are integers, so is qN. 15-7. (CH)2n has lower energy. Variation would not affect (CH)2n, but would mix p and p* versions of (C2H2)n to produce COs like those for (CH)2n. (Note that, for convenience, we have arranged phases so that the node is identically placed in these diagrams.) 15-8. The HOMO and LUMO at |k| = p/3a respond oppositely to distortion that lengthens every third bond and/or shortens the others. 15-9. Symmetry for reflection through a plane perpendicular to the polymer axis is not useful for this analysis. Symmetry for reflection through a plane containing the atoms divides the COs into s and p types. Only s–p crossings are allowed. 15-10. The s bands involve 2s and 2p AOs. One band pair includes 2s AOs with 2p AOs that are oriented parallel to the screw axis. The other band includes no 2s AOs and only 2p AOs perpendicular to the screw axis. The former functions form a basis that is symmetric for the symmetry operation, whereas the latter form a basis that is antisymmetric: 15-11. At high pressures, intermolecular distance becomes the same as intramolecular distance. This yields a zero “gap” at the Fermi level (same as polyacetylene with uniform bond lengths). 15-12. Appendix 13 687 15-13. a) HOMO =-8.19 eV. b) LUMO =-4.09 eV. c) Egap = 4.10 eV. d) Gap should open (both up and down) so HOMO becomes lower and IE is higher. LUMO is higher and EA is less negative. Computed values: IE=8.39eV,EA= -2.26eV,Egap =6.13 eV. 15-14. In Chapter 8 we show correlations between HOMO and IE and between LUMO and EA, with different values for ß in each case. The difference in interelectronic repulsion and exchange for neutral and anionic species is handled implicitly in this manner by simple H¨uckel theory. 15-15. Only p4 has the proper symmetry to combine with p1 to make c1=c4 =c2=c3 =c5 =c6. Mixing causes the bands to split apart. p1 is pushed down by p4, and p4 is pushed up by p1. The coefficients decrease on carbons 1 and 4 in p4, consistent with a rise in energy due to less bonding. 15-16. No. Atoms 2 and 6 have zero coefficients in the appropriate COs (see p(0) a of Fig. 15-26), but atom 5 has a nonzero coefficient. 15-17. The band diagram is identical to the one in Fig. 15-23 except that the a + ß level at k=0 has an intended correlation with a-ß at k=p/a, and vice versa. However, these have the same symmetry (anti for s2), hence avoid crossing. The resulting diagram is identical to that in Fig. 15-23 except that band p2 has an intermediate hill and p5 an intermediate valley—height and depth unknown until a variational calculation is performed. 15-18. Edge energies without substitution at k =0 : a +2.5ß, a,a -ß,a -1.5ß. At k=p/a :a+1.5ß,a+ß, a,a-2.5ß. (Proper zeroth-order MOs for theE=a level of cyclobutadiene are the versions having nodes through atoms rather than through bonds.) The lines at a,a - ß for k = 0 cross. This is allowed by symmetry disagreement for reflection through the plane containing the polymer axis and perpendicular to the molecular plane. Edge energies are lowered by substitution by the following amounts as we move up the diagram on the k =0 side: ß/8,ß/4, 0,ß/8. On the k = p/a side: ß/8, 0,ß/4,ß/8. Symmetry is maintained so the crossing is still allowed. For only one substitution, the energy lowerings are half as great. Symmetry is lost, and the crossing becomes avoided. 15-19. Like Fig. 15-28a with the middle horizontal row shaded. Higher in energy than , but lower than X or M. 15-20. 688 Answers to Problems 15-21. The energy level for X is lower than those for  and X. For X, all interactions are bonding. For  and X, nearest-neighbor interactions are antibonding and longer-range interactions of both bonding and anti bonding types occur. Judging the relative energies of  and X is complicated since we need to evaluate relative magnitudes of interactions between p AOs on the same long edge of the rectangle and also on opposite corners. Since p AO interactions depend on distance and angle, this is not trivial, as it is for s-type AOs. 15-22. All are nondegenerate. 15-23. The RFBZ is triangular. Referring to Fig. 15-27c, the RFBZ shown there becomes square, but symmetry nowmakes points on opposite sides of the-M line equivalent (e.g., X and X become equivalent), so only the -X-M- line is needed. 15-24. Both are nonbonding and would have the same energy. 15-25. COs at (0,p/a) and (p/a, 0) are the same except that the nodal planes are rotated by 60o. Appendix 13 689 15-26. The crystal is symmetrical for reflection through a hexagonal sheet. Planewaves experiencing the same potential and having the same |k| are degenerate. 15-27. At the top of the RFBZ, unit cell functions are stacked with sign reversal. If the upper and lower phases of the unit cell functions agree, this means that we must have a bonding interaction in the cell, but we will also get antibonding between cells, hence no splitting. On the other hand, if the upper and lower phases are opposite, we must have antibonding in the cell but we now get bonding between them, again for no splitting. The same is true for sigma bonds. 15-28. Appendix 2 A2-2. a) -2. b) 1. c) 0. d) x=±v2. A2-4. Determinant of coefficients vanishes, and so nontrivial roots do exist. A2-5. c=2, -1±v7 are the roots. A2-6. a) If two rows or columns differ by factor c, then multiplication of the smaller row or column gives |M| = c|M| (by rule 1), and M has two identical rows or columns. Interchanging these gives M, and |M|=-|M|. But M =M because the interchanged rows or columns are identical. Therefore, |M| = -|M|, and |M|=0. Therefore, |M|=c|M|=0. Appendix 4 A4-1. From Eq. (A4-47), LzL+ =L+(Lz +1). Then LzL+Yl,m =L+(Lz +1)Yl,m = L+(km +1)Yl,m =(km +1)L+Yl,m. A4-2. L+L- =(Lx +iLy)(Lx -iLy)=L2x +L2y +iLyLx -iLxLy. Equation (A3- 31) tells us that iLyLx -iLxLy =-i2Lz =Lz, so L+L- =L2x +L2y +Lz = L2x +L2y +L2z -L2z +Lz =L2 -L2z +Lz . Therefore, L2 =L+L- +L2z -Lz . A4-3. Equation (A4-55) indicates that the result of L+ on Yl,m remains an eigenfunction of L2 with the same eigenvalue, i.e., that L2L+Yl,m =L2C+Yl,m+1 = klC+Yl,m+1. But also, L+L2Yl,m = L+klYl,m = C+klYl,m+1, so Eq. (A4-55) indicates that L+ and L2 commute. This is easily verified since L+ is a linear combination of Lx and Ly, both of which commute with L2. We can establish Eq. (A4-55) by evaluating the result of the reverse order of operations and using the fact that L+ and L2 commute: L+L2Yl,m =L+klYl,m =klL+Yl,m = klC+Yl,m+1. But L+L2Yl,m=L2L+Yl,m, so L2L+Yl,m=klC+Yl,m+1. Q.E.D. 690 Answers to Problems A4-4. Lx =(1/2)(L+ +L-), (1/2)Yl,m|L+ +L-|Yl,m=(1/2)[C+Yl,m|Yl,m+1+ C-Yl,m|Yl,m-1=0+0. A4-5. a) (10) 0 1  =0. b) S+ß =(Sx +iSy)ß = 0+0 1 2 - i2 2 1 2 + i2 2 0+0 0 1  = 1 0  =a c) S+a = 0 1 0 0 1 0  = 0 0  d) Sx,Sy= 1 4 0 1 1 0 0 -i i 0  - 0 -i i 0 0 1 1 0 + = 1 4 2i 0 0 -2i  =iSz . A4-6. S2 =S2 x +S2 y +S2 z = 1 4 0 1 1 0 0 1 1 0  + 0 -i i 0 0 -i i 0  + 1 0 0 -1 1 0 0 -1 + = 3 4 1 0 0 1  ;S2a = 3 4 1 0 0 1 1 0  = 3 4 1 0  = 3 4 a. Appendix 8 A8-1. Tˆ = -1 2d2/dx2, Vˆ = 1 2 kx2. Equation (A8-19) gives E¯. = .2T + .-2V . .E¯/..=0=2.T -2.-3V . For an exact solution, .=1, and T =V . A8-2. V = -3 a.u., T = 1 2 a.u., . = -V /2T = 3, .. = .3/p exp(-.r) = v27/p exp(-3r), E. =.2T +.V = 9 2 -9=-4.5 a.u. References [1] B. M. Gimarc, Accounts Chem. Res. 7, 384 (1974). [2] R. S. Mulliken, Rev. Mod. Phys. 4, 1 (1932). [3] A. D.Walsh, J. Chem. Soc. 2260 (1953). [4] R. B. Woodward and R. Hoffmann, The Conservation of Orbital Symmetry. Academic Press, NewYork, 1970. Index A A–A bonding, 502 ab initio calculations basis sets for, 353–357 description of, 348 examples of, 370–384 for molecules, 374–382 Abelian groups, 430 Absolute squares, 21, 461 Acceptable functions, 22 Acetylene, 244 Acyclic polyenes, 281–282 Additive constant in H, 401 All-trans polyacetylene, 551–552 Allyl, 606 Allyl radical, 607 2-Allylmethyl, 606, 608 AM1, 386t Ammonia, 47, 432f Amplitude function, 4 Amplitude of wave, 2 Angle-dependent functions, 97 Angular momentum as pseudovector, 111 description of, 51t electron, in atoms, 149–159 electron spin, 599 expressions for, 591–592 gyroscope with, 112f magnetic moment and, 115–117 magnitude of, 595 in molecular rotation, 117–118 nuclear spin, 599 quantum-mechanical operators, 592–593 spherical harmonics and, 110–115 spin-orbital for equivalent electrons, 156–159 for many-electron atoms, 152–159 for nonequivalent electrons, 153–154 for one-electron ions, 150–152 Zeeman effect, 154–156 total, 149–150 total orbital, 152 vectors, 143 Angular momentum operators, 52, 593–595, 599 Angular momentum–angular position, 18 Angular velocity, 51t Antibonding molecular orbitals, 217, 226, 258 Antinodes, 4 Approximate density function, 369 Approximations s-p separability, 245 Born–Oppenheimer, 207–208 generalized gradient, 370 independent electron, 127–129 local density, 370 orbital, 233–235 Aromatic properties, 281 Aromaticity, 281 Associative law, 430 Asymptotic behavior, 74–75, 78–79 Atom(s) helium description of, 134 nonlinear variation for, 194–197 1s2s configuration of, 138–144 hydrogen nonlinear variation for, 191–194 polarizability of, 197–206, 410–412 s-type states of, 418 virial theorem of, 624–627 Atom self-polarizability, 291 Atom–atom polarizability, 407 Atomic p-electron densities, 257 Atomic ionization energies, 372 Atomic orbitals in acetylene, 244 basis, for Hückel determinant, 247–248 decay of, 385 definition of, 208 description of, 128, 143, 145 differential overlap between, 385 frozen, 486 gross population, 337 Hartree–Fock equation and, 623 linear combination of, 206–220 in molecular orbitals, 465–467 net population, 335 691 692 Index Atomic orbitals (Continued) p-type, 219 1s, 498, 587–590 united-atom, 227f–228f Atomic spectral line splitting, 133 Atomic units, 109t, 109–110, 632 “Atom-in-molecule” energy, 488 Aufbau process, 149 Austin Model 1, 386t Average dipole moment, 171 Average values, postulate for, 171 Avogadro’s number, 631 Azimuthal angle, 91 Azulene, 290–291, 294–295, 606, 613 B Band diagrams, 528 Band width, 541 Band-crossing analyses, 559 Basis atomic orbitals, 247–248 Basis functions, representations generated from, 446–451 Basis sets for ab initio calculations, 353–357 description of, 231–233 descriptors for, 356–357 Gaussian, 353 Slater-type-orbital, 353–354 Benz-cyclopentadienyl, 606 Benz-cyclopentadienyl radical, 612 Benzene, 606, 610 description of, 526, 530–533 energies for, 531f Hückel molecular orbital method, 260, 267–268, 530 molecular orbitals of, 530–531, 535, 556 poly-paraphenylene, 555–561 Benzyl, 606 Benzyl radical, 611 Binding energy, 234 Bloch functions, 534 Bloch sums description of, 536–537, 540f, 542–543, 545 without variational modification, 549 Bloch’s theorem, 533–537 Bohr magneton, 115, 155, 631 Bohr radius, 93, 631 Boltzmann’s constant, 546 Bond integrals, 249 Bond length, 269–270 Bond order, 269–270, 605 p-bond order, 259 Bond-angle change, 484 Bond–atom polarizability, 408 Bond–bond polarizability, 408 Bonding molecular orbital, 217 Born–Oppenheimer approximation, 207–208, 349 Boson, 136 Boundary conditions, 7 Bra-ket notation, 629–630 Brillouin’s theorem, 364–365 Butadiene, 264–265, 279, 507, 608 Butadiene-2,3-bimethyl, 606, 610 C Cartesian coordinates, 90, 324, 449 C–C length, 555 C–C separation, 544, 553 CCSD, 379 CCSD(T), 379–380 Character(s) absolute squares of, 461 conditions for, 460–461 definition of, 458 description of, 458–462 reducible representations resolved using, 462–463 Character tables, 459, 637–648 Charge density index, 291 Circular motion, 50 cis-1,3-butadiene, 511f Class, 434–436 Classical wave equation, 4–7 Closed shells, 132, 226, 349–350 Closed subshell, 349 C–N length, 555 CNDO/1, 386t CNDO/2, 386t CNDO/BW, 386t Coefficient waves, 532f Cofactor, 584 Column vector, 309 Commutators, 178–179 Commuting operators, 175–176 Complete neglect of differential overlap, 386t Complex conjugate of a matrix, 309, 311–312 Compression waves, 5 Concerted process, 509 Configuration correlation diagram, 519f Configuration interaction calculation, 365 description of, 360–365 electron motion and, 378 size consistency of, 366 truncated, 366 Conjugate variables, 18 Conjugative model, 287 Conrotatory closure, 508, 513 Constant of motion angular momentum as, 111 description of, 30, 43 Constant potential, particle in a ring of, 50–52 Constants, 631–633 Continuous function, 532 Contracted Gaussian function, 356 Coordinate transformation, 312 Index 693 Correlation diagrams configuration, 519f description of, 46 orbital, 519 state, 519f symmetry in, 514 symmetry-forbidden, 517 Correlation energy, 357–358 Correspondence principle, 31 Coulomb integral, 249, 413, 587–590 Coulomb operator, 350–351, 620 Coulomb terms, 369 Coupled cluster theory, 366–367 Covalent character, 363 Covalent terms, 363 Crystal orbital bond order, 541 Crystal orbital overlap population, 541 Crystal orbitals, 539, 559f Cyclic groups, 453–456 Cyclic polyenes, 281–282 Cycloaddition reaction, 517 Cyclobutadiene, 265–267, 283–284, 608 Cyclobutadienyl methyl, 606 Cyclobutadienyl methyl radical, 608 Cyclobutene, 511f, 513 Cycloheptadienyl, 606 Cycloheptadienyl radical, 611 Cyclooctatetraene, 606, 612 Cyclopentadienyl, 606 Cyclopentadienyl radical, 6089 Cyclopropenyl radical, 607 Cyclopropenyl system, 253–256, 606 D D3d–D3h, 504f De Broglie, 14–15 Degenerate state, perturbation theory for, 409–410 Degenerate-level perturbation theory, 412–414 Delocalization energy, 279 Delocalized effect, 281 Delocalized molecular orbitals, 472 Density functional theory, 368–370 Density of states, 528 Determinant(s) cofactors, 584 definition of, 584 4×4, 584 Hückel molecular orbital method a quantity, 248–249 ß quantity, 249 basis atomic orbitals, 247–248 constructing of, 247–248 generalizations, 259–263 manipulation of, 249–250 overlap integrals, 249 topological, 250 in linear homogeneous equations, 585 secular, 200–201, 211 Slater, 137–139, 349, 369, 622 topological, 250 2×2, 584 Determinantal equation, 248 Diagonal matrix, 311 Diatomic molecules description of, 84–85 homonuclear closed shells of, 226 electronic states of, 229 molecular orbitals of, 220–231 properties of, 225t qualitative molecular orbital theory rules applied to, 490–494 symmetry orbitals of, 227f virial theorem for, 627–628 Diels–Alder reaction, 517, 518f Differential equation for q(x), 75–76 Diffraction experiment with electrons, 16–19 Dihedral planes, 438–439 Dimethylacetylene, 506, 507f Dipole moment, 171 Dipole transition, 419 Dirac delta function, 170, 178 Dirac notation, 629–630 Direct lattice, 564 Direction of rotation, 451 Disrotatory closure, 508 Dissociation energy, 234 Distance matrix, 343 Distribution function, 70 E Eigenfunctions Bloch’s theorem, 533–534 commuting operators have simultaneous eigenfunctions, 175–176 description of, 20, 32 gerade, 215 for hydrogenlike ion in atomic units, 110t lowest-energy, 190 nondegenerate, 131, 218 1s, 192 simultaneous, 175–176 symmetry of, 244 ungerade, 215 unperturbed, 395 Eigenvalues description of, 20 extended Hückel method, 328–331 formaldehyde, 343 of Hermitian operators completeness of, 176–177 degenerate, 174 expressed as an orthonormal set, 174–175 694 Index Eigenvalues (Continued) nondegenerate, 173–174 orthogonal set formed from, 173–174 proof of, 172–173 Kohn–Sham, 370 for L2, 595–600 linear combination of atomic orbitals–molecular orbitals–self-consistent field, 351–352 for Lz , 595–600 matrix of, 316 negative, 93 postulate relating measured values to, 169–170 potential function of, 92 Schrödinger equation, 92–93 Eigenvectors description of, 316 extended Hückel method, 328–331 Electrical conductivity, 546–547 Electrocyclic reaction, 508 Electromagnetic radiation, 9 Electromagnetic wave description of, 9–10, 12–13 square of, 21 Electron(s) charge of, 631 diffraction experiment with, 16–19 equivalent, 156–159 independent approximation of, 127–129 nonequivalent, 153–154 orbital motions of, 143 potential energy of, 397 resonance energy per, 281, 292f rest mass of, 631 spin states for, 136 uniform electrostatic perturbation of, 396–403 p-electron assumption, 246–247 Electron densities, 602 p-electron densities, 271–275, 290 p-electron energy, 279–284 Electron exchange symmetry, 129–132, 159–160 Electron flux, 546 Electron index order, 618 Electron motion, 378 Electron orbital angular momentum, 599 Electron probability density function, 93 p-electron repulsion energy, 277 Electron spin, 132–136 Electron spin angular momentum, 599 Electron spin resonance hyperfine splitting constants, 271–275 Electrophilic aromatic substitution, 289 Elliptical coordinates, 217 Energies for benzene, 531f description of, 30–32 extended Hückel experimental energies and, 340–342 Mulliken populations and, 338–340 ground state description of, 368, 371t to first-order of heliumlike systems, 403–405 Energy level splitting, 47 Energy to first order, 392, 397–399 Energy to zeroth order, 392 Equation of motion, 69–70 Equivalent electrons, 156–159 Equivalent orbitals, 472 Equivalent representations, 445, 458 Ethane, 440 Ethylene, 263–264, 606–607 Ethylene molecular orbitals, 268 Exchange integrals, 142, 619 Exchange operator, 351 Exclusion principle, 136, 146 Experimental energies, 340–342 Explicit integration, 397–398 Exponentials, 39 Extended Hückel energies experimental energies and, 340–342 Mulliken populations and, 338–340 Extended Hückel method band calculations, 560 basis set, 324–325 description of, 324, 541 eigenvalues, 328–331 eigenvectors, 328–331 hamiltonian matrix, 326–328 K parameter, 333–335 Mulliken populations, 335–340 nuclear coordinates, 324 overlap matrix, 325–326, 344 for polyacetylene, 552–554 total energy, 331–332 External potential, 368 F Fermi contact interaction, 193 Fermi energy, 540, 550, 577 Fermion, 136 Filled molecular orbitals, 472 Finite central barrier, particle in an infinite “box” with, 44–47 First Brillouin zone description of, 536, 564 reduced, 540, 564, 566, 577 First-order corrections to .1, 399–401 description of, 392–394 Index 695 First-order Stark effect, 411 First-order structure, 485 Fock operator, 350–351 Force constant, 69 Formaldehyde eigenvalues, 343 Formaldehyde orbital numbering, 343 Free particle in one dimension description of, 47–50, 526–529 particle in a ring problem and, similarities between, 52 Free radical reactions, 292 Free valence, 292 Frequency factor, 4 Frontier orbitals, 291, 505–508 Frozen atomic orbitals, 486 Fulvene, 606, 610 Functions acceptable, 22 angle-dependent, 97 Bloch, 534 continuous, 532 Dirac delta, 170 distribution, 70 electron probability density, 93 Gaussian, 355–356 Kronecker delta, 38 Legendre, 108, 589 linear combination of, 7 linearly independent, 77 orthogonal, 385 orthonormal, 38 polarization, 355 single-valued, 21 square-integrable, 22 G Gauss error function, 80 Gaussian basis set, 353 Gaussian functions, 183, 355–356 Gaussian wave packets, 184 Generalized gradient approximation, 370 Gerade, 213, 215, 548 Gimarc’s diagram, 502 Givens–Householder–Wilkinson method, 320 Graphite, 565–576 Gross atomic orbital population, 337 Ground state energy description of, 368, 371t to first-order of heliumlike systems, 403–405 Group abelian, 430 definition of, 430 point, 441–443 representations for from basis functions, 446–451 cyclic groups, 453–456 description of, 443–446 irreducible inequivalent, 456–458 labels, 636–637 labels for, 451–452 molecular orbitals and, 452–453 one-dimensional, 444, 454, 473 two-dimensional, 444, 454 symbols for, 635–636 symmetry point, 431–434 Group theory elementary example of, 429–430 overview of, 429 H H2, 488–490 H, 401 H2+ molecule–ion antibonding states of, 487 bonding states of, 487 description of, 206–220, 485–488 H2 vs., 488–490 H3AAH3 system, 501 HAB, 499, 500f HAH, 497, 499 Hamiltonian, 20, 349, 404 Hamiltonian matrix description of, 319, 326–328 integrals of, 473 partitioned, 469f Hamiltonian operators, 20, 53, 131–132, 180, 593–595 Harmonic electric-field wave, 9f Harmonic motion, 70 Harmonic oscillation, 2 Harmonic oscillator one-dimensional, 69–72 quantum-mechanical, 72–74 Schrödinger equation for asymptotic behavior, 74–75, 78–79 description of, 72–73 differential equation for q(x), 75–76 energy spectrum, 79–80 f as a power series, 76–77 recursion relation for, 77–78 simplifying of, 74 wavefunctions, 80–81 wavefunctions description of, 73–74, 80–81 normalization of, 81–82 orthogonality of, 81–82 zero-point energy of, 74 Harmonic wave, 2, 3f Hartree–Fock energy correlation energy, 357–358 density functional theory and, 381–382 estimating of, 371 restricted, 357, 360 unrestricted, 357 696 Index Hartree–Fock energy curve, 381 Hartree–Fock equation atomic orbitals used with, 623 definition of, 623 derivation of, 614–623 description of, 350 Hartree–Fock limit, 357, 627 Hartree–Fock wavefunctions, 375t, 627 Heat of hydrogenation, 279 Heisenberg uncertainty principle, 18–19, 31 Helium atom description of, 134 nonlinear variation for, 194–197 1s2s configuration of, 138–144 Heptatrienyl, 606 Heptatrienyl radical, 611 Hermite polynomials, 82 Hermitian adjoint of a matrix, 310 Hermitian matrix, 317 Hermitian operators description of, 171–172 eigenvalues of completeness of, 176–177 degenerate, 174 expressed as an orthonormal set, 174–175 nondegenerate, 173–174 orthogonal set formed from, 173–174 proof of, 172–173 Hermiticity, 174, 213 Heteroatomic molecules, 284–287, 285t Hexatriene, 510–511, 606, 610 Highest occupied molecular orbitals, 275, 291, 493, 505, 507f, 519, 540 Homogeneous magnetic field, 133 Homonuclear diatomic molecules closed shells of, 226 electronic states of, 229 molecular orbitals of, 220–231 properties of, 225t qualitative molecular orbital theory rules applied to, 490–494 symmetry orbitals of, 227f Hooke’s law, 69 Hückel molecular orbital method assumptions, 601 benzene, 260, 267–268 bond length, 269–270 bond order, 269–270 butadiene, 264–265, 279 charge distributions from, 256–259 cyclobutadiene, 265–267 determinant a quantity, 248–249 ß quantity, 249 basis atomic orbitals, 247–248 constructing of, 247–248 generalizations, 259–263 manipulation of, 249–250 overlap integrals, 249 topological, 250 determinantal equation for cyclopropenyl system, 253–256 description of, 250 solving for molecular orbitals, 251–253 solving for orbital energies, 250–251 ethylene, 263–264 heteroatomic molecules, 284–287, 285t for hydrocarbons, 268–269 independent p-electron assumption, 246–247 overview of, 244 perturbation at an atom in, 406–409 reaction indices, 289–295 self-consistent variations of a and ß, 287–289 s-p separability, 244–246 summary of, 295–296 Hund’s rule, 148, 159–160 Hybrid orbitals, 470–471 Hydrocarbons Hückel molecular orbital method for, 268–269 Mobius conjugated, 283 neutral alternant, 602 Hydrogen atom nonlinear variation for, 191–194 polarizability of, 197–206, 410–412 s-type states of, 418 Hydrogenlike orbitals, 146, 208 Hyperfine splitting constant, 272 I Identity operation, 459 Improper axis, 436 Improper rotation, 436 Independent electron approximation, 127–129 Independent p-electron assumption, 246–247 INDO, 386t Induced dipole, 411–412 Inductive model, 286 Inhomogeneous magnetic field, 133 Inner repulsion energy, 588 Instantaneous dipole moment, 171 Integrals bond, 249 Coulomb, 249, 413, 587–590 exchange, 142 list of, 582–583 overlap, 249 resonance, 249 Integrand, 472 Intended correlations, 545 Index 697 Interaction element, 204 Intermediate neglect of differential overlap, 386t Internal rotation, 485 Internuclear repulsion energy, 209 Inverse matrix, 311–312 Inverse operation, 430 Ionization energy description of, 278–279 of neon, 372t valence state, 327 Irreducible inequivalent representations, 445, 456–458 J Jacobi method, 319 Jahn–Teller theorem, 258 K K parameter, 333–335 Kinetic energy description of, 43 equation for, 11 of photoelectrons, 11, 12f, 278 Kinetic energy operators, 140 Kohn–Sham orbitals, 369–370 Koopmans’ theorem, 358–360, 372, 472 Kronecker delta, 82, 170 Kronecker delta function, 38 L L2, 595–600 Lagrangian multipliers, 620 LCAO–MO–SCF equation. See Linear combination of atomic orbitals–molecular orbitals–self-consistent field equation Legendre functions, 108, 589 Legendre polynomials, 107 Light as electromagnetic wave, 9–10 electromagnetic field theory of, 12 Light intensity, 17f Linear combination, 7 Linear combination of atomic orbitals–molecular orbital calculation, 208 Linear combination of atomic orbitals–molecular orbitals–self-consistent field equation density functional theory methods, 368–370 eigenvalues, 351–352 ground state wavefunction, 361 multideterminant methods, 367 overview of, 350–351 Linear harmonics, 110 Linear homogeneous equations, 585 Linear momentum, 18 Linear polyenes, 262 Linear position, 18 Linear variation method matrix formulation, 315–317 hamiltonian, 319 overview of, 308 Linearly dependent functions, 7, 76 Linearly independent functions, 77 Lithium, 135 Local density approximation, 370 Localization energy, 294 Localized orbitals, 472 Lowest unfilled molecular orbitals, 275, 291, 505, 509 Lowest-energy eigenfunction, 190 Lowest-energy molecular orbitals, 222, 252, 262, 327, 520 Lowest-energy wavefunction, 93–97 L–S coupling, 152 Lz , 595–600 M Magnetic moment angular momentum and, 115–117 definition of, 115 Magnitude, 592 Mass density, 95 Matrix addition of, 310–311 complex conjugate of, 309, 311–312 definition of, 308 diagonal, 311 of eigenvalues, 316 of eigenvectors, 316 expression of, 308–309 geometric model, 312–314 hamiltonian, 319, 326–328 hermitian, 317 hermitian adjoint of, 310 inverse, 311–312 multiplication of, 310–311 nonsingular, 314 orthogonal, 314 product of, 311–312 rotation, 313 similarity transformation, 314 singular, 314 square, 311 symmetric, 310 transformation, 314 transpose of, 309–312 unit, 311 Matrix equation, 317–320 Matter, wave nature of, 14–16 Maxwell’s differential equations, 10 McConnell relation, 272 698 Index Metastable states, 501 Methane, 339 2-Methoxyethanol, 277 Methylene cyclopropene, 606, 608 Microscopic systems, 18 MINDO/3, 386t Mirror A universe, 434–435 Møller–Plesset perturbation theory, 366 MNDO, 386t Mobile bond order, 259 Mobius conjugated hydrocarbons, 283 Modified intermediate neglect of differential overlap, 386t Modified neglect of diatomic overlap, 386t Molecular orbitals p, 416, 418 antibonding, 217, 226, 258 atomic orbitals in, 465–467 benzene, 530–531, 535, 556 bonding, 217 for cyclopropenyl system, 255 definition of, 208 delocalized, 472 ethylene, 268 filled, 472 frontier, 505–508 hierarchy in, 484–485 higher-energy, 490, 498 highest occupied, 275, 291, 493, 505, 507f, 519, 540 of homonuclear diatomic molecules, 220–231 Hückel molecular orbital determinantal equation solved for, 251–253 for linear polyenes, 262 lowest unfilled, 275, 291, 505, 509 lowest-energy, 222, 252, 262, 327, 520 nonbonding, 252, 602–604 nondegenerate, 329, 467 occupied, 362 representation table and, 452–453 self-consistent field–molecular orbitals, 384–386 subjacent, 506 superjacent, 506 symmetry of, 296, 463–465 virtual, 362 Molecular rotation, angular momentum in, 117–118 Molecules ab initio calculations for, 374–382 homonuclear diatomic. See Homonuclear diatomic molecules point group of, 441–443 Mulliken populations description of, 335 extended Hückel energies and, 338–340 net atomic orbital, 335 overlap, 335–336, 345, 541 Multiconfigurational self-consistent field calculation, 367 N Naphthalene, 606, 613 NDDO, 386t Nearest-neighbor interaction, 484–485, 496 Negative eigenvalues, 93 Neglect of diatomic differential overlap, 386t Neutral alternant hydrocarbons, 602 Newton’s laws of motion, 4–5 Nodes, 3 Nonbonding molecular orbitals, 252, 602–604 Nonconcerted process, 509 Noncrossing rule, 230 Nondegenerate eigenfunctions, 131, 218 Nondegenerate state, 6, 31 Nondegenerate wavefunctions, 448 Nonequivalent electrons, 153–154 Nonsingular matrix, 314 Norbornadiene, 416–417 Nuclear motion, potential energy for, 207 Nuclear spin angular momentum, 599 Nucleophilic aromatic substitution, 289 O Occupied molecular orbitals, 362 Occupied orbitals, 353 Octatetraene, 606, 612 Off-diagonal determinantal element, 204 One-dimensional harmonic oscillator, 69–72 One-dimensional representations, 444, 454, 473 One-electron density function, 256 One-electron energy, 128, 352 Open-shell systems, 373 Operators angular momentum, 52, 593–595, 599 commuting, 175–176 constructing of, 167–168 coulomb, 350–351, 620 definition of, 20 exchange, 351 Fock, 350–351 hamiltonian, 20, 53, 180, 593–595 permutation, 614 postulate for constructing, 167–168 quantum-mechanical, 592–593 Orbital(s) atomic. See Atomic orbitals crystal, 539, 559f description of, 101–102 equivalent, 472 frontier, 291, 505–508 hybrid, 470–471 hydrogenlike, 146, 208 Index 699 interaction between, 414–417 Kohn–Sham, 369–370 localized, 472 molecular. See Molecular orbitals occupied, 353 separated-atom, 228f Slater-type basis sets, 353–354 contracted Gaussian functions for, 356 description of, 146–147, 324 minimal valence, 471 p-type, 354 symmetry, 220–222, 227f Orbital approximation, 233–235 Orbital energies definition of, 352 description of, 128 internuclear separation for He2, 489f ionization energies and, 278–279 oxidation-reduction potentials and, 275–277 solving of Hückel molecular orbital determinantal equation for, 250–251 Orbital orthogonality, 617 Orbital symmetries, 515 Oriented dipole, 133 Orthogonal functions, 385 Orthogonal matrix, 314 Orthogonal transformation, 314 Orthogonality in irreducible inequivalent representations, 456–458 of vectors, 457 of wavefunctions, 37–38, 81–82 Orthonormal functions, 38 Orthonormal set, 174–175 Oscillating electric charge, 10f Oscillating particle distributions, 527 Oscillation, harmonic, 2 Outer repulsion energy, 588 Overlap integrals, 249 Overlap matrix, 325–326, 344 Overlap population, 335–336, 345, 541 Oxidation-reduction potentials, 275–277 Pp orbitals, 149 Pairing of roots, 601–602 Pairing theorem, 262 Particle(s) in an infinite “box” with a finite central barrier, 44–47 in a one-dimensional box definition of, 27 energies, 30–32 with one finite wall, 38–43 overview of, 27–30 wavefunctions, 32–38 in a ring of constant potential, 50–52 description of, 529–530 scattering of, in one dimension, 56–59 in a square well, 27 in a three-dimensional box, 53–56 Partitioned hamiltonian matrix, 469f Pauli principle, 137–138, 148, 156, 178, 268 Peierls distortion, 550, 570, 577 Pentadienyl, 606 Pentadienyl radical, 609 Periodic potential, 526 Periodic structures, 528 Periodicity in three dimensions, 565–576 two-dimensional, 562–565 Permanent dipole, 411 Permutation operator, 614 Perturbation definition of, 391–392 uniform electrostatic, 396–403 Perturbation theory degenerate-level, 412–414 Møller–Plesset, 366 overview of, 391–392 Rayleigh–Schrödinger .2 effects on, 402–403 additive constant in H, 401 at an atom in the simple Hückel molecular orbital method, 406–409 for degenerate state, 409–410 first-order corrections, 392–394, 399–401 formal development of, 391–396 ground-state energy to first-order of heliumlike systems, 403–405 overview of, 391 W1(2), 401–402 spectroscopic selection rules and, 417–420 Phase factor, 3  equation, 106 Photoelectric effect, 10–14 Photoelectrons, 11, 12f, 278 Photons characteristics of, 14 definition of, 12 Einstein’s relation for, 14 rest mass of, 14 p bands, 554–555 Piecewise continuous, 28 Planck, 10 Planck’s constant, 18, 631 Point group of a molecule, 441–443 Point of inversion, 436 Polarizability of the hydrogen atom, 197–206, 410–412 700 Index Polarization functions, 355 Polyacetylene all-trans, 551–552 with alternating bond lengths, 547–551 energy calculations, 562 extended Hückel molecular orbital method results for, 552–554 self-consistent field results for, 552–554 with uniform bond lengths, 537–546 Polyatomic molecules, 495–505 Poly-paraphenylene, 555–561 Population density, 95 Postulates for average values, 171 for constructing operators, 167–168 measured values related to eigenvalues, 169–170 Schrödinger equation, 168–169 wavefunction, 166–167 Potential energy determination of, 383 of electrons, 397 for nuclear motion, 207 quantum-mechanical average value of, 83–84 Probability density description of, 13 volume-weighted, 95f Probability waves, 13 . as stationary state, 180 conditions on, 21–22 triple value of, 22f .2, 402–403 p-type atomic orbital, 219 p-type Slater-type orbitals, 354 2px orbital, 101–102 2py orbital, 101–102 Pyrrole molecule, 286 2pz orbital, 101–102 Q Qualitative molecular orbital theory H2+, 485–488 molecular orbitals, 484–485 molecular structure, 484–485 need for, 484 of reactions, 508–521 rules for description of, 490 homonuclear diatomic molecule applications, 490–494 Quantum numbers, 30, 97–98 Quantum-mechanical average, 96–97 Quantum-mechanical average value of potential energy, 83–84 Quantum-mechanical harmonic oscillator, 72–74 Quantum-mechanical operators, 592–593 Quantum-mechanical tunneling, 41 q(x), 75–76 R R equation, 108–109 Radial node, 98 Radical addition, 289 Rayleigh–Ritz variation principle description of, 178 proof of, 190 Rayleigh–Schrödinger perturbation theory .2 effects on, 402–403 additive constant in H, 401 at an atom in the simple Hückel molecular orbital method, 406–409 for degenerate state, 409–410 first-order corrections, 392–394, 399–401 formal development of, 391–396 ground-state energy to first-order of heliumlike systems, 403–405 overview of, 391 W1(2), 401–402 Reaction indices, 289–295 Reactions electrocyclic, 508 qualitative molecular orbital theory of, 508–521 stereospecific, 509 Reciprocal lattice, 564 Reciprocal operation, 430 Reciprocal space, 562–565 Recursion relation, 77–78 Reduced first Brillouin zone, 540, 564, 566, 577 Reducible representations, 445, 462–463 Redundancies, 432 Reflection plane, 436 Reflection symmetry, 205 Representations from basis functions, 446–451 cyclic groups, 453–456 description of, 443–446 irreducible inequivalent, 456–458 labels, 636–637 labels for, 451–452 molecular orbitals and, 452–453 one-dimensional, 444, 454, 473 two-dimensional, 444, 454 Repulsive energy, 588 Resonance energy, 281 Resonance energy per electron, 281, 282f Resonance integrals, 249 Rest mass description of, 14 of electron, 631 of neutron, 631 of proton, 631 Index 701 Restricted Hartree–Fock energy, 357, 360 Rigid-rotor model, 117–118 Ring of constant potential, particle in, 50–52 Rotation axis, 436 Rotation matrix, 313 Row vector, 309 Russell–Saunders coupling, 152 Rydberg series, 232 Rydberg state, 232 S 1s atomic orbitals, 498, 587–590 2s orbital, 101 Scalar, 309–310, 312 Scanning tunneling microscopy, 575 Schmidt orthogonalization, 175, 468 Schrödinger equation in atomic units, 110 center-of-mass coordinates, 90 for circular motion, 50 description of, 19–20, 22–23, 89–91 eigenvalues, 92–93 in free particle in one dimension, 47–48 harmonic oscillator asymptotic behavior, 74–75, 78–79 description of, 72–73 differential equation for q(x), 75–76 energy spectrum, 79–80 f as a power series, 76–77 recursion relation for, 77–78 simplifying of, 74 wavefunctions, 80–81 higher-energy solutions for, 98–105 lowest-energy wavefunction, 93–97 for one-dimensional harmonic oscillator, 72–73 for particle in the one-dimensional square wall, 54 for periodic structures, 528 postulate for, 168–169 quantum numbers, 97–98 separation of variables, 105–106 time-dependent, 168–169, 180 Second-order Stark effect, 411 Second-order structure, 485 Secular determinant, 200–201, 211 Secular equation, 200 Self-consistent field acronyms, 386 atomic orbitals, 147 configuration interaction, 360–365 definition of, 348 description of, 146–147 equations, 622 hamiltonian, 349 Hartree–Fock limit, 357 Koopmans’ theorem, 358–360 multiconfigurational, 367 polyacetylene results, 552–554 total electronic energy, 352–353 wavefunction for, 349–350 Self-consistent field–molecular orbitals, 384–386 Separated-atom basis, 208 Separated-atom energy, 487–488 Separated-atom limits, 208 Separated-atom orbitals, 228f s bond network, 295 s-p separability, 244–246 Sigmatropic shift, 519 Similarity transformations, 314, 459 Singlet, 142 Single-valued function, 21 Singular matrix, 314 Slater determinants, 137–139, 349, 369, 622 Slater-type orbitals basis sets, 353–354 contracted Gaussian functions for, 356 description of, 146–147, 324 minimal valence, 471 p-type, 354 Spectroscopic selection rules, 417–420 Spectroscopists, 14 Spherical harmonics angular momentum and, 110–115 definition of, 111 Spherical polar coordinates, 90, 217, 454 Spherically symmetric potential, 91, 116 Spin forbidden, 418 Spin states, 136 Spin-free density function, 166 “Split shell” wavefunction, 195 Splitting, 154 Splitting constants, 271–275 Square absolute, 21, 461 of electromagnetic wave, 21 rotation effects on, 431f Square matrix, 311 Square symmetry, 266 Square-integrable function, 22 1s2s configuration of helium, 138–144 Standing waves in clamped string, 7–9 description of, 3–4 Stark effect, 411 State correlation diagram, 519f State energy, 501 Stater-type orbitals, 208 Stereospecific reactions, 509 Stilbene, 283 Strain energy, 284 Subjacent molecular orbitals, 506 Sum of squares of dimensions, 457–458 Superjacent molecular orbitals, 506 702 Index Symmetric matrix, 310 Symmetry in correlation diagrams, 514 integration and, 472–476 reflection, 205 Symmetry elements, 436–441 Symmetry operations, 431, 439 Symmetry orbitals definition of, 468 description of, 220–222, 515 generating of, 467–470 for homonuclear diatomic molecules, 227f unnormalized, 468 Symmetry point groups, 431–434 Symmetry-forbidden reaction, 513, 517 T Term symbols, 152  equation, 107–108 Third-order structure, 485 Three-dimensional Bernal graphite, 571f, 572 Three-dimensional box, particle in, 53–56 Time-dependent differential equation, 6 Time-dependent Schrödinger equation, 168–169 Time-dependent states, 180–185 Time-independent wave equation description of, 6, 8f Schrödinger. See Schrödinger equation Topological determinant, 250 Torsional angle change, 485 Total electronic energy, self-consistent field, 352–353 Total extended Hückel energy, 331–332 Transformation matrix, 314 Transposing of a matrix, 309–312 Transverse waves, 5 Traveling waves, 1–3 Trimethylenemethane, 292 Triplet, 142 Tunneling, quantum-mechanical, 41 Two-dimensional graphite, 569, 570f Two-dimensional periodicity, 562–565 Two-dimensional representations, 444, 454 U Uncertainty principle of Heisenberg, 18–19, 31, 114, 179 Ungerade, 213, 215, 548 Uniform electrostatic perturbation, 396–403 Unit matrix, 311 Unitary transformation, 314, 445–446 United-atom atomic orbitals, 227f-228f United-atom limits, 208 Unperturbed eigenfunctions, 395 Unperturbed energy, 392 Unrestricted Hartree–Fock energy, 357 V Vacuum permittivity, 631 Valence state ionization energy, 327 Variables conjugate, 18 separation of, 105–106 Variation method linear, 197–206 nonlinear calculations, 196–197 for helium atom, 194–197 for hydrogen atom, 191–194 orbital approximation, 233–235 spirit of, 190 Variation principle, 178 Variational wavefunction, 231–233 Vectors addition of, 310–311 column, 309 matrix multiplication of, 311 multiplication of, 310–311 orthogonality of, 457 in reciprocal space, 562–565 representation, 457 two-dimensional, 312 Vertical planes, 438–439 Vibrations of diatomic molecules, 84–85 Virial relation, 373 Virial theorem for atoms, 624–627 for diatomic molecules, 627–628 Virtual molecular orbitals, 362 Volume-weighted probability density, 95f W W1(2), 401–402 Walden inversion, 520 Walsh diagrams, 495–505, 497f, 503f-504f Wave(s) amplitude of, 2 classical equation for, 4–7 compression, 5 electromagnetic, 12–13 frequency of, 3 harmonic, 2, 3f light as, 10 probability, 13 standing, 3–4, 7–9 transverse, 5 traveling, 1–3 wavelength of, 2 Wave profile, 1 Wavefunctions antisymmetric, 136, 213 description of, 32–35 determinantal, 349–350 Dirac delta function as, 178 gerade, 213, 215 hamiltonian, 144 Index 703 of harmonic oscillator, 73–74 Hartree–Fock, 358, 375t, 627 lowest-energy, 93–97 nondegenerate, 448 normalization of, 81–82 orthogonality of, 37–38, 81–82 postulate, 166–167 Schrödinger equation for harmonic oscillator, 80–81 self-consistent field, 349–350 Slater determinantal, 137, 139 “split shell,” 195 symmetry of, 35–37, 47, 136, 213 ungerade, 213, 215 variational, 231–233 zeroth-order, 393 Wavelength de Broglie, 15 description of, 2 Wavenumber, 14, 528 Wigner–Seitz unit cell, 564 Wolfsberg–Helmholtz relation, 327 Woodward–Hoffman rule, 510–511 Work function, 12 Z Zeeman effect, 154–156 Zeeman splitting, 116 Zero average momentum, 184–185 Zero differential overlap, 385 Zero-point energy, 31 Zeroth-order wavefunction, 393