Crystal structure analysis using the 'superposition' - and 'complementary' - structures

Ernst Höhne and Leo Kutschabsky

1. General

Quite a number of crystal structures contain parts (e.g. heavy atoms or building units) with a higher symmetry (e.g. with additional translation or pseudotranslation) than the structure as a whole. In such cases standard methods may not lead to correct results, and special methods may have to be applied.

As an example let the heavy atoms (strongly reflecting atoms) have parameters x_s, y_s, z_s and x_s, y_s, $\frac{1}{2}$ + z_s (Fig. 1). Then they contribute only to the structure factors F(hkl) with l = 2n, since their contribution F_s(hkl) is equal to

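$\begin{displaymath} F_s(hkl) = f_s \exp[2{\pi}i(hx_s + ky_s + lz_s)] [1 + \exp({\pi}il)]\end{displaymath}$

(1.1)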

Therefore the |F(hkl)|² with l = 2n are systematically strong, compared with those with l = 2n + 1, i.e. for the mean values $\langle{I}(hkl)\rangle$ of the intensities within any region of $\sigma$ values

$\begin{displaymath} \langle{I}(hkl)_{l=2n}\rangle \mbox{ strong}, \qquad \langle{I}(hkl)_{l=2n+1}\rangle \mbox{ weak}.\end{displaymath}$
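The cancellation for odd l can be checked numerically. The following is only a minimal sketch: the function name `pair_contribution`, the unit scattering factor and the heavy atom coordinates are assumptions chosen for illustration, and the sign convention exp[+2πi(hx + ky + lz)] is assumed for the structure factor.

```python
import cmath

def pair_contribution(f_s, h, k, l, x_s, y_s, z_s):
    """Contribution of two equal atoms at (x, y, z) and (x, y, z + 1/2)."""
    first = cmath.exp(2j * cmath.pi * (h * x_s + k * y_s + l * z_s))
    second = cmath.exp(2j * cmath.pi * (h * x_s + k * y_s + l * (z_s + 0.5)))
    return f_s * (first + second)

# Hypothetical heavy atom position and unit scattering factor.
x_s, y_s, z_s = 0.13, 0.27, 0.41
amplitudes = {l: abs(pair_contribution(1.0, 2, 1, l, x_s, y_s, z_s))
              for l in range(4)}
for l, amplitude in amplitudes.items():
    print(l, round(amplitude, 6))
```

For odd l the two terms cancel, for even l they add to twice the single-atom amplitude, whatever the values of x_s, y_s and z_s.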

**Figure 1:** Crystal structure in P1. $\clubsuit$ - symbol for the arrangement of heavy atoms, $\diamondsuit$ and $\Box$ remaining atoms of the structure.
$\begin{figure} \includegraphics {fig1.ps} \end{figure}$

If such a systematic distribution of the intensities or a similar one occurs, it is useful, for methodical reasons, to regard the electron density $\rho(x, y, z)$ as the sum of two - or if necessary - of several parts.

Thus the electron density function

$\begin{displaymath} {\rho}(x, y, z) = \sum_h \sum_k \sum_l F(hkl) \exp[-2{\pi}i(hx + ky + lz)]\end{displaymath}$

may for the case indicated in Fig. 1 be written

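$\begin{displaymath} {\rho}(x, y, z) = \sum_h \sum_k \sum_{l=2n} F(hkl) \exp[-2{\pi}i(hx + ky + lz)] + \sum_h \sum_k \sum_{l=2n+1} F(hkl) \exp[-2{\pi}i(hx + ky + lz)]\end{displaymath}$

(1.2)

$\begin{displaymath} {\rho}(x, y, z) = {\rho}_{\mbox{sup}}(x, y, z) + {\rho}_{\mbox{com}}(x, y, z)\end{displaymath}$

(1.2')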

where

${\rho}_{\mbox{sup}}(x, y, z)$ denotes a hypothetical structure, called the superposition structure, which is related to the real structure in the following way

$\begin{displaymath} \textstyle{\rho}_{\mbox{sup}}(x, y, z) = \frac{1}{2}[{\rho}(x, y, z) + {\rho}(x, y, \frac{1}{2} + z)]\end{displaymath}$

(1.3)

In ${\rho}_{\textrm{sup}}(x, y)$ (Fig. 2) the two heavy atoms ( $\clubsuit$ ) appear with correct weights because they are connected by a translation of c/2, whereas the remaining atoms (squares and triangles) appear with half their weights (marked by shaded symbols) at the original position and at a second point shifted relative to it by c/2.

**Figure 2:** Superposition structure. $\clubsuit$ - heavy atoms $\heartsuit$ , $\spadesuit$ - remaining atoms with half weight.
$\begin{figure} \includegraphics {fig2.ps} \end{figure}$

Generally speaking, the symmetry of ${\rho}_{\mbox{sup}}(x, y, z)$ is identical with the symmetry of the arrangement of the heavy atoms (or other building units with higher symmetry) taken by themselves. This higher symmetry may either be strictly true for the heavy atoms taken by themselves, or only in approximation. In the latter case it may be useful to disregard deviations from the higher symmetry, to start with.

The symmetry of the complementary structure ${\rho}_{\mbox{com}}(x, y, z)$ follows from (1.2) and (1.3):

$\begin{displaymath} \textstyle{\rho}_{\mbox{com}}(x, y, z) = \frac{1}{2}[{\rho}(x, y, z) - {\rho}(x, y, \frac{1}{2} + z)]\end{displaymath}$

(1.4)

Its properties are shown in Fig. 3. Accordingly, the heavy atoms (i.e. those which occur in pairs related by a shift of c/2) are absent in ${\rho}_{\mbox{com}}(x, y, z)$, whereas any other atom appears with half its weight at its real position and with half negative weight at a position shifted by c/2. The space group symmetry of the arrangement with positive weights in ${\rho}_{\mbox{com}}(x, y, z)$ is identical with the space group of the real structure $\rho(x, y, z)$. In a similar way, the introduction of a superposition and a complementary structure may be indicated by systematically strong reflections occurring, e.g. for h = 2n or h + k = 2n or h + k + l = 2n etc.
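The splitting of a density into superposition and complementary parts can be illustrated in one dimension. The sketch below is an illustration under stated assumptions, not part of the original method: the sampled density, the grid of 16 points and the helper names `split_density` and `fourier_coefficient` are hypothetical.

```python
import cmath

def fourier_coefficient(density, l):
    """Discrete Fourier coefficient F(l) of a density sampled along z."""
    n = len(density)
    return sum(rho * cmath.exp(2j * cmath.pi * l * i / n)
               for i, rho in enumerate(density))

def split_density(density):
    """Return (rho_sup, rho_com) for a shift of half the period."""
    n = len(density)
    half = n // 2
    sup = [0.5 * (density[i] + density[(i + half) % n]) for i in range(n)]
    com = [0.5 * (density[i] - density[(i + half) % n]) for i in range(n)]
    return sup, com

# Hypothetical density: heavy pair at z = 2/16 and 10/16 (shift c/2),
# one light atom at z = 5/16.
density = [0.0] * 16
density[2] = density[10] = 8.0
density[5] = 1.0
sup, com = split_density(density)

print([round(abs(fourier_coefficient(sup, l)), 6) for l in range(4)])
print([round(abs(fourier_coefficient(com, l)), 6) for l in range(4)])
```

In this example the heavy pair keeps its full weight in `sup` and vanishes from `com`, the light atom appears with weights +0.5 and +0.5 in `sup` and with +0.5 and -0.5 in `com`, and the Fourier coefficients of `sup` and `com` are non-zero only for even and odd l respectively.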

**Figure 3:** Complementary structure. $\heartsuit$ , $\spadesuit$ - atoms with positive half weight, $\flat$ , $\natural$ - atoms with negative half weight.
$\begin{figure} \includegraphics {fig3.ps} \end{figure}$

If, on the other hand, the heavy atoms (or another part of the structure, taken by itself) possess a higher symmetry (other than translation) than the structure as a whole, the introduction of other kinds of hypothetical structures, also to be called ${\rho}_{\mbox{sup}}(x, y, z)$ and ${\rho}_{\mbox{com}}(x, y, z)$, may be of use. In this case, no systematically strong and weak reflections result.

As an example, we consider a structure in P1 containing two heavy atoms of the same element per unit cell. The heavy atoms considered by themselves are related by a centre of symmetry. The structure computed with phases (signs) taken from the heavy atom contribution is necessarily centrosymmetric and is related to the real structure by its superposition with its centrosymmetric image (Fig. 4). If referred to such a partial centre of symmetry as origin, the electron density distribution of the superposition structure ${\rho}_{\mbox{sup}}(x, y, z)$ may be expressed as

$\begin{displaymath} \textstyle{\rho}_{\mbox{sup}}(x, y, z) = \frac{1}{2}[{\rho}(x, y, z) + {\rho}(\overline{x}, \overline{y}, \overline{z})]\end{displaymath}$

(1.5)
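That the superposition of a structure with its centrosymmetric image has real structure factors, equal to the real parts of the true ones, can be checked with a small one-dimensional model. The atoms, weights and the function name `structure_factor` below are assumptions made for this sketch only.

```python
import cmath

def structure_factor(atoms, h):
    """Structure factor of point atoms given as (weight, x) pairs."""
    return sum(w * cmath.exp(2j * cmath.pi * h * x) for w, x in atoms)

# Hypothetical 1D non-centrosymmetric structure: heavy atoms at +0.2 and
# -0.2 (centrosymmetric by themselves) and one light atom at 0.35.
atoms = [(8.0, 0.2), (8.0, -0.2), (1.0, 0.35)]

# Superposition structure: every atom with half weight at x and at -x.
sup_atoms = [(0.5 * w, x) for w, x in atoms] + [(0.5 * w, -x) for w, x in atoms]

for h in range(1, 4):
    f_real = structure_factor(atoms, h)
    f_sup = structure_factor(sup_atoms, h)
    print(h, round(f_real.real, 4), round(f_sup.real, 4), round(f_sup.imag, 4))
```

The heavy atoms keep their full weight in `sup_atoms`, while the light atom is split into two half-weight images, which is the ambiguity that has to be resolved with chemical knowledge.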

The real structure, or at least part of it, may in many cases of this kind be obtained using well established chemical knowledge, such as atomic distances or the known stereochemistry of molecules, parts of molecules or coordination polyhedra.

**Figure 4:** Superposition structure (centrosymmetric). $\clubsuit$ - centrosymmetric arrangement of heavy atoms, $\heartsuit$ , $\spadesuit$ - remaining atoms with half weights.
$\begin{figure} \includegraphics {fig4.ps} \end{figure}$

2. Special methods

2.1 Linear structure factor equations

In many cases two coordinates (e.g. x_j, y_j) of any atom in the real structure are the same as those in the superposition structure. The task then is to determine the third atomic coordinate (z_j). The method of linear structure factor equations (SFE) by Kutschabsky² and by Kutschabsky and Höhne³ allows us to calculate these atomic coordinates (z_j) directly, using the reflections of the first layer of the reciprocal lattice, F(hk1).

The basic relation follows directly from the formula for the structure factor

$\begin{displaymath} F(\textbf{H}) = \sum_{s = 1}^p \sum_{j = 1}^N (f_{sj}(\textbf{H}) \cos 2{\pi}\textbf{Hr}_{sj} + if_{sj}(\textbf{H}) \sin 2 {\pi}\textbf{Hr}_{sj})\end{displaymath}$

(2.1)

where p is the number of equipoints in the unit cell, N is the number of symmetrically independent atoms, f_sj is the atomic scattering factor and $\textbf{r}_{sj}$ is the radius vector of the centre of atom s, j.

Vector $\textbf{H}$ is defined by $\textbf{H} = h\textbf{a}^* + k\textbf{b}^* + l\textbf{c}^*$, where $\textbf{a}^*$, $\textbf{b}^*$, $\textbf{c}^*$ are the basic reciprocal vectors.

Using the symmetry matrices R_s and the translation t_s we obtain:

$\begin{displaymath} F(\textbf{H}) = \sum^p_{s = 1} \sum^N_{j = 1} f_{sj}(\textbf{H})[\cos 2{\pi}\textbf{H}(R_s\textbf{r}_j + \textbf{t}_s) + i \sin 2{\pi}\textbf{H}(R_s\textbf{r}_j + \textbf{t}_s)]\end{displaymath}$

(2.2)

The separation into the components leads to

$\begin{displaymath} F(\textbf{H}) = \sum^p_{s = 1} \sum^N_{j = 1} f_{sj}(\textbf{H})[\cos 2{\pi}({\alpha}_sx_j + {\beta}_sy_j + {\gamma}_sz_j + {\delta}_s) + i \sin 2{\pi}({\alpha}_sx_j + {\beta}_sy_j + {\gamma}_sz_j + {\delta}_s)]\end{displaymath}$

(2.3)

(2.4)

In the most important cases the factor ${\gamma}_s$ depends only on l (not on h or k). In those cases we obtain for structure factors F(hkl) with constant l = L

$\begin{displaymath} F(\textbf{H}) = \sum^N_{j = 1} (a_j + ib_j) C^{(L)}_j + \sum^N_{j = 1} (c_j + id_j)S_j^{(L)}\end{displaymath}$

(2.5)

where the unknown variables cos 2 ${\pi}Lz_j$ and sin 2 ${\pi}Lz_j$ have been denoted by C^(L)_j and S^(L)_j, respectively, and their known coefficients by a_j, b_j, c_j and d_j.

a_j = a_j(h, k, f_j, x_j, y_j) etc. where the exact form of dependence on h, k, f_j, x_j and y_j may be obtained from Table 4 of International Tables for X-ray Crystallography, Vol. 1.

The 2N unknown variables C^(L)_j and S^(L)_j may be determined by a system of linear equations using $F_{\mbox{obs}}(\textbf{H})$ for $F(\textbf{H})$.

If the phases of the $F_{\mbox{obs}}(\textbf{H})$ are unknown the unobserved reflections may be used to obtain a system of homogeneous linear equations. Often it is of advantage to use in addition to the homogeneous equations one equation belonging to a strong structure factor whose phase may be fixed arbitrarily in centrosymmetrical space groups, and in the non-centrosymmetrical space groups in which the origin may have any position in the z-direction.

Because the coefficients of these equations are inaccurate and, moreover, the structure factors of the unobserved reflections are only approximately zero, a more accurate solution for the values C^(L)_j and S^(L)_j may be obtained by using more equations than there are variables and by minimizing the sum of the squares of the deviations $\sum\vert F_{\mbox{obs}}(\textbf{H})-F_{\mbox{calc}}(\textbf{H})\vert^2$, where $F_{\mbox{calc}}(\textbf{H})$ stands for the right side of equation (2.5) and F(H) on the left side is replaced by $F_{\mbox{obs}}(\textbf{H})$. The C^(L)_j and S^(L)_j from the first calculation may be used to determine the phases of further structure factors. Taking these equations in addition to those used already, the number of equations increases and thus the accuracy of C^(L)_j and S^(L)_j is improved.

If the F(hkl) with L = 1 are used, the atomic parameters z_j of all atoms resolved in the (x, y)-projection follow from C⁽¹⁾_j= cos 2 ${\pi}z_j$ and S⁽¹⁾_j = sin 2 ${\pi}z_j$ and (C⁽¹⁾_j)² + (S_j⁽¹⁾)² = 1. The accuracy of the values z_j may be further improved by using F(hkl) with L larger than one.
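The least-squares solution of such a system can be sketched for the simplest case. The example below is an illustration only, under several assumptions that are not taken from the original papers: space group P1, three hypothetical atoms with known x_j, y_j and constant scattering factors, error-free F(hk1) with known phases, and hand-written helpers `solve_linear` and `least_squares` built on the normal equations.

```python
import cmath
import math

def solve_linear(matrix, rhs):
    """Solve a small square system by Gaussian elimination with pivoting."""
    n = len(rhs)
    aug = [row[:] + [rhs[i]] for i, row in enumerate(matrix)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(aug[r][col]))
        aug[col], aug[pivot] = aug[pivot], aug[col]
        for r in range(col + 1, n):
            factor = aug[r][col] / aug[col][col]
            for c in range(col, n + 1):
                aug[r][c] -= factor * aug[col][c]
    x = [0.0] * n
    for i in reversed(range(n)):
        s = sum(aug[i][c] * x[c] for c in range(i + 1, n))
        x[i] = (aug[i][n] - s) / aug[i][i]
    return x

def least_squares(rows, values):
    """Minimise the sum of squared residuals via the normal equations."""
    n = len(rows[0])
    ata = [[sum(r[i] * r[j] for r in rows) for j in range(n)] for i in range(n)]
    atb = [sum(r[i] * v for r, v in zip(rows, values)) for i in range(n)]
    return solve_linear(ata, atb)

# Hypothetical atoms (f_j, x_j, y_j, z_j); z_j is treated as unknown.
atoms = [(6.0, 0.10, 0.20, 0.12), (7.0, 0.35, 0.60, 0.40), (8.0, 0.70, 0.15, 0.83)]

rows, values = [], []
for h in range(-3, 4):
    for k in range(-3, 4):
        coeffs = [f * cmath.exp(2j * cmath.pi * (h * x + k * y))
                  for f, x, y, _ in atoms]
        # Simulated "observed" F(hk1) including its phase.
        f_hk1 = sum(g * cmath.exp(2j * cmath.pi * z)
                    for g, (_, _, _, z) in zip(coeffs, atoms))
        # F = sum_j g_j (C_j + i S_j): real and imaginary parts give
        # one linear equation each in the unknowns C_j and S_j.
        rows.append([g.real for g in coeffs] + [-g.imag for g in coeffs])
        values.append(f_hk1.real)
        rows.append([g.imag for g in coeffs] + [g.real for g in coeffs])
        values.append(f_hk1.imag)

solution = least_squares(rows, values)
n_atoms = len(atoms)
z_found = [math.atan2(solution[n_atoms + j], solution[j]) / (2 * math.pi) % 1.0
           for j in range(n_atoms)]
print([round(z, 4) for z in z_found])
```

With real data the phases are known for only part of the reflections, so the system would be built from unobserved reflections and a few phased ones, as described above. Using both C⁽¹⁾_j and S⁽¹⁾_j in `atan2` avoids the sign ambiguity that either value alone would leave.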

2.2 The application of direct methods to centrosymmetric structures containing heavy atoms⁴

It is assumed that the positions of the heavy atoms are known and that there is a sufficient number of reflections whose signs are determined by the heavy atoms. These reflections do not obey the probability relation (2.6).

$\begin{displaymath} S_{h+h^{\prime}} {\sim} S_h . S_{h^{\prime}}\end{displaymath}$

(2.6)

On subtracting the heavy atom contribution from the observed structure factors of these reflections, one obtains the sign of the light atom contributions for these reflections. Thereafter one can solve the remaining light atom structure by applying equation (2.6) to obtain the signs of the reflections that do not have contributions from the heavy atoms.

The procedure was used to solve the structure of the complex Au[S₂C₂(CN)₂]₂Au[S₂CN(C₄H₉)₂]₂. The space group was found to be P2₁/c, with two formula units per unit cell. The reflections hkl (h = 2n, k + l = 2n) were all very strong and the gold atoms were placed at the (special) position 000, $\frac{1}{2}$ 00, $\frac{1}{2}$ $\frac{1}{2}$ $\frac{1}{2}$ , and 0 $\frac{1}{2}$ $\frac{1}{2}$ . 1337 observed 'strong' reflections (with equal positive contributions from the gold atoms) and 538 observed 'weak' reflections (without any contributions from the gold atoms) were used.

The first step was a calculation of the Wilson plot. The following expression was used:

$\begin{displaymath} \langle{I}\rangle_h = K_L \langle\textstyle\sum^L_i f^2_i \exp(-2B_L \sin^2 {\theta}/{\lambda}^2)\rangle_h + K_H \langle\vert F_H\vert^2 \exp(-2B_H \sin^2 {\theta}/{\lambda}^2)\rangle_h\end{displaymath}$

(2.7)

where $I$ ($= K\vert F_{\mbox{obs}}\vert^2$) is the observed intensity on a relative scale, K = K_L = K_H is the scale factor, $\sum^L$ denotes a summation over all light atoms in the unit cell, F_H is the heavy atom contribution to the structure factor and B_L and B_H are the overall temperature factor parameters of the light and heavy atoms respectively. The average is taken over reflections h within a given sin $\theta$ interval.

For the 'weak' reflections (F_H = 0) the second term in equation (2.7) vanishes and a Wilson plot for these reflections gave the scale factor K_L (1.29) and the value of B_L (3.24 Å²). On substituting these results in equation (2.7) a Wilson plot for the 'strong' reflections gave the scale factor K_H (1.26) and the value of B_H (2.91 Å²). A small difference in K_L and K_H will not affect the following steps.
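For the 'weak' reflections the Wilson plot reduces to a straight-line fit. The sketch below is a simplified illustration: it assumes that the sum of f² can be treated as constant within each sin θ shell, so that ln(⟨I⟩/Σf²) = ln K_L - 2B_L sin²θ/λ², and it uses synthetic shell averages generated from the values K_L = 1.29 and B_L = 3.24 Å² quoted above. The function name `wilson_fit` and the shell values are hypothetical.

```python
import math

def wilson_fit(shells):
    """Fit ln(<I>/sum_f2) = ln K - 2 B s2 by ordinary least squares.

    shells: list of (s2, mean_intensity, sum_f2), s2 = sin^2(theta)/lambda^2.
    Returns (K, B).
    """
    xs = [s2 for s2, _, _ in shells]
    ys = [math.log(mean_i / sum_f2) for _, mean_i, sum_f2 in shells]
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    slope = (sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
             / sum((x - mean_x) ** 2 for x in xs))
    intercept = mean_y - slope * mean_x
    return math.exp(intercept), -slope / 2.0

# Synthetic shell averages (hypothetical sum of f^2 per shell).
true_k, true_b, sum_f2 = 1.29, 3.24, 500.0
shells = [(s2, true_k * sum_f2 * math.exp(-2.0 * true_b * s2), sum_f2)
          for s2 in (0.05, 0.10, 0.15, 0.20, 0.25, 0.30)]

k_fit, b_fit = wilson_fit(shells)
print(round(k_fit, 3), round(b_fit, 3))
```

For the 'strong' reflections the same fit could be applied after subtracting the light atom term computed with the K_L and B_L already found.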

The second step is the calculation of the normalized structure factors E. The formulae normally used for the calculation of E values do not make sense for a structure containing heavy atoms. For the corresponding light atom structure the E values, E_L, are defined by:

$\begin{displaymath} E_L = F_L ({\varepsilon} \sum^L_i f^2_i)^{-1/2} \exp (B_L \sin^2 {\theta}/{\lambda}^2)\end{displaymath}$

(2.8)

where F_L is the light atom contribution to the structure factor and, for space group $P2_1/c$, $\varepsilon$ = 2 for h0l and 0k0 reflections and $\varepsilon$ = 1 for all other reflections.⁶

The 'strong' reflections have positive structure factors and we have $F_L = F_{\mbox{obs}} - F_H$ ; the magnitude and the sign of the E_L value are obtained by equation (2.8). This resulted in 365 signed E_L values, with |E_L| > 1.3. For the 'weak' reflections we have |F_L| = $\vert F_{\mbox{obs}}\vert$ and only the magnitude of the E_L value is obtained. This resulted in 270 reflections with |E_L| > 1.3.
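The calculation of a signed E_L value according to equation (2.8) may be sketched as follows. The function names and all numerical inputs are hypothetical; F_obs is assumed to be already on the absolute scale and to carry the positive sign fixed by the heavy atoms.

```python
import math

def epsilon_p21c(h, k, l):
    """Statistical weight for P2_1/c: 2 for h0l and 0k0, otherwise 1."""
    if k == 0 or (h == 0 and l == 0):
        return 2
    return 1

def light_atom_e(f_obs, f_heavy, hkl, sum_f2_light, b_light, s2):
    """E_L = F_L (eps * sum f^2)^(-1/2) exp(B_L s2), with F_L = F_obs - F_H.

    s2 stands for sin^2(theta)/lambda^2.
    """
    f_light = f_obs - f_heavy
    eps = epsilon_p21c(*hkl)
    return f_light * math.exp(b_light * s2) / math.sqrt(eps * sum_f2_light)

# Hypothetical 'strong' reflection with a known heavy atom contribution.
e_value = light_atom_e(f_obs=150.0, f_heavy=120.0, hkl=(2, 1, 1),
                       sum_f2_light=400.0, b_light=3.24, s2=0.1)
print(round(e_value, 3))
```

A negative result would simply mean that the light atom contribution opposes the heavy atom contribution for that reflection.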

The third step is the application of equation (2.6) to obtain the signs of the 'weak' reflections. When several interactions of the type ( $h + h^{\prime}$ ) = (h) + ( $h^{\prime}$ ) occur for |E_L| > 1.3, where both S_h and $S_{h^{\prime}}$ are known, several predictions of the sign $S_{h+h^{\prime}}$ are obtained by application of (2.6). These predictions should be reasonably consistent before $S_{h+h^{\prime}}$ is considered to be determined and singly occurring interactions should never be trusted. We have followed a procedure similar to the sign correlation procedure. The origin is partly fixed by the choice of the gold atom positions and further determined by assigning arbitrary signs to two 'weak' reflections: 221 (|E_L| = 4.0) and 34 $\overline{8}$ (|E_L| = 2.9). We define the following sets of reflections, all |E_L| > 2.0:

h₁ are 'strong' reflections, hkl (h = 2n, k + l = 2n).
h₂ are the two origin determining choices.
h₃ are the reflections h₁ + h₂ and $h_2 + h^{\prime}_2$ .
h₄ are the reflections h₁ + h₃, h₂ + h₃ and $h_3 + h^{\prime}_3$ .

The application of equation (2.6) to reflections h₁ alone cannot give new signs; together with the reflections h₂, probable signs for 36 reflections h₃ were calculated. Upon entering h₃ in equation (2.6), many reflections take part in the calculations and consequently the sign of one reflection h₄ will often be found from several independent sign relations (2.6). Signs were calculated for 48 reflections h₄; of these the signs of 24 reflections were determined by at least five consistent relations (2.6) and accepted to be correct. Although some of the signs for reflections h₃ may be incorrectly determined, it is highly improbable that all reflections h₃ used for the sign determination of one reflection h₄ are incorrect. The intermediate results for h₃ and the rest of h₄ were rejected.
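The acceptance rule used here, several consistent indications of relation (2.6) before a sign is trusted, can be sketched as a simple voting scheme. This is not the authors' program: the function name `predict_signs`, the toy reflections and the reduced threshold in the example are assumptions, and only the relation S(-h) = S(h) of a centrosymmetric structure is used, not the further symmetry equivalents of P2₁/c.

```python
from collections import defaultdict

def predict_signs(known, candidates, min_votes=5):
    """Predict signs S(h + h') ~ S(h) * S(h') by unanimous voting.

    known: dict mapping (h, k, l) to +1 or -1.
    candidates: reflections whose signs are sought.
    A sign is accepted only if at least min_votes relations agree
    and none disagrees.
    """
    signs = dict(known)
    for hkl, sign in known.items():
        signs[tuple(-i for i in hkl)] = sign  # centrosymmetric: S(-h) = S(h)
    targets = set(candidates)
    votes = defaultdict(list)
    items = list(signs.items())
    for a, (h1, s1) in enumerate(items):
        for h2, s2 in items[a + 1:]:
            total = tuple(p + q for p, q in zip(h1, h2))
            if total in targets:
                votes[total].append(s1 * s2)
    return {hkl: v[0] for hkl, v in votes.items()
            if len(v) >= min_votes and abs(sum(v)) == len(v)}

# Hypothetical toy data; the threshold is lowered to 2 for this example.
known = {(2, 0, 0): 1, (1, 1, 1): -1, (0, 2, 0): 1, (3, -1, 1): -1, (2, 2, 2): 1}
predicted = predict_signs(known, [(3, 1, 1)], min_votes=2)
print(predicted)
```

With the default threshold of five the same toy data would give no accepted sign, which mirrors the rule that singly occurring interactions should not be trusted.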

Continued application of equation (2.6) on 365 'strong' reflections, 2 reflections h₂ and 24 reflections h₄ resulted in the sign determination of 158 more 'weak' reflections with |E_L| > 1.3. A Fourier synthesis revealed the positions of all of the light atoms, except the hydrogen atoms.

The above described procedure may be generalized for heavy atoms on general positions. In this case there also exist reflections with intermediate heavy atom contributions. For these reflections $\vert F_L\vert = \vert\, \vert F_{\mbox{obs}}\vert \pm \vert F_H\vert\, \vert$ and the lowest |F_L| value is taken to avoid incorrect sign indications. In our opinion this procedure is well suited to an automatic solution of structures containing heavy atoms.

3. Steps of structure determination

1. Determine the space group and unit cell of the real structure.

2. Test for systematically strong intensities.

3. Determine the space group of the superposition structure (see examples).

4. Determine the superposition structure ${\rho}_{\mbox{sup}}(x, y, z)$ or at least the heavy atom position(s) in ${\rho}_{\mbox{sup}}(x, y, z)$ .

5. Determine the complementary structure ${\rho}_{\mbox{com}}(x, y, z)$ or the real structure.

5.1 Compute and discuss the Patterson function of the complementary structure, if this corresponds to the systematically weak reflections.

5.2 Apply 'Direct methods' (see above).

5.3 Apply the method of 'linear structure factor equations'.

5.4 Resolve the ambiguity of the superposition structure using chemical knowledge, such as minimum distance between atoms, knowledge of groups, coordination polyhedra etc.
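Step 2 of the list above, the test for systematically strong intensities, can be sketched as a comparison of mean intensities in parity classes. The reflection list, the function name `mean_intensity_ratio` and the selection of tests are hypothetical; a real test would compare the averages within shells of sin θ/λ rather than over all data at once.

```python
def mean_intensity_ratio(reflections, in_class):
    """Ratio of mean intensities inside and outside a parity class.

    reflections: list of ((h, k, l), intensity).
    in_class: function (h, k, l) -> bool selecting the class.
    """
    inside = [i for (h, k, l), i in reflections if in_class(h, k, l)]
    outside = [i for (h, k, l), i in reflections if not in_class(h, k, l)]
    return (sum(inside) / len(inside)) / (sum(outside) / len(outside))

PARITY_TESTS = {
    "l = 2n": lambda h, k, l: l % 2 == 0,
    "h = 2n": lambda h, k, l: h % 2 == 0,
    "k + l = 2n": lambda h, k, l: (k + l) % 2 == 0,
    "h + k + l = 2n": lambda h, k, l: (h + k + l) % 2 == 0,
}

# Hypothetical intensities with strong l = 2n reflections.
reflections = [((1, 0, 0), 90.0), ((2, 1, 2), 110.0), ((0, 1, 1), 8.0),
               ((1, 1, 3), 12.0), ((2, 0, 1), 10.0), ((3, 2, 4), 100.0)]

ratios = {name: mean_intensity_ratio(reflections, test)
          for name, test in PARITY_TESTS.items()}
for name, ratio in ratios.items():
    print(name, round(ratio, 2))
```

A ratio far above one for a class points to the corresponding pseudotranslation. With so few reflections the other ratios are of course not meaningful.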

4. Examples

A. Demissidine hydroiodide⁸

Crystal data: C₂₇H₄₅NO.HI. $\frac{1}{2}$ C₂H₅OH

orthorhombic: P2₁2₁2₁; a = 23.0 Å, b = 7.6 Å, c = 16.0 Å; Z = 4

Observed systematic intensity distribution:

$\begin{displaymath} \langle{I}(hkl)_{l=2n}\rangle \mbox{ strong;} \qquad \langle{I}(hkl)_{l=2n+1}\rangle \mbox{ weak}.\end{displaymath}$

According to the chemical formula and number of molecules per unit cell there are 4 heavy atoms per unit cell, i.e. one per asymmetric unit. Thus the 2 heavy atoms related by a shift of c/2 (on account of systematic intensity distribution) must necessarily belong to the same set of equipoints. This results if and only if the atoms lie on screw dyads parallel to c (see Fig. 5), thus the set of equipoints in P2₁2₁2₁

$\begin{displaymath} \textstyle x, y, z; \quad \frac{1}{2} - x, \overline{y}, \frac{1}{2} + z; \quad \frac{1}{2} + x, \frac{1}{2} - y, \overline{z};\quad \overline{x}, \frac{1}{2} + y, \frac{1}{2} - z\end{displaymath}$

specializes to

$\begin{displaymath} \textstyle (000,00\frac{1}{2}) + \frac{1}{4}, 0, z; \quad \frac{3}{4}, \frac{1}{2}, \overline{z}\end{displaymath}$

With $z^{\prime} = 2z$ this corresponds within the unit cell $a^{\prime} = a$ , $b^{\prime} = b$ , $c^{\prime} = c/2$ of the superposition structure, to the equipoints

$\begin{displaymath} \textstyle \frac{1}{4}, 0, z^{\prime}; \quad \frac{3}{4}, \frac{1}{2}, \overline{z}^{\prime}\end{displaymath}$

Obviously (Fig. 5), each of these two points lies on mirror planes perpendicular to $a^{\prime}$ and $b^{\prime}$, and the two points are related by an n-glide plane perpendicular to $c^{\prime}$ . Thus the space group of the superposition structure is Pmmn.
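The specialization of the general equipoints of P2₁2₁2₁ for x = ¼, y = 0 can be verified with exact fractions. In the sketch below the function name `p212121_equipoints` and the value chosen for z are assumptions for illustration.

```python
from fractions import Fraction as Fr

def p212121_equipoints(x, y, z):
    """General equipoints of P2_1 2_1 2_1, reduced to the unit cell."""
    half = Fr(1, 2)
    points = [
        (x, y, z),
        (half - x, -y, half + z),
        (half + x, half - y, -z),
        (-x, half + y, half - z),
    ]
    return [tuple(c % 1 for c in p) for p in points]

z = Fr(1, 10)  # hypothetical z coordinate of the heavy atom
points = p212121_equipoints(Fr(1, 4), Fr(0), z)
for p in points:
    print(tuple(str(c) for c in p))

# Every point has a partner shifted by c/2 within the same set.
paired = all((px, py, (pz + Fr(1, 2)) % 1) in points for px, py, pz in points)
print(paired)
```

The four points fall into two pairs, (¼, 0, z), (¼, 0, ½ + z) and (¾, ½, -z), (¾, ½, ½ - z), so that the heavy atoms alone repeat with c/2.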

**Figure 5:** Space group symmetry of (a) real structure P2₁2₁2₁: $\diamondsuit$ building unit in y, $\sharp$ in $\overline{y}$ , $\wp$ in 1/2 - y, $\bowtie$ in 1/2 + y; the heavy atoms are marked by circles; (b) superposition structure *Pmmn*, derived from P2₁2₁2₁; the reduction of the weights of all building units in general position (triangles) to a quarter, compared with the weight of the heavy atoms (circles), is symbolized by dashed lines.

The same result could have been obtained by scanning the orthorhombic higher symmetry group for such equipoints. Then the set of special positions (a) 00z, $\frac{1}{2}\frac{1}{2}z$ would be found for Pmmn, which corresponds to the set found, after a shift of the origin by $a^{\prime}$ /4.

The space group for the superposition structure thus obtained may now be tested with the usual space group tests, and indeed, the hk0-reflections with h + k = 2n + 1 are weak (corresponding to the n-glide plane). The superposition structure thus contains for each of 4 symmetry related atoms (x, y, z) etc., of the real structure the following sets of 4 atoms.

$\begin{displaymath} \textstyle (x, y, z), \quad (x, y, \frac{1}{2} + z),\quad (\frac{1}{2} - x, y, z),\quad (\frac{1}{2} - x, y, \frac{1}{2} + z)\end{displaymath}$

i.e. 4 atoms to any atom of the real structure. This superposition structure would be obtained, if the usual heavy atom technique could be applied, and would certainly be difficult to interpret.

The Patterson function gave, however, not only the z-coordinate of the heavy atom but also hinted that it may not lie exactly on the screw dyad, but only approximately so; this was confirmed by the Patterson function of the complementary structure, obtained from the reflections with l = 2n + 1.

This indicated a deviation of x_j from $\frac{1}{4}$ , and this deviation results in heavy atom contributions to the reflections with l = 2n + 1 at high values of h, which even determine their phases.

The iodine parameters were refined and with the resulting phases a first Fourier synthesis of the complementary structure was obtained in space group P2₁2₁2₁.

This result was compared with the known part of the model and thus a part of the structure deduced and used as a starting point for the final determination of the real structure.

B. Piperidino-acet-m-bromo-anilide⁹

Crystal data: C₁₃H₁₇N₂OBr

orthorhombic: Pbca; a = 23.65 Å, b = 12.66 Å, c = 9.37 Å; Z = 8

Observed systematic intensity distribution:

$\begin{displaymath} \langle{I}(hkl)_{h=2n}\rangle \mbox{ strong}; \quad \langle{I}(hkl)_{h=2n+1}\rangle \mbox{ weak}\end{displaymath}$

Space group of ${\rho}_{\mbox{sup}}(x, y, z)$ : Pbcm with lattice parameters

$\begin{displaymath} \textstyle a^{\prime} = \frac{a}{2},\quad b^{\prime} = b,\quad c^{\prime} = c.\end{displaymath}$

The 3-dimensional Patterson function explained the systematic distribution in the intensities by the particular position of the bromine-atom on the a-glide plane with the fractional coordinates x = 0.159, y = 0.193, z = 0.25. With the known position of the bromine atom (refined by least squares methods) the signs of most of the $F_{\mbox{obs}}(hkl)_{h=2n}$ but not of the $F_{\mbox{obs}}(hkl)_{h=2n+1}$ were determined.

With $F_{\mbox{obs}}(hkl)_{h=2n}$ a 3-dimensional Fourier synthesis of the superposition structure was calculated. This ${\rho}_{\mbox{sup}}(x, y, z)$ contains an additional mirror plane perpendicular to c through the bromine atom, which does not exist in the real structure. That is why each maximum in the synthesis has a corresponding reflected one (Fig. 6), but only one maximum of each pair corresponds to an atom in the real structure. In addition many maxima occurred in the Fourier synthesis which do not refer to atoms. Therefore the interpretation of the synthesis by the model of the molecule failed.

**Figure 6:** (y, z)-projection of the superposition structure with the pseudo-mirror-plane.
$\begin{figure} \includegraphics {fig6.ps} \end{figure}$

To preclude the spurious peaks in the Fourier synthesis a spatial minimum function M₄(x, y, z)¹⁰ was derived from the Patterson function by using the known bromine-bromine vectors.

The comparison of the Fourier synthesis and the minimum function revealed which of the mirror-related pairs of peaks could be assigned to atoms. These peaks of the superposition structure are shown in Fig. 7. Its (x, y)-projection is identical with the projection of the real structure. A model of the molecule enabled us to determine the z-coordinates of the atoms by eliminating the ambiguity in the Fourier synthesis.

**Figure 7:** Peaks coinciding in Fourier synthesis and minimum function.
$\begin{figure} \includegraphics {fig7.ps} \end{figure}$

C. Acetamide hemihydrobromide¹¹,¹²

Crystal data: (CH₃CONH₂)₂.HBr

monoclinic: P2₁/c, a = 6.51 Å, b = 8.64 Å, c = 8.24 Å, $\beta$ = 113.1 $^{\circ}$ ; Z = 2

$\begin{displaymath} \langle{I}(hkl)_{k+l=2n}\rangle \mbox{ strong,} \quad \langle{I}(hkl)_{k+l=2n+1}\rangle \mbox{ weak}.\end{displaymath}$

From Z = 2 and P2₁/c it follows that the bromine atoms lie at centres of symmetry and, taken by themselves, form an A-centred lattice. The $F_{\mbox{obs}}(hkl)_{k+l=2n}$ correspond to a superposition structure with the space group A2/m, which has a mirror plane perpendicular to b in addition to the symmetry elements of the real structure.

**Figure 8:** Electron density projection ${\rho} (x, z)$ of acetamide hemihydrobromide.
$\begin{figure} \includegraphics {fig8.ps} \end{figure}$

The (x, z)-projection of the superposition structure (Fig. 8) is identical to the corresponding projection of the real structure. This projection, calculated with $F_{\mbox{obs}}(h0l)$, revealed the position of all atoms. Although the determination of the real structure in 3 dimensions with the help of a model did not seem feasible, due to the poor data available, it was possible to determine approximate y-coordinates of the atoms by means of two independent systems of linear structure factor equations:

$\begin{displaymath} F(h1l)_{l=2n+1}=K\sum_j 4f_j \cos 2{\pi}(hx_j + lz_j) \cos 2{\pi}y_j\end{displaymath}$

and

$\begin{displaymath} F(h1l)_{l=2n}=-K\sum_j 4f_j \sin 2{\pi}(hx_j + lz_j) \sin 2{\pi}y_j\end{displaymath}$

where K is the scaling factor. The expressions $4f_j \cos 2{\pi}(hx_j + lz_j) = a_j$ and $-4f_j \sin 2{\pi}(hx_j + lz_j) = b_j$ may be calculated because x_j and z_j are known, whereas K cos 2 ${\pi}y_j$ = $C_j^{\prime}$ and K sin 2 ${\pi}y_j$ = $S_j^{\prime}$ are the unknown values. From the known position of the heavy atom (bromine) most of the signs of the F(h1l)_l=2n+1 could be determined and a system of equations (4.1) with a twelvefold overdetermination could be set up

$\begin{displaymath} F(h1l)_{l=2n+1} = \sum_j a_j(h1l)C_j^{\prime}.\end{displaymath}$

(4.1)

This system of equations was solved by least squares technique. Because the F(h1l) are on a relative scale the solutions $C_j^{\prime}$ were multiplied with a constant (1/K) so that (1/K) $C_{\mbox{Br}}^{\prime}$ = 1 is valid.

Two values y_j = y_jo and y_j = 1 - y_jo are in keeping with the solutions C_j = (1/K) $C_j^{\prime}$ (y_jo) obtained. To find out which of these two values is correct, equations of the type (4.2):

$\begin{displaymath} F(h1l)_{l=2n} = \sum_j b_j(h1l)S_j^{\prime}\end{displaymath}$

(4.2)

were used. The bromine atoms do not contribute to the F(h1l)_l=2n. The signs of these structure factors were unknown. Therefore the unobserved reflections F(h1l)_l=2n and one strong reflection F(h1l)_l=2n were used for setting up a system of inhomogeneous equations. The S_j = (1/K) $S_j^{\prime}$ were less accurate than the C_j because this system of equations had only a twofold overdetermination. That is why the absolute values of the y-coordinates were calculated from the C_j, but the ambiguity was eliminated by the S_j. The results are shown in Fig. 9. Structure refinement proved these approximate values to be correct.
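The way the two systems complement each other can be written down in a few lines: the magnitude of the y-coordinate is taken from the more accurate C_j, and only the sign of the less accurate S_j is used to choose between y_jo and 1 - y_jo. The function name `y_from_c_and_s` and the numerical values are hypothetical.

```python
import math

def y_from_c_and_s(c_value, s_value):
    """Coordinate y from C = cos(2 pi y); the sign of S = sin(2 pi y)
    selects between y0 and 1 - y0."""
    c_value = max(-1.0, min(1.0, c_value))   # guard against noise beyond +-1
    y0 = math.acos(c_value) / (2.0 * math.pi)  # 0 <= y0 <= 1/2
    return y0 if s_value >= 0.0 else 1.0 - y0

# Hypothetical atom at y = 0.68: accurate C, only roughly determined S.
c_exact = math.cos(2.0 * math.pi * 0.68)
y_resolved = y_from_c_and_s(c_exact, s_value=-0.3)
print(round(y_resolved, 4))
```

Only the sign of the S value enters, so even a poorly determined S_j is sufficient as long as it is clearly different from zero.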

**Figure 9:** The average structure ${\rho}(yz)$ of acetamide hemihydrobromide calculated only with $F_{\mbox{obs}}(0kl)$ for k + l = 2n. Atomic positions obtained with the help of *SFE* are marked by crosses, the refined positions are marked by squares.
$\begin{figure} \includegraphics {fig9.ps} \end{figure}$

D. ${\alpha}$ -Calcium tetraborate hydrate¹³

Crystal data: CaB₂O₄.4H₂O

monoclinic: Pc or P2/c, a = 5.86 Å, b = 6.93 Å, c = 7.78 Å, $\beta$ = 94 $^{\circ}$ ; Z = 2

Observed systematic intensity distribution:

$\begin{displaymath} \langle{I}(hkl)_{l=2n}\rangle \mbox{ strong}, \quad \langle{I}(hkl)_{l=2n+1}\rangle \mbox{ weak}\end{displaymath}$

The intensity statistic of Howells, Phillips and Rogers¹⁴ using the I(hkl) showed that the real structure has the centrosymmetric space group P2/c.

The Patterson function showed in agreement with Z = 2 and P2/c that the calcium atom occupies a special position on the twofold rotation axis with parameters $x_{\mbox{Ca}}$ = 0, $z_{\mbox{Ca}}$ = $\frac{1}{4}$ and $y_{\mbox{Ca}}$ approximately zero. This position is near the c-glide plane and thus explains why the reflections I(hkl)_l=2n+1 are systematically weak.

On the other hand the calcium atom determined most of the signs of the F(hk0). The Fourier projection ${\rho}(x, y)$ gave the positions of the oxygen and boron atoms. Because the signs of the F(hkl) with l = 2n + 1 were not determined by the contribution of the calcium atom the z-parameters of the atoms could not be derived from a Fourier synthesis based on the contributions of the calcium atom to the sign of the F(hkl). But with the SFE-method the approximate z-coordinates were easily obtained.

From the structure factor formula follows

Two systems of equations were set up. For the first system F(hk1) and for the second F(hk2) were used. In each case the unobserved structure factors and one strong structure factor with arbitrary sign were used. These systems of equations gave approximate values for C^(L)_j and S^(L)_j, from which the F_c(hkl) were calculated.

By comparing the F_c(hkl) with the F_o(hkl) the signs of more structure factors could be determined. The corresponding equations were added to the previous systems of equations. In this way the overdetermination of the systems of equations was increased and the accuracy of the results improved. The final results obtained after several cycles are listed in the Table. The last column contains the refined parameter for comparison.

Atom	C⁽¹⁾_j	S⁽¹⁾_j	C⁽²⁾_j	S⁽²⁾_j	z_j	z_j refined
Ca	-	1.00	-1.000	-	0.250	0.25
O₁	-	0.20	0.872	-	0.036	0.0359
O₂	1.00	0.51	0.703	0.756	0.070	0.0588
O₃	0.68	0.44	0.111	0.923	0.106	0.1042
O₄	-0.66	-0.74	-0.338	0.680	0.842	0.8193

E. Dimethylaminomethylpinene¹²,¹⁵

Crystal data: C₁₃H₂₄NBr

monoclinic: P2₁, a = 11.37 Å, b = 8.62 Å, c = 7.48 Å, $\beta$ = 97.4 $^{\circ}$ ; Z = 2

The x- and z-parameters of the bromine atom were determined from the Patterson synthesis and refined by Fourier methods. The y coordinate was chosen arbitrarily as $y_{\mbox{Br}} = \frac{1}{4}$ . The 3-dimensional Fourier synthesis based on the phases of the $F_{\mbox{obs}}(hkl)$ derived from the bromine contributions is a superposition structure with the space group P2₁/m with an additional mirror plane at $y = \frac{1}{4}$ . For the calculation of this synthesis only F_o(hkl) with 0 ≤ k ≤ 4 were available because the crystals were very small. The Fourier synthesis revealed the positions of all non-hydrogen atoms, most of them resolved in the x and z directions. Nearly all these atoms, however, were located so close to the pseudo mirror plane that the peaks corresponding to one atom and its mirror image were not separated but formed an elliptical maximum with its peak on the mirror plane (Fig. 10).

**Figure 10:** Comparison of the composite three dimensional electron density projected along 001 (only one full or dashed contour of the same height at arbitrary level is drawn) with the atomic positions obtained by SFE (results from F(h2l) are marked by open circles, from F(h3l) by open squares, average values solid) and with the positions obtained by least-squares refinement, marked by crosses.
$\begin{figure} \includegraphics {fig10.ps} \end{figure}$

The main problem of the structure determination was to determine the small deviations of the light atoms from this pseudo mirror plane.

Analysis of the peak shape resulted in rather inaccurate values of the deviations from the mirror plane. Better values were obtained by the SFE method. To start with, the positions of all atoms in (x, z)-projection were refined by difference Fourier synthesis to an R value of 0.16. Using the formulae

$\begin{displaymath} A_o(h2l) = K_1 \sum_j 2f_j \cos 2 {\pi}(hx_j + lz_j) \cos 2{\pi}2y_j\end{displaymath}$

and

$\begin{displaymath} A_o(h3l) = K_2 \sum_j - 2f_j \sin 2{\pi}(hx_j + lz_j) \sin 2{\pi}3y_j\end{displaymath}$

where K₁ and K₂ are the scaling factors, two systems of equations were obtained taking A_o(hkl) equal to F_o(hkl). This may be done without creating large errors since the B_o(hkl), to which the bromine atoms do not contribute, are expected to be small. In these systems of equations only those A_o(hkl) were used whose signs could be deduced from the contributions $F_{\mbox{Br}}(hkl)$ of the bromine atoms.

With the abbreviations

the systems of the equations have the form

The structure factors F_o(h2l) and F_o(h3l) are on a relative scale. The scaling factors K₁ and K₂ were given such values that (1/K₁) $C_{\mbox{Br}}^{(2)}$ = (1/K₂) $S_{\mbox{Br}}^{(3)}$ = 1. Four values for the coordinate y_j of any atom are in keeping with

$\begin{displaymath} \textstyle C_j^{(2)} = (1/K_1)C_j^{(2)}(y_{jo}) \quad \mbox{namely} \quad y_j = {\pm} y_{jo}, y_j = {\pm}y_{jo} + \frac{1}{2}.\end{displaymath}$

Two of these values for any atom could be excluded by comparison with the Fourier synthesis of the superposition structure (see above). From the two remaining values one could be precluded for most of the atoms by using a model of the molecule (Fig. 10). The accuracy of the y coordinates obtained from the C⁽²⁾_j was improved using the results obtained from S⁽³⁾_j. The coordinates thus obtained were sufficiently accurate for a starting set for a least squares refinement of the real structure.
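The fourfold ambiguity left by C⁽²⁾_j alone can be enumerated directly. The sketch below assumes that the scaled value equals cos 4πy_j; the function name `y_candidates_from_c2` and the test value are hypothetical.

```python
import math

def y_candidates_from_c2(c2):
    """All y in [0, 1) compatible with cos(4 pi y) = c2."""
    c2 = max(-1.0, min(1.0, c2))            # guard against noise beyond +-1
    y0 = math.acos(c2) / (4.0 * math.pi)    # 0 <= y0 <= 1/4
    values = (y0, -y0, y0 + 0.5, -y0 + 0.5)
    return sorted({round(v % 1.0, 6) for v in values})

# Hypothetical atom slightly off the pseudo mirror plane at y = 1/4.
c2_value = math.cos(4.0 * math.pi * 0.27)
candidates = y_candidates_from_c2(c2_value)
print(candidates)
```

For an atom at y = 0.27 the candidates are 0.23, 0.27, 0.73 and 0.77. The superposition structure shows on which side of the cell the atom lies, and the remaining choice between the two values on either side of the pseudo mirror plane is the one left to the molecular model.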

References

1. K. Dornberger-Schiff, Kristallografija, 6 (1961), 859.

2. L. Kutschabsky, Mber. Dt. Akad. Wiss., 7 (1965), 95, 509.

3. L. Kutschabsky and E. Höhne, Acta Cryst., 19 (1965), 747.

4. P. T. Beurskens and J. H. Noordik, Acta Cryst., A27 (1971), 187.

5. S. Parthasarathy, Z. Kristallogr., 123 (1966), 27.

6. H. Hauptman and J. Karle, Solution of the Phase Problem. I. The Centrosymmetric Crystal, A.C.A. Monograph No. 3 (1953).

7. P. T. Beurskens, Technical Report of Sign Correlation by the Sayre Equation, The Crystallography Laboratory, Univ. of Pittsburgh, Pennsylvania (1963).

8. E. Höhne, Journ. prakt. Chem., 314 (1972), 371.

9. L. Kutschabsky, P. Leibnitz and J. P. Wenzel, Kristall und Technik, 9 (1974), 605.

10. M. J. Buerger, Vector Space and its application to Crystal Structure Analysis, Wiley, 1959.

11. G. Reck, Mber. Dt. Akad. Wiss., 9 (1967), 505.

12. L. Kutschabsky, S. Kulpe and G. Reck, Acta Cryst., A27 (1971), 29.

13. D. Zeigan and L. Kutschabsky, Mber. Dt. Akad. Wiss., 7 (1965), 876.

14. E. R. Howells, D. C. Phillips and D. Rogers, Acta Cryst., 3 (1950), 210.

15. L. Kutschabsky and G. Reck, Journ. prakt. Chem., 312 (1970), 896.
