The Dupuy Institute

Excellence in Historical Research and Analysis

Conference:HAAC 2026

The Dupuy Institute

Excellence in Historical Research and Analysis

Tag David Rowland

Military History and Validation of Combat Models

Shawn Woodford
October 12, 2017
7 Comments

Military History and Validation of Combat Models

A Presentation at MORS Mini-Symposium on Validation, 16 Oct 1990

By Trevor N. Dupuy

In the operations research community there is some confusion as to the respective meanings of the words “validation” and “verification.” My definition of validation is as follows:

“To conﬁrm or prove that the output or outputs of a model are consistent with the real-world functioning or operation of the process, procedure, or activity which the model is intended to represent or replicate.”

In this paper the word “validation” with respect to combat models is assumed to mean assurance that a model realistically and reliably represents the real world of combat. Or, in other words, given a set of inputs which reﬂect the anticipated forces and weapons in a combat encounter between two opponents under a given set of circumstances, the model is validated if we can demonstrate that its outputs are likely to represent what would actually happen in a real-world encounter between these forces under those circumstances

Thus, in this paper, the word “validation” has nothing to do with the correctness of computer code, or the apparent internal consistency or logic of relationships of model components, or with the soundness of the mathematical relationships or algorithms, or with satisfying the military judgment or experience of one individual.

True validation of combat models is not possible without testing them against modern historical combat experience. And so, in my opinion, a model is validated only when it will consistently replicate a number of military history battle outcomes in terms of: (a) Success-failure; (b) Attrition rates; and (c) Advance rates.

“Why,” you may ask, “use imprecise, doubtful, and outdated history to validate a modem, scientific process? Field tests, experiments, and field exercises can provide data that is often instrumented, and certainly more reliable than any historical data.”

I recognize that military history is imprecise; it is only an approximate, often biased and/or distorted, and frequently inconsistent reﬂection of what actually happened on historical battleﬁelds. Records are contradictory. I also recognize that there is an element of chance or randomness in human combat which can produce different results in otherwise apparently identical circumstances. I further recognize that history is retrospective, telling us only what has happened in the past. It cannot predict, if only because combat in the future will be fought with different weapons and equipment than were used in historical combat.

Despite these undoubted problems, military history provides more, and more accurate information about the real world of combat, and how human beings behave and perform under varying circumstances of combat, than is possible to derive or compile from arty other source. Despite some discrepancies, patterns are unmistakable and consistent. There is always a logical explanation for any individual deviations from the patterns. Historical examples that are inconsistent, or that are counter-intuitive, must be viewed with suspicion as possibly being poor or false history.

Of course absolute prediction of a future event is practically impossible, although not necessarily so theoretically. Any speculations which we make from tests or experiments must have some basis in terms of projections from past experience.

Training or demonstration exercises, proving ground tests, ﬁeld experiments, all lack the one most pervasive and most important component of combat: Fear in a lethal environment. There is no way in peacetime, or non-battleﬁeld, exercises, test, or experiments to be sure that the results are consistent with what would have been the behavior or performance of individuals or units or formations facing hostile ﬁrepower on a real battleﬁeld.

We know from the writings of the ancients (for instance Sun Tze—pronounced Sun Dzuh—and Thucydides) that have survived to this day that human nature has not changed since the dawn of history. The human factor the way in which humans respond to stimuli or circumstances is the most important basis for speculation and prediction. What about the “scientific” approach of those who insist that we cart have no conﬁdence in the accuracy or reliability of historical data, that it is therefore unscientific, and therefore that it should be ignored? These people insist that only “scientific” data should be used in modeling.

In fact, every model is based upon fundamental assumptions that are intuitive and unprovable. The ﬁrst step in the creation of a model is a step away from scientific reality in seeking a basis for an unreal representation of a real phenomenon. I have shown that the unreality is perpetuated when we use other imitations of reality as the basis for representing reality. History is less than perfect, but to ignore it, and to use only data that is bound to be wrong, assures that we will not be able to represent human behavior in real combat.

At the risk of repetition, and even of protesting too much, let me assure you that I am well aware of the shortcomings of military history:

The record which is available to us, which is history, only approximately reﬂects what actually happened. It is incomplete. It is often biased, it is often distorted. Even when it is accurate, it may be reﬂecting chance rather than normal processes. It is neither precise nor consistent. But, it provides more, and more accurate, information on the real world of battle than is available from the most thoroughly documented ﬁeld exercises, proving ground less, or laboratory or ﬁeld experiments.

Military history is imperfect. At best it reﬂects the actions and interactions of unpredictable human beings. We must always realize that a single historical example can be misleading for either of two reasons: (1) The data may be inaccurate, or (2) The data may be accurate, but untypical.

Nevertheless, history is indispensable. I repeat that the most pervasive characteristic of combat is fear in a lethal environment. For all of its imperfections, military history and only military history represents what happens under the environmental condition of fear.

Unfortunately, and somewhat unfairly, the reported ﬁndings of S.L.A. Marshall about human behavior in combat, which he reported in Men Against Fire, have been recently discounted by revisionist historians who assert that he never could have physically performed the research on which the book’s ﬁndings were supposedly based. This has raised doubts about Marshall’s assertion that 85% of infantry soldiers didn’t ﬁre their weapons in combat in World War ll. That dramatic and surprising assertion was ﬁrst challenged in a New Zealand study which found, on the basis of painstaking interviews, that most New Zealanders ﬁred their weapons in combat. Thus, either Americans were different from New Zealanders, or Marshall was wrong. And now American historians have demonstrated that Marshall had had neither the time nor the opportunity to conduct his battlefield interviews which he claimed were the basis for his ﬁndings.

I knew Marshall, moderately well. I was fully as aware of his weaknesses as of his strengths. He was not a historian. I deplored the imprecision and lack of documentation in Men Against Fire. But the revisionist historians have underestimated the shrewd journalistic assessment capability of “SLAM” Marshall. His observations may not have been scientifically precise, but they were generally sound, and his assessment has been shared by many American infantry officers whose judgements l also respect. As to the New Zealand study, how many people will, after the war, admit that they didn’t ﬁre their weapons?

Perhaps most important, however, in judging the assessments of SLAM Marshall, is a recent study by a highly-respected British operations research analyst, David Rowland. Using impeccable OR methods Rowland has demonstrated that Marshall’s assessment of the inefficient performance, or non-performance, of most soldiers in combat was essentially correct. An unclassified version of Rowland’s study, “Assessments of Combat Degradation,” appeared in the June 1986 issue of the Royal United Services Institution Journal.

Rowland was led to his investigations by the fact that soldier performance in ﬁeld training exercises, using the British version of MILES technology, was not consistent with historical experience. Even after allowances for degradation from theoretical proving ground capability of weapons, defensive rifle ﬁre almost invariably stopped any attack in these ﬁeld trials. But history showed that attacks were often in fact, usually successful. He therefore began a study in which he made both imaginative and scientific use of historical data from over 100 small unit battles in the Boer War and the two World Wars. He demonstrated that when troops are under ﬁre in actual combat, there is an additional degradation of performance by a factor ranging between 10 and 7. A degradation virtually of an order of magnitude! And this, mind you, on top of a comparable built-in degradation to allow for the difference between ﬁeld conditions and proving ground conditions.

Not only does Rowland‘s study corroborate SLAM Marshall’s observations, it showed conclusively that ﬁeld exercises, training competitions and demonstrations, give results so different from real battleﬁeld performance as to render them useless for validation purposes.

Which brings us back to military history. For all of the imprecision, internal contradictions, and inaccuracies inherent in historical data, at worst the deviations are generally far less than a factor of 2.0. This is at least four times more reliable than ﬁeld test or exercise results.

I do not believe that history can ever repeat itself. The conditions of an event at one time can never be precisely duplicated later. But, bolstered by the Rowland study, I am conﬁdent that history paraphrases itself.

If large bodies of historical data are compiled, the patterns are clear and unmistakable, even if slightly fuzzy around the edges. Behavior in accordance with this pattern is therefore typical. As we have already agreed, sometimes behavior can be different from the pattern, but we know that it is untypical, and we can then seek for the reason, which invariably can be discovered.

This permits what l call an actuarial approach to data analysis. We can never predict precisely what will happen under any circumstances. But the actuarial approach, with ample data, provides conﬁdence that the patterns reveal what is to happen under those circumstances, even if the actual results in individual instances vary to some extent from this “norm” (to use the Soviet military historical expression.).

It is relatively easy to take into account the differences in performance resulting from new weapons and equipment. The characteristics of the historical weapons and the current (or projected) weapons can be readily compared, and adjustments made accordingly in the validation procedure.

In the early 1960s an effort was made at SHAPE Headquarters to test the ATLAS Model against World War II data for the German invasion of Western Europe in May, 1940. The ﬁrst excursion had the Allies ending up on the Rhine River. This was apparently quite reasonable: the Allies substantially outnumbered the Germans, they had more tanks, and their tanks were better. However, despite these Allied advantages, the actual events in 1940 had not matched what ATLAS was now predicting. So the analysts did a little “ﬁne tuning,” (a splendid term for fudging). Alter the so-called adjustments, they tried again, and ran another excursion. This time the model had the Allies ending up in Berlin. The analysts (may the Lord forgive them!) were quite satisfied with the ability of ATLAS to represent modem combat. (Or at least they said so.) Their official conclusion was that the historical example was worthless, since weapons and equipment had changed so much in the preceding 20 years!

As I demonstrated in my book, Options of Command, the problem was that the model was unable to represent the German strategy, or to reﬂect the relative combat effectiveness of the opponents. The analysts should have reached a different conclusion. ATLAS had failed validation because a model that cannot with reasonable faithfulness and consistency replicate historical combat experience, certainly will be unable validly to reﬂect current or future combat.

How then, do we account for what l have said about the fuzziness of patterns, and the fact that individual historical examples may not ﬁt the patterns? I will give you my rules of thumb:

The battle outcome should reﬂect historical success-failure experience about four times out of five.
For attrition rates, the model average of ﬁve historical scenarios should be consistent with the historical average within a factor of about 1.5.
For the advance rates, the model average of ﬁve historical scenarios should be consistent with the historical average within a factor of about 1.5.

Just as the heavens are the laboratory of the astronomer, so military history is the laboratory of the soldier and the military operations research analyst. The scientific basis for both astronomy and military science is the recording of the movements and relationships of bodies, and then analysis of those movements. (In the one case the bodies are heavenly, in the other they are very terrestrial.)

I repeat: Military history is the laboratory of the soldier. Failure of the analyst to use this laboratory will doom him to live with the scientific equivalent of Ptolomean astronomy, whereas he could use the evidence available in his laboratory to progress to the military science equivalent of Copernican astronomy.

Human Factors In Warfare: Fatigue

Tom Lea, “The 2,000 Yard Stare” 1944 [Oil on canvas, 36 x 28 Life Collection of Art WWII, U.S. Army Center of Military History, Fort Belvoir, Virginia]

That idea that fatigue is a human factor in combat seems relatively uncontroversial. Military history is replete with examples of how the limits of human physical and mental endurance have affected the character of fighting and the outcome of battles. Perhaps the most salient aspect of military training is preparing soldiers to deal with the rigors of warfare.

Trevor Dupuy was aware that fatigue has a degrading effect on the effectiveness of troops in combat, but he never was able to study the topic specifically himself. He was aware of other examinations of historical experience that were relevant to the issue.

The effectiveness of a military force declines steadily every day that it is engaged in sustained combat. This is an indication that fear has a physical effect on human beings equitable with severe exertion. S.L.A. Marshall documented this extremely well in a report that he wrote a few years before he died. I shall shortly have more to say about S.L.A. Marshall…

An approximate value for the daily effect of fatigue upon the effectiveness of weapons employment emerged from a HERO study several years ago. There is no question that fatigue has a comparable degrading effect upon the ability of a force to advance. I know of no research to ascertain that effect. Until such research is performed, I have arbitrarily assumed that the degrading effect of fatigue upon advance rates is the same as its degrading effect upon weapons effectiveness. To those who might be shocked at such an assumption, my response is: We know there is an effect; it is better to use a crude approximation of that effect than to ignore it…

During World War II when Colonel S.L.A. Marshall was the Chief Historian of the US European Theater of Operations, he undertook a number of interviews of units just after they had been in combat. After the war, in his book Men Against Fire, Marshall asserted that his interviews revealed that only 15% of US infantry soldiers fired their small arms weapons in combat. This revelation created something of a sensation at the time.

It has since been demonstrated that Marshall did not really have solid, scientific data for his assertion. But those who criticize Marshall for unscholarly, unscientific work should realize that in private life he was an exceptionally good newspaper reporter. His conclusions, based upon his observations, may have been largely intuitive, but I am convinced that they were generally, if not specifically, sound…

One of the few examples of the use of military history in the West in recent years was an important study done at the British Defence Operational Analysis Establishment (DOAE) by David Rowland. An unclassified condensation of that study was published in the June 1986 issue of the Journal of the Royal United Services Institution (RUSI). The article, “Assessments of Combat Degradation,” demonstrates conclusively that, in historical combat, small arms weapons have had only one-seventh to one-tenth of their theoretical effectiveness. Rowland does not attempt to say why this is so, but it is interesting that his value of one-seventh is very close to the S. L. A. Marshall 15% figure. Both values translate into casualty effects very similar to those that have emerged from my own research.

The intent of this post is not to rehash the debate on Marshall. As Dupuy noted above, even if Marshall’s conclusions were not based on empirical evidence, his observations on combat were nevertheless on to something important. (Details on the Marshall debate can be easily found with a Google search. A brief discussion took place on the old TDI Forum in 2007.)

David Rowland also presented a paper on the same topic Dupuy referenced above at the Military Operations Research Society (MORS) MORIMOC II conference in 1989, “Assessment of Combat Performance With Small Arms” He later published a book detailing his research on the subject in 2006, The Stress of Battle: Quantifying Human Performance in Combat, which is very much worth tracking down and reading.

Dupuy provided a basic version of his theoretical combat exhaustion methodology on pages 223-224 in Numbers, Predictions and War: Using History to Evaluate Combat Factors and Predict the Outcome of Battles (Indianapolis; New York: The Bobbs-Merrill Co., 1979).

Rules For Exhaustion Rates, 20th Century*

The exhaustion factor (ex) of a fresh unit is 1.0; this is the maximum ex value.
At the conclusion of an engagement, a new ex factor will be calculated for each side.
A unit in normal offensive or defensive combat has its ex factor reduced by .05 for each consecutive day of combat; the ex factor cannot be less than 0.5.
An attacking unit opposed by delaying tactics has its ex factor reduced by 0.05 per day.
A defending unit in delay posture neither loses nor gains in its ex factor.
A withdrawing unit, not seriously engaged, has its ex factor augmented at the rate of 0.05 per day.
An advancing unit in pursuit, and not seriously delayed, neither loses nor gains in its ex factor.
For a unit in reserve, or in non-active posture, an exhaustion factor of less than 1.0 is augmented at the rate of .1 per day.
When a unit in combat, or recently in combat, is reinforced by a unit at least half of its size (in numbers of men), it adopts the ex factor of the reinforcing unit or—if the ex factor of the reinforcing unit is the same or lower than that of the reinforced—both adopt an ex factor 0.1 higher than that of the reinforced unit at the time of reinforcement, save that an ex factor cannot be greater than 1.0.
When a unit in combat, or recently in combat, is reinforced by a unit less than half its size, but not less than one quarter its size, augmentations or modiﬁcations of ex factors will be 0.5 times those provided for in paragraph 9, above. When the reinforcing unit is less than one-quarter the size of the reinforced unit, but not less than one-tenth its size, augmentations or modiﬁcations of ex factors will be 0.25 times those provided for in paragraph 9, above.

* Approximate reflection of preliminary QJM assessment of effects of casualty and fatigue, WWII engagements. These rates are for division or smaller size; for corps and larger units exhaustion rates are calculated for component divisions and smaller separate units.

EXAMPLES OF APPLICATION

A division in continuous offensive combat for ﬁve days stays in the line in inactive posture for two days, then resumes the offensive:
1. Combat exhaustion effect: 1 – (5 x .05) = 0.75;
2. Recuperation effect: 75 + (2 x .l) = 0.95.
A division in defensive posture for ﬁfteen days is ordered to undertake a counterattack:
1. Combat exhaustion effect: 1 – (15 x .05) =0.25; this is below the minimum ex factor, which therefore applies: 0.5;
2. Recuperation effect: None; ex factor is 0.5.
A division in offensive posture for three days is reinforced by two fresh brigades:
1. Combat exhaustion effect: 1 – (3 x .05) = 0.85;
2. Reinforcement effect: Augmentation from 0.85 to 1.0.
A division in offensive posture for three days is reinforced by one fresh brigade:
1. Combat exhaustion effect: 1 – (3 x .05) = 0.85;
2. Reinforcement effect: 0.5 x augmentation from 0.85 to 1 = 0.93.