Finalize index.html discussion.

2025-09-29 01:50:54 +02:00 · 2013-05-28 09:42:04 -04:00
parent 01eba9b491
commit 9c25d54d38
2 changed files with 97 additions and 49 deletions
--- a/doc/index.html
+++ b/doc/index.html
@@ -39,6 +39,11 @@
      <a href="#Abstract">Abstract</a><br>
      <a href="#Introduction-to-endianness">Introduction to endianness</a><br>
      <a href="#Introduction">Introduction to the Boost.Endian library</a><br>
      <a href="#Choosing">Choosing approaches</a><br>
      <a href="#Performance">Performance</a><br>
 &nbsp;&nbsp;&nbsp; <a href="#Timings">Timings</a><br>
 &nbsp;&nbsp;&nbsp; <a href="#Conclusions">Conclusions</a><br>
      <a href="#FAQ">FAQ</a><br>
      <a href="#Acknowledgements">Acknowledgements</a></td>
  </tr>
  <tr>
@@ -51,18 +56,19 @@
      <a href="../../../boost/endian/types.hpp">&lt;boost/endian/types.hpp&gt;</a></td>
  </tr>
 </table>
 <h2><a name="Abstract">Abstract</a></h2>
-<p>Boost.Endian provides two facilities to manipulate the byte ordering of integers.</p>
+<p>Boost.Endian provides facilities to manipulate the endianness of integers, 
 floating point, and user defined data.</p>
 <ul>
-  <li>The primary use case is binary I/O of integers for portable exchange with 
+  <li>The primary use case is binary I/O for portable data exchange with 
  other systems, via either file or network transmission.<br>
 &nbsp;</li>
-  <li>A secondary use case is minimizing storage size via integers of sizes 
+  <li>A secondary use case is minimizing storage size via sizes and/or 
-  and/or alignments not supported by the built-in types. Integers 1, 2, 3, 4, 5, 
+  alignments not supported by the built-in types.<br>
  6, 7, and 8 bytes in length are supported.<br>
 &nbsp;</li>
-  <li>Two distinct approaches to byte ordering are provided. Each approach has a 
+  <li>Two distinct approaches to dealing with endianness are provided. Each approach has a 
  long history of successful use, and each approach has use cases where it is 
  superior to the other approach.</li>
 </ul>
@@ -87,9 +93,9 @@ output file produces:</p>
 <blockquote>
  <p><code>0102</code></p>
 </blockquote>
-<p>What's happening here is that Intel CPU's order the bytes of an integer with 
+<p>What's happening here is that Intel CPUs order the bytes of an integer with 
-the least-significant byte first, while SPARC CPU's place the most-significant 
+the least-significant byte first, while SPARC CPUs place the most-significant 
-byte first. Some CPU's, such as the PowerPC, allow the operating system to 
+byte first. Some CPUs, such as the PowerPC, allow the operating system to 
 choose which ordering applies.</p>
 <p><a name="definition"></a>Most-significant-byte-first ordering is traditionally called &quot;big endian&quot; 
 ordering and the least-significant-byte-first is traditionally called 
@@ -102,29 +108,33 @@ at different ends.</p>
 <p>See the Wikipedia's
 <a href="http://en.wikipedia.org/wiki/Endianness">Endianness</a> article for an 
 extensive discussion of endianness.</p>
-<p>Except for reading a core dump on little-endian systems, most programmers can 
+<p>Most programmers can ignore endianness, except perhaps for reading a core 
-ignore endianness. But when exchanging binary integers and binary floating point 
+dump on little-endian systems. Programmers have to deal with endianness in their 
-values between computer systems with differing endianness, whether by physical file transfer or over a network, programmers have to deal with endianness 
+code when exchanging binary integers and binary floating point 
-in their code. </p>
+values between computer systems with differing endianness, whether by physical file transfer or over a network, 
 . </p>
 <h2><a name="Introduction">Introduction</a> to the Boost.Endian library</h2>
-<p>The Boost.Endian library provides two facilities for dealing with endianness.</p>
+<p>The Boost.Endian library provides two different approaches to dealing with 
 integer endianness. Both approaches support integers, floating point types 
 except&nbsp; <code>long double</code>, and user defined types (UDTs).</p>
-<p>The library provides two approaches to dealing with integer endianness:</p>
+<p>Each approach has a long history of successful use, and each approach has use 
 cases where it is superior to the other approach.</p>
 <blockquote>
 <p><b><a href="conversion.html">Endian conversions</a> -</b> The application 
 uses the built-in integer and floating point types, and calls the provided 
 conversion functions to convert byte ordering as needed. Both mutating and 
 non-mutating conversions are supplied, and each comes in unconditional and 
 conditional variants. Type <code>long double</code> is not currently supported.</p>
 <p><b><a href="types.html">Endian types</a> -</b> The application uses the provided endian types 
 which mimic the 
-built-in integer types. For example, <code>big32_t</code> or <code>little64_t</code>. Types with lengths of 
+built-in integer types. For example, <code>big_int32_t</code> or <code>little_float64_t</code>. 
-1-8 bytes are supported, rather than just&nbsp; 2, 4, and 8 bytes. There are no alignment 
+Integer types with lengths of 1 through 8 bytes are supported, rather than just 
-requirements. Floating point types are not currently supported.</p>
+2, 4, and 8 byte integers. The types may be aligned or unaligned.</p>
 <p><b><a href="conversion.html">Endian conversion functions</a> -</b> The 
 application uses the built-in integer and floating point types, and calls the 
 provided conversion functions to convert byte ordering as needed. Both mutating 
 and non-mutating conversions are supplied, and each comes in unconditional and 
 conditional variants.</p>
 </blockquote>
@@ -133,16 +143,18 @@ requirements. Floating point types are not currently supported.</p>
 <h2><a name="Choosing">Choosing</a> between endian types and endian 
 conversion functions</h2>
-<p>Which approach is best for dealing with endianness depends on  
+<p>Which approach is better for dealing with endianness depends on  
-application concerns.</p>
+application needs.</p>
 <table border="1" cellpadding="5" cellspacing="0" style="border-collapse: collapse" bordercolor="#111111">
  <tr>
    <th colspan="2">Needs that favor one approach over the other</th>
  </tr>
  <tr>
-    <th width="50%"><b><a href="types.html">Endian types</a></b></th>
+    <th width="50%"><b><a href="types.html">Endian types</a> are better with 
-    <th><b><a href="conversion.html">Endian conversion functions</a></b></th>
+    these needs</b></th>
    <th><b><a href="conversion.html">Endian conversion functions</a> are better 
    with these needs</b></th>
  </tr>
  <tr>
    <td valign="top">
@@ -188,6 +200,10 @@ application concerns.</p>
 <p>Consider this problem:</p>
 <table border="1" cellpadding="5" cellspacing="0" style="border-collapse: collapse" bordercolor="#111111">
  <tr>
    <td colspan="2">
    <p align="center"><i><b><a name="Example-1">Example 1</a></b></i></td>
  </tr>
  <tr>
    <td colspan="2"><b><i>Add 100 to a big endian value in a file, then write the 
    result to a file</i> </b> </td>
@@ -223,13 +239,18 @@ big_endian(x);
  </tr>
 </table>
-<p><b>There will be no performance difference between the two approaches, 
+<p>There will be no performance difference between the two approaches, 
-regardless of the native endianness of the machine.</b> Optimizing compilers will likely 
+regardless of the native endianness of the machine. Optimizing compilers will likely 
-generate exactly the same code for both.</p>
+generate exactly the same code for both. That conclusion was confirmed by 
 studying the generated assembly code.</p>
 <p>Now consider a slightly different problem:&nbsp; </p>
 <table border="1" cellpadding="5" cellspacing="0" style="border-collapse: collapse" bordercolor="#111111">
  <tr>
    <td colspan="2">
    <p align="center"><b><i><a name="Example-2">Example 2</a></i></b></td>
  </tr>
  <tr>
    <td colspan="2"><i><b>Add a million values to a big endian value in a file, then write the 
    result to a file </b></i> </td>
@@ -268,25 +289,14 @@ big_endian(x);
  </tr>
 </table>
-<p><b>There may or may not be a considerable performance difference, depending 
+<h3><a name="Timings">Timings</a> for Example 2 (conversion functions hoisted 
-on the endianness of the machine. </b>If machine endianness differs from the 
+out of loop)</h3>
 desired endianness, the Endian type approach must do the byte reversal a million 
 times while the Endian conversion approach only does the reversal once. But if 
 the endianness is the same, there is no conversion with either approach and no 
 conversion code is generated for typical release builds.</p>
 <p><b>If compiler byte swap intrinsics are not available, any timing differences 
 will be magnified.</b> Byte swap intrinsics are not available on some older 
 compilers and on some machine architectures, such as pre-486 X86 CPUs.</p>
 <p><b>Unaligned types are much slower that aligned types, regardless of 
 endianness considerations.</b> Instead of single instruction register loads and 
 stores, multiple instructions are required.</p>
 <h3>Timing tests</h3>
 <p>These tests were run against release builds on a circa 2012 4-core little endian X64 Intel Core i5-3570K 
 CPU @ 3.40GHz under Windows 7.</p>
 <p><b>Caveat emptor: The Windows CPU timer has very high granularity. Repeated 
 runs of the same tests often yield considerably different results.</b></p>
 <p>See <a href="../test/loop_time_test.cpp">loop_time_test.cpp</a> and 
 <a href="../build/Jamfile.v2">Jamfile.v2</a> for the actual code and build 
 setup. 
@@ -376,6 +386,44 @@ setup.
 </table>
 <h3><a name="Conclusions">Conclusions</a></h3>
 <p>When program logic dictates the same number of conversions for both Endian 
 integer approach and Endian conversion function approach (<a href="#Example-1">example 
 1</a>):</p>
 <blockquote>
 <p><b>There will be no performance difference between the two approaches, 
 regardless of the native endianness of the machine.</b> Optimizing compilers will likely 
 generate exactly the same code for both. This conclusion was confirmed by 
 studying the generated assembly code.</p>
 </blockquote>
 <p>When program logic dictates many more conversions for the Endian integer 
 approach than the Endian conversion function approach (<a href="#Example-2">example 
 2</a>):</p>
 <blockquote>
 <p><b>There may be a considerable performance difference. </b>If machine endianness differs from the 
 desired endianness, the Endian type approach must do the byte reversal many 
 times while the Endian conversion approach only does the reversal once. But if 
 the endianness is the same, there is no conversion with either approach and no 
 conversion code is generated for typical release builds.</p>
 <p><b>Whether or not compiler byte swap intrinsics are available has little 
 impact.</b> Byte swap intrinsics are not available on some older 
 compilers and on some machine architectures, such as pre-486 X86 CPUs.</p>
 <p><b>Unaligned types are much slower that aligned types, regardless of 
 endianness considerations.</b> Instead of single instruction register loads and 
 stores, multiple instructions are required.</p>
 </blockquote>
 <h2>Overall <a name="FAQ">FAQ</a></h2>
 <p><b>Why bother with endianness?</b></p>
 <blockquote>
--- a/test/loop_time_test.cpp
+++ b/test/loop_time_test.cpp
@@ -9,7 +9,7 @@
 #define _SCL_SECURE_NO_WARNINGS
-#define BOOST_ENDIAN_NO_INTRINSICS
+//#define BOOST_ENDIAN_NO_INTRINSICS
 //#define BOOST_ENDIAN_LOG
 #include <boost/endian/detail/disable_warnings.hpp>