You are on page 1of 15

644

A neural network model for predicting maximum shear capacity of concrete beams without transverse reinforcement
Ayman Ahmed Seleemah

Abstract: Different relationships have been proposed by codes and researchers for predicting the shear capacity of members without transverse reinforcement. In this paper, the applicability of the artificial neural network (ANN) technique as an analytical alternative to existing methods for predicting this shear capacity is investigated using a critically reviewed and agreed upon database of experimental work that serves as a basis of comparison and (or) assessment of existing and new relationships. Both ANN and eight different codes and researchers predictions of the shear capacity of the specimens of the database were compared. The ANN predictions are much superior to those of any of the current available relationships. Key words: artificial neural networks, shear capacity, transverse reinforcement, beams. Rsum : Les codes et les chercheurs ont propos diverses relations pour prdire la rsistance au cisaillement de membrures sans armature transversale. Le prsent article tudie lapplicabilit de la technique des rseaux neuronaux artificiels comme alternative analytique aux mthodes existantes de prdiction de la rsistance au cisaillement. Nous avons utilis pour cela une base de donnes du travail exprimental accepte et examine scrupuleusement pour servir de base de comparaison et-ou dvaluation des relations existantes et nouvelles. Nous avons compar les prvisions mises par les rseaux neuronaux artificiels, huit diffrents codes et des chercheurs concernant la rsistance au cisaillement dchantillons de la base de donnes. Nous avons trouv que les prvisions des rseaux neuronaux artificiels sont grandement suprieures celles de toute autre relation actuellement disponible. Mots cls : rseaux neuronaux artificiels, rsistance au cisaillement, armature transversale, poutres. [Traduit par la Rdaction] Seleemah 657

Introduction
Civil and structural engineers attempt to improve the analysis, design, and control of the behavior of structural systems. The behavior of structural systems, however, is complex and often governed by both known and unknown multiple variables, with their interrelationship generally unknown, nonlinear, and sometimes very complicated. The traditional approach used in most research in modeling starts with an assumed form of an empirical or analytical equation and is followed by a regression analysis using experimental data to determine unknown coefficients such that the equation will fit the data. In the last two decades, researchers explored the potential of artificial neural networks (ANNs) as an analytical alternative to conventional techniques, which are often limited by strict assumptions of normality, linearity, homogeneity, variReceived 22 April 2004. Revision accepted 4 January 2005. Published on the NRC Research Press Web site at http://cjce.nrc.ca on 3 August 2005. A.A. Seleemah. Department of Civil Engineering, Benha High Institute of Technology, Benha, Egypt (e-mail: seleemah55@yahoo.com). Written discussion of this article is welcomed and will be received by the Editor until 31 December 2005.
Can. J. Civ. Eng. 32: 644657 (2005)

able independence, etc. Researchers found ANNs particularly useful for function approximation and mapping problems, which are tolerant of some imprecision and have a considerable amount of experimental data available. In a strict mathematical sense, ANNs do not provide closed-form solutions for modeling problems but offer a complex and accurate solution based on a representative set of historical examples of the relationship. Advantages of ANNs include the ability to learn and generalize from examples, produce meaningful solutions to problems even when input data contain errors or are incomplete, adapt solutions over time to compensate for changing circumstances, process information rapidly, and transfer readily between computing systems (Flood and Kartam 1994). This paper focuses on the prediction of the ultimate shear strength of reinforced concrete beams. This topic is still an active area of research because of the complexity of the shear transfer mechanism and the large number of affecting parameters. While many efforts have been conducted to understand the shear behavior of reinforced concrete beams and (or) to derive equations for estimating such shear capacity, some researchers explored the application of ANNs for such predictions. For example, Sanad and Saka (2001) applied ANNs to predict the ultimate shear capacity of reinforced concrete deep beams; Mansour et al. (2004) successfully used ANNs to predict the shear capacity of reinforced concrete beams with transverse reinforcement; and Oreta
2005 NRC Canada

doi: 10.1139/L05-003

Seleemah

645 Table 1. Ranges of parameters in the database. bw (mm) d (mm) a/d d/bw M/Vd f1c (MPa) f1t (MPa) fy (MPa) l (%) Max. aggregate size (mm) 76.21000.0 1102000 2.418.03 0.257.20 1.417.03 12.6110.9 1.275.29 275.91779.3 0.1386.635 6.3538.00

(2004) applied ANNs to a set of 155 experimental tests to simulate the size effect on the shear strength of reinforced concrete beams without transverse reinforcement. In the current paper, one of the largest and most confident databases is utilized to investigate the applicability of the ANN technique to predict the concrete contribution to shear resistance of beams. Moreover, a comparison of both ANN and different codes and researchers predictions for this important structural criteria is conducted.

Shear capacity of beams without transverse reinforcement


Different relationships have been proposed by researchers all over the world for predicting the shear capacity of beams without transverse reinforcement. The importance of these relationships is that they represent the concrete contribution to the overall shear resistance of a reinforced concrete beam with transverse reinforcement, which is a common design case. Unfortunately, these proposed relationships are usually empirical and designed to fit a limited set of shear test results that is most familiar to the researcher. Moreover, the relationships differ quite considerably in their selected parameters because there is no generally accepted model for the load transfer and the ultimate capacity of members without transverse reinforcement. This fact was discussed in the two state-of-the-art reports by the Joint ASCEACI Committee 445 on Shear and Torsion (1998) and the CEB (1997). Furthermore, the test data used for comparing proposals with tests differ very much from one researcher to another in the amount and quality of selected data. Therefore, it is essential to collect a unique and critically reviewed database of experimental work that would serve as a basis of comparison and (or) assessment of assumed relationships. Moreover, it is important also to explore other methods for predicting the shear capacity that would give results superior to those suggested relationships. In this paper, the capability of ANNs as an analytical alternative to conventional techniques for predicting the ultimate shear capacity of concrete beams is investigated.

Note: M, maximum bending moment; f1t, uniaxial concrete tensile strength; l, percentage of main steel. All other terms as defined in the text.

Shear database
A databank established by Reineck (1999) and checked by a group of experts was used to verify the empirical equation for members without transverse reinforcement proposed for the German standard DIN 1045-1 (DIN 2001). Kuchma (2000) conducted a wide-range collection of test data on members both with and without transverse reinforcement. Recently, both authors have merged their databanks into a collection shear databank (CSDB) that contains mainly beams with a rectangular cross section (933 beams, 231 of which have a shear span to beam depth ratio of less than 2.40). A subcommittee was then established within the Joint ASCEACI Committee 445 on Shear and Torsion to come to a consensus on criteria that must be satisfied if a specific test result is to be used for assessing the capability of empirical design rules or expressions derived from models for struc-

tural behavior. A sanctioned set of criteria was developed for accepting a test result into an evaluation-level database. Examples of these criteria include a minimum compressive strength, a minimum overall height and width, and a check against flexural failure. A databank of members that satisfy these agreed-upon criteria was established and called the evaluation shear databank (ESDB) (Reineck et al. 2003). This databank contained 439 shear tests collected from 64 references. All beams in this database have a rectangular cross section, do not contain shear reinforcement, and were subjected to point loads. Extensive discussions and review on the ESDB were then conducted by the American Concrete Institute (ACI) Subcommittee 445-F on Beam Shear, which led to some changes in the database, and a total of 41 tests were excluded for several reasons. The revised version (398 tests) was intended to serve as a basis for any code changes. This database is utilized in this study to evaluate and demonstrate the capability of the ANN technique for predicting the shear capacity of members without transverse reinforcement. Moreover, it was also used to evaluate eight different codes and formulas that exist for predicting such shear capacity, and several comparisons of the predictions of these methods and those of the ANNs are conducted. It should be pointed out that the aforementioned database covers a very wide range of beam depth, beam breadth, shear span to beam depth ratio, maximum aggregate size used in concrete and its tensile and compressive strengths, main reinforcement percentage, and yield stress. Table 1 summarizes the ranges of the parameters in the database.

Existing models for shear capacity prediction


Eight different major codes and methods of estimation of the shear capacity of reinforced concrete beams without shear reinforcement were collected and are as follows: ACI Committee 318 (1999), BSI (1990), ECS (1992), NZS (1995), Zsutty (1971), Collins and Kuchma (1999), Niwa et al. (1986), and Reineck (1991). The expressions utilized by each of these methods are summarized in Table 2. These methods were applied to the whole database (398 specimens) to achieve the prediction of each method for the shear capacity of each specimen.
2005 NRC Canada

646 Table 2. Summary of the expressions to calculate the concrete contribution to shear resistance. Reference ACI Committee 318 1999 BSI 1990 ECS 1992 NZS 1995 Zsutty 1971 Collins and Kuchma 1999 Niwa et al. 1986 Equation Vc = ( Fc / 6 ) bwd Vc = 0.27bwd (100 As / bwd )1 / 3 ( fcu )1 / 3 m

Can. J. Civ. Eng. Vol. 32, 2005

Remarks fcu 40.0 MPa k = (1.6 d/1000) 1.0; l = As/bwd 0.02 a/d 2 a/d 2.5 where S E = 35 S x /( a + 16 ) 1.4 0.75 + a /d Es = 2 000 000 MPa Sx 0.9d

Vrd1 = [Rdk(1.2 + 40l)]bwd Vc = ( 0.07 + 10 l ) fc bwd Vu = 2.2( fc l d/ a )1 / 3 bwd Vc = 245 1275 + S E fc bwd

Vc = 0.2 bwd

( fc )1 / 3 (100 As / bwd )1 / 3 ( d/1000 )1 / 4


8/9 2/3

Reineck 1991

Vu = (0.4bwdfct + Vdu)/(1 + 0.54); fct = 0.246(fc)2/3; Vdu/bw dfc = 1.33 /(fc) ; = fcd/Es wu , wu = 0.09 mm

Note: Es, modulus of elasticity of steel; fct, axial tensile strength of concrete; k, constant relating to section depth and curtailment; SE, crack spacing parameter; Sx, crack spacing; Vc, nominal shear strength; Vdu, dowel force; Vrd1, shear resistance; Vu, ultimate shear force; wu, limiting crack width; m, partial safety factor for strength; , dimension-free value for the crack width, which determines the friction capacity. All other terms as defined in the text.

It should be mentioned that the original values of the basic design shear strength Rd of the Eurocode EC2 (ECS 1992) is given by the code in a tabular form in terms of the characteristic cylinder strength fck (in MPa). Appropriate conversions were conducted to relate this factor to the uniaxial concrete compressive strength f1c. Moreover, for the British Standard BS5400-4 (BSI 1990), the characteristic concrete cube strength, fcu, was also converted in terms of f1c. Generally speaking, appropriate conversions for the unit system and (or) the compressive strength were conducted whenever such conversions were essential. The equations most frequently used in these conversions are as follows: [1] [2] [3] fck = fc 1.60 f1c = 0.95 fc f1c = 0.75 fc,cube

Fig. 1. Typical architecture of multilayer ANN.

where fc is the cylinder strength, and fc,cube is the cube strength of concrete.

Neural network based modeling


A neural network is a nonlinear dynamic system consisting of a large number of highly interconnected processing units, called artificial neurons. Its architecture and operations are inspired by the biological structure of neurons and the internal operation of the human brain. The main computational characteristics of neural networks are their ability to learn functional relationships from examples and to discover patterns and regularities in data through self-organization. In this study, the multilayer feed-forward neural networks are utilized because they are very suitable for modeling the nonlinear mapping type of problems. Figure 1 shows a typical

architecture of multilayer feed-forward neural networks with an input layer, an output layer, and one hidden layer. As shown in Fig. 1, the artificial neurons are arranged in layers, and all the neurons in each layer have connections to all the neurons in the next layer. Associated with each connection between these artificial neurons, a weight value (wi) is defined to represent the connection weight. The connec 2005 NRC Canada

Seleemah

647

tions between the neurons are individually weighed so that the total of i inputs (Xi) to the single neuron is [4] input =

the training data consist of input and associated output pairs representing the problem that we want the network to learn. Training and testing patterns An important factor that can significantly influence the ability of a network to learn and generalize is the number of patterns in the training set. Although it increases the time required to train a network, increasing the number of training patterns provides more information about the shape of the solution surface and thus increases the potential level of accuracy that can be achieved by the network. Another important condition is that the training data should be well distributed within the problem domain. Carpenter and Barthelemy (1994) stated that a necessary condition for obtaining a unique approximation is to have the number of training pairs equal to or greater than the number of weights and biases associated with the network. In our case this number ranges between nine and 161 for networks with one and 20 nodes in a single hidden layer, respectively. It also ranges between 11 and 191 for networks with two hidden layers having one and 10 neurons per hidden layer, respectively. Oreta and Kawashima (2003) stated that the minimum number (NT) of training data pairs, for ANNs with bias terms at the input and hidden layers, should be NT = J(I + 1) + K(J + 1), where I is the number of input nodes, J is the number of hidden layer nodes, and K is the number of output nodes. Some ANN experts suggest that NT should be in the range 10(I + J + K), which yields a number between 80 and 270. Applying all suggestions to our case, we reach a number of training data pairs between nine and 270. Carpenter and Hoffman (1995) suggested that an ANN model that is 20%50% overdetermined tends to produce reasonably good approximations. Compromising these suggestions and having in mind that we have a total of 398 data patterns, it was decided to use 50% of the data (199 specimens) for training and save the other 50% for testing or validation. Training data were first selected randomly, and then they were checked to make sure that they satisfy a good distribution within the problem domain. Back-propagation learning algorithm In this study, the training phase of ANNs is implemented by using the back-propagation learning algorithm. A backpropagation network typically starts out with a random set of weights. The network adjusts its weights each time it sees an inputoutput pair. Each pair requires two stages: a forward pass and a backward pass. The forward pass involves presenting a sample input to the network and letting activations flow until they reach the output layer. During the backward pass, the networks actual output (from the forward pass) is compared with the target output and error estimates are computed for the output units. The weights connected to the output units can be adjusted to reduce those errors. We can then use the error estimates of the output units to derive error estimates for the units in the hidden layers. Lastly, errors are propagated back to the connections stemming from the input units. The back-propagation algorithm updates its weights incrementally, after seeing each inputoutput pair. After it has seen all the inputoutput pairs (and adjusted its weights
2005 NRC Canada

wiX i + b
i

where b is the connection weight associated with a bias node that has an input value of 1.0. The weights may be positive or negative such that some inputs will be excitatory and others will be inhibitory. This input passes through an activation function to produce the values of Yi of the hidden layer(s) or Oi of the output layer. The activation function may have many forms. The most familiar and effective form in our case is the sigmoid function, defined as [5] output = 1/{[1 + exp[ (input)]} where is a constant that typically varies between 0.01 and 1.00. Signals are received at the input layer, pass through the hidden layers, and reach the output layer, producing the output of the network. The learning process primarily involves the determination of connection weight and bias matrices and the pattern of connections. It is through the presentation of examples, or training cases, and application of the learning rule that the neural network obtains the relationship embedded in the data. In applications of multilayer feed-forward neural networks to engineering modeling, there are four important issues to be addressed: (i) representation of the problem in which the composition of the input and output layers must be determined, (ii) determination of the configuration of hidden layer(s) through trial and error process, (iii) training of the ANN, and (iv) generalization verification of the trained ANN. The representation scheme depends on the nature of the problem and the intended use of the neural network model. Experience shows that there is no unique solution for representation schemes; different neural networks can produce similar results with the same set of training data. Input and output layers of ANN In this study, the neural networks were designed to have an input layer that consists of six input nodes representing the most important parameters that affect the shear capacity of reinforced concrete beams. Based on careful study of recent approaches for the shear phenomena in concrete members, it was decided to design the input layer to consist of (i) beam depth (d), (ii) beam breadth (bw), (iii) ratio of shear span (a) to beam depth (a/d), (iv) uniaxial concrete compressive strength (f1c), (v) percentage of main reinforcement (l ), and (vi) steel yield stress (fy). The output layer consisted of one node representing the ultimate shear capacity of the beam. Training of the network In a multilayer feed-forward neural network, training refers to the iterative process involving the presentation of training data to the network, the invocation of learning rules to modify the connection weights, and, usually, the evolution of the network architecture, such that the knowledge embedded in the training data is appropriately captured by the weight structure of the network. During the training phase,

648 Fig. 2. Training and testing root mean square (RMS) error trace: (a) ANN 6-12-1; (b) ANN 6-4-4-1.

Can. J. Civ. Eng. Vol. 32, 2005

many times), it is said that one epoch has been completed. Training a back-propagation network usually requires many thousands of epochs. An error criteria for the network output is usually chosen and the maximum number of iterations is set to provide a condition for terminating the learning process. The performance of ANNs can be monitored by monitoring the training error with respect to the number of iterations. If the network learns, the error will approach a minimum value. After the training phase, the ANNs can be tested for the other set of patterns, which the network has never seen, where the final values of the weights obtained in the training phase are used. No weight modification is involved in the testing phase. Normalization of input and output data Neural networks usually provide improved performance when the data lie within the range (0, 1). Because a sigmoid function is used, a slow rate of learning occurs near the end points of the sigmoid function. To avoid this, all input values and associated outputs are transformed to values ranging from 0.1 to 0.9 rather than from 0.0 to 1.0. The following equation is used: [6] XT = 0.1 + 0.8[(X Xmin)/(Xmax Xmin)]

Network topology Since there is no direct and precise way of determining the most appropriate number of hidden layers and number of neurons to include in each hidden layer for a problem, a trial and error procedure is typically used to approach the best network topology for a particular problem. In this paper, several network topologies were examined. These included networks with one hidden layer containing 1, 2, 4, 6, 8, 10, 12, 14, 16, 18, and 20 neurons in the single hidden layer; and networks with two hidden layers containing 1, 2, 4, 5, 6, 8, and 10 neurons in each hidden layer. The target network would be the smallest sized network that not only produces a minimum error for the training pattern but also gives a generalized solution that performs well for the testing pattern.

Results and discussion


Error trace Figure 2 shows the RMS error trace with the number of iterations for networks with a single hidden layer (ANN 6-121) and those with two hidden layers (ANN 6-4-4-1). Clearly, the RMS training error decreases with an increase in the number of iterations, whereas the RMS testing error decreases to a minimum and then begins to increase again. During the first part of the training, performance on the training set improves as the network adjusts its weights through back-propagation. Performance on the test set (dataset that the network is not allowed to learn on) also improves. After a while, the weights shift around, looking for a path for further improvement. When this path is found, performance on the training set improves but performance on the test set gets worse due to the fact that the network begins to memorize the individual inputoutput pairs rather than settling for weights that generally describe the mapping for all cases. Therefore, it is not always beneficial to increase the maximum number of allowed iterations. In this study, this number was set to 40 000 iterations, which was fairly suitable for most of the studied cases. Achieving best network The performance of different networks in terms of RMS error of both training and testing patterns is shown in Fig. 3. Increasing the number of neurons causes the RMS training error to decrease. In other words, networks with a large number of neurons in the hidden layers achieve good learning in terms of RMS training error. On the other hand, they give worse results in terms of RMS testing error. This indicates that too much power of the network (large number of
2005 NRC Canada

where XT is the normalized value, X is the original value, Xmax is the maximum value of the attribute or output, and Xmin is the minimum value of the attribute or output. Network validation and error analysis After an ANN model has been trained, validation of the network on the test patterns should be undertaken. The usual practice for ANN model validation is to evaluate the network performance measure using a selected error metric based on both training and testing data. The evaluation and validation of an ANN prediction model can be done by using common error metrics such as the root mean squared error (RMS error). The definition for RMS error is given in the following equation: [7] RMS =

(Oij Yij )2 / nm
i =1 j =1

where n is the number of patterns in the validation set, m is the number of components in the output vector, O is the output of a single neuron, and Y is the target output for the single neuron.

Seleemah Fig. 3. RMS error for different network topologies: (a) networks with a single hidden layer; (b) networks with two hidden layers.

649

Fig. 4. Statistical errors for all patterns (398 specimens): (a) networks with a single hidden layer; (b) networks with two hidden layers.

neurons in the hidden layers) allows the network to memorize the training pattern rather than extract the noticeable features that map the input data to the output data. To judge which network performs better, a statistical error of each network was calculated for all patterns, that is, 398 pattern sets. The results are shown in Fig. 4, in which the average error, median, standard deviation (SD), and standard variation (SV) are shown for each of the tested networks. For the networks with a single hidden layer, the networks with six and 10 neurons in the hidden layer are better than all other networks. For the networks with two hidden layers, the one with six neurons in each hidden layer can be considered best. These three networks were selected and allowed to run 200 000 more iterations to reach a total of 240 000 iterations. The maximum, minimum, average, SD, and SV of the ratio of experimental to model-predicted shear capacity ( mod = Vexp/Vpred) for the three networks with 40 000 iterations and their counterparts with 240 000 iterations are shown in Fig. 5. Note that the networks with 240 000 iterations are denoted ANN#. Clearly the ANN 6-6-1 network with 40 000 iterations was superior to all other networks. It is concluded that too much increase in the number of iterations, although consuming a large amount of computer running time, did not improve the performance of the networks at all. A comparison of the experimental and predicted shear capacity by the aforementioned six networks for all 398 data patterns is shown in Fig. 6. The line of equality and the lines of plus or minus 10% error are also plotted to facilitate visualization and judgment of the results. The network that gives results closer to the equality line is of course better. ANN 66-1 is obviously the best network and has the greatest num-

Fig. 5. Statistical data for the ratio between experimental and predicted shear capacity (mod = Vexp/Vpred) for different networks.

ber of predictions lying between the plus and minus 10% error lines. Figure 7 shows a histogram of the ratio between experimental and predicted shear capacity for ANN 6-6-1. The histogram is slightly skewed to the left-hand side and has its maximum occurring at mod = 0.96. Comparison with different models The ANN 6-6-1 is employed to predict the shear capacity of all 398 beam specimens, together with the eight different codes and methods listed in Table 2. The results are presented in Fig. 8, which shows, in separate plots, comparisons between experimental and predicted shear capacity by each of the methods. It is clear from Fig. 8 that the ACI Committee 318 (1999), BSI (1990), ECS (1992), Collins and
2005 NRC Canada

650

Can. J. Civ. Eng. Vol. 32, 2005

Fig. 6. Comparison of experimental and predicted shear capacity for different networks. The broken lines represent 10% error on either side of the line of equality.

Fig. 7. Histogram of the ratio of experimental to predicted shear capacity for ANN 6-6-1.

Kuchma (1999), and Niwa et al. (1986) methods give wide variations on either side of the equality line. This means that these methods underestimate the shear capacity for some specimens and overestimate the shear capacity for other specimens. The NZS (1995) and Zsutty (1971) methods overestimate the shear capacity. The Reineck (1991) method is the only conservative method, in which most of the predictions are underestimates. In contrast, the predictions of the ANN are extremely impressive, with most results lying on or very close to the equality line. This accuracy suggests that the most critical variables controlling the shear capacity of concrete beams without transverse reinforcement are the six that have been used as input data to the ANN model.

The ratio of the experimental to predicted shear capacity by specific model ( mod = Vexp/Vpred) can be interpreted as the additional factor of safety implied by this method, since no strength reduction factor was applied during the calculation of shear capacity by any of the methods. For economic considerations, this additional safety factor should not be very large because there is a concrete strength reduction factor, m , that is applied during any design process. The ratio mod also should not be much lower than unity, since this means that the method overestimates the shear capacity of the beam and may lead to unsafe design. A comparison of statistical calculations of the ratio of the experimental to predicted shear capacity for all models ( mod ) and for all data patterns was conducted and the results are shown in Fig. 9. As observed earlier, the method of Reineck (1991) gives the most conservative results, with a maximum ratio of 4.40, a minimum ratio of 0.87, and an average ratio of 1.43. The ANN method gives the best results, with a maximum ratio of 1.26, a minimum ratio of 0.74, and an average ratio of 0.94. All other methods have a maximum ratio lying between 2.00 and 3.30, a minimum ratio between 0.32 and 0.58, and an average ratio between 0.87 and 1.47. Both the SD and SV are minimum for the ANN method. To describe the influence of dominant parameters on the predictions of each model, the ratio of actual to predicted shear capacity ( mod ) is plotted versus the primary parameters in Figs. 1013. A common feature of all the plots is the very large scatter in the predictions by different models, in which mod ranged from 0.33 to 4.41. Figure 10 shows the plot of mod versus the ratio of shear span to beam depth (a/d). Most tests were conducted for low a/d ratios, for example, 81% of all tests (323 of 398 tests)
2005 NRC Canada

Seleemah

651

Fig. 8. Comparison of experimental and predicted shear capacity for 398 specimens. The broken lines represent 10% error on either side of the line of equality.

Fig. 9. Statistical data for the ratio between experimental and predicted shear capacity for different shear proposals.

were carried out for beams with an a/d ratio of less than or equal to 4.0. A large scatter is noticed in the predictions made by all methods for a/d ratios between 2.4 and 4.0. For the ACI Committee 318 (1999), BSI (1990), ECS (1992), NZS (1995), Collins and Kuchma (1999), and Reineck (1991) models, the additional safety factor, mod , increases with a decrease in the a/d ratio, indicating a beneficial influence due to direct load transfer that is not captured by these models. The results obtained from the neural network indicate consistent accuracy in all ranges of the a/d ratio, indicating that the shear phenomena were well captured by this network. Variation of the additional safety factor, mod , with the concrete uniaxial compressive strength (f1c) for the models considered in this study is shown in Fig. 11. Most of the tests in the database (73%) were carried out for normalstrength concrete (NSC) having f1c less than 45.0 MPa (289 of 398 tests). The scatter of the predictions by the ACI Committee 318 (1999), BSI (1990), and Collins and Kuchma
2005 NRC Canada

652

Can. J. Civ. Eng. Vol. 32, 2005

Fig. 10. Ratio of experimental to predicted shear capacity versus the ratio a/d for different shear proposals.

2005 NRC Canada

Seleemah

653

Fig. 11. Ratio of experimental to predicted shear capacity versus uniaxial concrete compressive strength (f1c) for different shear proposals.

2005 NRC Canada

654

Can. J. Civ. Eng. Vol. 32, 2005

Fig. 12. Ratio of experimental to predicted shear capacity versus percentage of main steel (1 = As/bwd) for different shear proposals.

2005 NRC Canada

Seleemah Fig. 13. Ratio of experimental to predicted shear capacity versus beam depth (d) for different shear proposals.

655

2005 NRC Canada

656

Can. J. Civ. Eng. Vol. 32, 2005

(1999) methods for the specimens with high-strength concrete (HSC) having f1c greater than 45.0 MPa is very apparent. The results of the NZS (1995), Zsutty (1971), and Niwa et al. (1986) methods are mostly unaffected by the variation of f1c, indicating better recognition of the effect of f1c on the shear capacity of beams without shear reinforcement. Among all of the models shown, the results obtained from the neural network are the most consistent, with values close to unity for a wide variation in concrete uniaxial compressive strength. Figure 12 shows a plot of mod versus the longitudinal reinforcement percentage l (= As/bwd, where As is the area of longitudinal reinforcement). Fifty-five percent of all tests (219 of 398 tests) were conducted for a relatively high reinforcement ratio of 1 > 2%. Only 14.6% of the tests (58 of 398 tests) were conducted for 1 < 1%. The results obtained using the ACI Committee 318 (1999), BSI (1990), ECS (1992), Zsutty (1971), and Collins and Kuchma (1999) methods show a general increase in the models additional safety factor, mod , with the increase in 1 indicating a pronounced beneficial effect of the reinforcement ratio that is not properly captured by any of these methods. Some of these methods, namely those of ECS and Zsutty, included the reinforcement ratio in their equations, but it seems that the beneficial effect of reinforcement exceeds what was included in the relationships. The models of Niwa et al. (1986) and Reineck (1991) are nearly unaffected by the change in the reinforcement ratio, indicating a reasonable capturing of the effect of reinforcement ratio. The performance of the neural network model is superior to that of all other models. It not only is unaffected by the change in reinforcement ratio, but also gives values of mod near or equal to unity. Lastly, the variation of the additional safety factor, mod , with beam depth, d, is shown in Fig. 13; 87% of all the specimens (346 of 398 specimens) had a beam depth of less than 500 mm. Once again, most methods yield large scatter in the results, especially for specimens with beam depths in the range 140300 mm. The additional safety factor, mod , predicted using the ACI Committee 318 (1999), BSI (1990), NZS (1995), Zsutty (1971), and Collins and Kuchma (1999) models decreases with an increase in depth, indicating improper capturing of the effect of depth on the overall shear capacity of beams. In contradiction to all other methods, the predictions of the ANN are very consistent and very close to unity.

contained a total of 398 tests and was intended, by the Joint ASCEACI Committee 445 on Shear and Torsion (1998) and the ACI Subcommittee 445-F on Beam Shear, to serve as a basis for any code changes regarding the shear phenomena. The database covered a very wide range of beam parameters, including their dimensions, concrete strengths, reinforcement ratios, and shear span to beam depth ratios. This database was utilized in this study to evaluate the following eight different existing codes and formulas for predicting the shear capacity of members without transverse reinforcement: ACI Committee 318 (1999), BSI (1990), ECS (1992), NZS (1995), Zsutty (1971), Collins and Kuchma (1999), Niwa et al. (1986), and Reineck (1991). Moreover, the database was used to evaluate and demonstrate the capability of the feed-forward back-propagation artificial neural networks (ANNs) for predicting such shear capacity. Based on careful study of recent approaches for the shear phenomena in concrete members, it was decided to design the ANN to have six nodes in the input layer containing data regarding beam depth (d), beam breadth (bw), shear span to beam depth ratio (a/d), uniaxial concrete compressive strength (f1c), percentage of main reinforcement (1), and steel yield stress (fy). The output layer consisted of one node representing the ultimate shear capacity of the beam. Several network topologies were examined. These included networks with one hidden layer containing 1, 2, 4, 6, 8, 10, 12, 14, 16, 18, and 20 neurons in the single hidden layer; and networks with two hidden layers containing 1, 2, 4, 5, 6, 8, and 10 neurons in each hidden layer. All the networks were trained on 199 shear tests, and the performance of the networks on both 199 training datasets and 199 testing datasets was compared; the network with the best prediction was selected for comparison with the eight models. In comparing the predictions of the ANN with those from the eight models, it was found that the results obtained from the ANN were the most accurate, giving values of maximum shear capacity very close to the experimental values. Moreover, the results obtained using the ANN were very consistent and covered a very wide range of variation of any of the input parameters. This accuracy suggests that the most critical variables controlling the shear capacity of concrete beams without transverse reinforcement are the six variables used as input data to the ANN model.

Acknowledgment Summary and conclusions


Different relationships have been proposed by researchers all over the world for predicting the shear capacity of concrete members without transverse reinforcement. Unfortunately, these proposed relationships are usually empirical and designed to fit a limited set of shear test results that is most familiar to the researcher. Accordingly, the relationships differ quite considerably in their selected parameters, since there is no generally accepted model for the load transfer and ultimate shear capacity of members without transverse reinforcement. A unique and critically reviewed database of experimental work conducted on beams without transverse reinforcement was collected from 64 different references. This database The author would like to thank Professor Amr W. Sadek of Cairo University and the Kuwait Institute for Scientific Research (KISR) for his valuable suggestions regarding determination of the most appropriate number of training and testing patterns.

References
ACI Committee 318. 1999. Building code requirements for structural concrete (ACI318-99) and commentary (318R-99). American Concrete Institute, Farmington Hills, Mich. BSI. 1990. Steel, concrete and composite bridges Part 4: Code of practice for design of concrete bridges. British Standard BS5400-4, British Standards Institution, London, U.K.
2005 NRC Canada

Seleemah Carpenter, W.C., and Barthelemy, J. 1994. Common misconceptions about neural networks as approximators. ASCE Journal of Computing in Civil Engineering, 8(3): 345358. Carpenter, W.C., and Hoffman, M.E. 1995. Training backpropagation neural networks. AI Expert, 10(3): 3033. CEB. 1997. Concrete tension and size effects: utilisation of concrete tension in structural concrete design and relevance of size effect. Bulletin 237. Comit Euro-International du Bton (CEB), Lausanne, Switzerland. 258 pp. Collins, M.P., and Kuchma, D.A. 1999. How safe are our large, lightly reinforced concrete beams, slabs, and footings. ACI Structural Journal, 96(4): 482490. DIN. 2001. Plain concrete, reinforced and prestressed concrete structures Part 1: Design and construction (DIN 1045-1). Deutsches Institut fr Normung e.V. (DIN), Berlin, Germany. ECS. 1992. Eurocode 2: design of concrete structures Part 1: General rules and rules for buildings (DD ENV 1992-1-1, BSI). European Committee for Standardization (ECS), Brussels, Belgium. 253 pp. Flood, I., and Kartam, N. 1994. Neural networks in civil engineering. I: principles and understanding. ASCE Journal of Computing in Civil Engineering, 8(2): 131148. Joint ASCEACI Committee 445 on Shear and Torsion. 1998. Recent approaches to shear design of structural concrete. ASCE Journal of Structural Engineering, 124(12): 13751417. Kuchma, D.A. 2000. Shear Data Bank. Available from http://www. ce.uiuc. edu/kuchma/sheardatabank [cited December 2003]. Mansour, M.Y., Dicleli, M., Lee, J.Y., and Zhang, J. 2004. Predicting the shear strength of reinforced concrete beams using artificial neural networks. Engineering Structures, 26(6): 781799.

657 Niwa, J., Yamada, K., Yokozawa, K., and Okamura, M. 1986. Reevaluation of the equation for shear strength of RC beams without web reinforcement. Proceedings of the Japan Society of Civil Engineering, 5(372): 19861988. [In Japanese.] NZS. 1995. New Zealand standard code of practice for the design of concrete structures (NZS 3101). Standard Association of New Zealand (NZS), Wellington, New Zealand. Oreta, A.C. 2004. Simulating size effect on shear strength of RC beams without stirrups using neural networks. Engineering Structures, 26(5): 681691. Oreta, A.C., and Kawashima, K. 2003. Neural network modeling of confined compressive strength and strain of circular concrete columns. ASCE Journal of Structural Engineering, 129(4): 554561. Reineck, K.H. 1991. Ultimate shear force of structural concrete members without transverse reinforcement derived from a mechanical model. ACI Structural Journal, 88(5): 592602. Reineck, K.H. 1999. Shear design of members without transverse reinforcement in DIN 1045-1. Final research report for DIBt, Project IV 1-5-876/98. Chapter 3 and Appendixes. University of Leipzig, Aachen, and University of Stuttgart, TU Munchen, Germany. [In German.] Reineck, K.H., Kuchma, D., Kim, K.S., and Marx, S. 2003. Shear database for reinforced concrete members without shear reinforcement. ACI Structural Journal, 100(2): 240249. Sanad, A., and Saka, M.P. 2001. Prediction of ultimate shear strength of reinforced concrete deep beams using neural networks. ASCE Journal of Structural Engineering, 127(7): 818827. Zsutty, T.C. 1971. Shear strength prediction for separate categories of simple beams tests. ACI Journal, 68(2): 138143.

2005 NRC Canada

You might also like