Do not enable proxying with
A set of modules must be loaded into the server to provide the
necessary features. These modules can be included statically at
build time or dynamically via the
Protocol | Module |
---|---|
AJP13 (Apache JServe Protocol version 1.3) | |
CONNECT (for SSL) | |
FastCGI | |
ftp | |
HTTP/0.9, HTTP/1.0, and HTTP/1.1 | |
SCGI |
In addition, extended features are provided by other modules.
Caching is provided by SSLProxy*
directives of
Apache HTTP Server can be configured in both a forward and reverse proxy (also known as gateway) mode.
An ordinary forward proxy is an intermediate server that sits between the client and the origin server. In order to get content from the origin server, the client sends a request to the proxy naming the origin server as the target and the proxy then requests the content from the origin server and returns it to the client. The client must be specially configured to use the forward proxy to access other sites.
A typical usage of a forward proxy is to provide Internet
access to internal clients that are otherwise restricted by a
firewall. The forward proxy can also use caching (as provided
by
The forward proxy is activated using the
A reverse proxy (or gateway), by contrast, appears to the client just like an ordinary web server. No special configuration on the client is necessary. The client makes ordinary requests for content in the name-space of the reverse proxy. The reverse proxy then decides where to send those requests, and returns the content as if it was itself the origin.
A typical usage of a reverse proxy is to provide Internet users access to a server that is behind a firewall. Reverse proxies can also be used to balance load among several back-end servers, or to provide caching for a slower back-end server. In addition, reverse proxies can be used simply to bring several servers into the same URL space.
A reverse proxy is activated using the [P]
flag to the
The examples below are only a very basic idea to help you get started. Please read the documentation on the individual directives.
In addition, if you wish to have caching enabled, consult
the documentation from
The proxy manages the configuration of origin servers and their communication parameters in objects called workers. There are two built-in workers, the default forward proxy worker and the default reverse proxy worker. Additional workers can be configured explicitly.
The two default workers have a fixed configuration and will be used if no other worker matches the request. They do not use HTTP Keep-Alive or connection pooling. The TCP connections to the origin server will instead be opened and closed for each request.
Explicitly configured workers are identified by their URL.
They are usually created and configured using
This will create a worker associated with the origin server URL
http://backend.example.com
and using the given timeout
values. When used in a forward proxy, workers are usually defined
via the
or alternatively using
Using explicitly configured workers in the forward mode is
not very common, because forward proxies usually communicate with many
different origin servers. Creating explicit workers for some of the
origin servers can still be useful, if they are used very often.
Explicitly configured workers have no concept of forward or reverse
proxying by themselves. They encapsulate a common concept of
communication with origin servers. A worker created by
The URL identifying a direct worker is the URL of its origin server including any path components given:
This example defines two different workers, each using a separate connection pool and configuration.
Worker sharing happens if the worker URLs overlap, which occurs when the URL of some worker is a leading substring of the URL of another worker defined later in the configuration file. In the following example
the second worker isn't actually created. Instead the first
worker is used. The benefit is, that there is only one connection pool,
so connections are more often reused. Note that all configuration attributes
given explicitly for the later worker will be ignored. This will be logged
as a warning. In the above example the resulting timeout value
for the URL /examples
will be 60
instead
of 10
!
If you want to avoid worker sharing, sort your worker definitions
by URL length, starting with the longest worker URLs. If you want to maximize
worker sharing use the reverse sort order. See also the related warning about
ordering
Explicitly configured workers come in two flavors:
direct workers and (load) balancer workers.
They support many important configuration attributes which are
described below in the
The set of options available for a direct worker
depends on the protocol, which is specified in the origin server URL.
Available protocols include ajp
, fcgi
,
ftp
, http
and scgi
.
Balancer workers are virtual workers that use direct workers known as their members to actually handle the requests. Each balancer can have multiple members. When it handles a request, it chooses a member based on the configured load balancing algorithm.
A balancer worker is created if its worker URL uses
balancer
as the protocol scheme.
The balancer URL uniquely identifies the balancer worker.
Members are added to a balancer using
You can control who can access your proxy via the
For more information on access control directives, see
Strictly limiting access is essential if you are using a
forward proxy (using the ProxyRequests Off
), access control is less
critical because clients can only contact the hosts that you
have specifically configured.
See Also the Proxy-Chain-Auth environment variable.
If you're using the
An Apache httpd proxy server situated in an intranet needs to forward
external requests through the company's firewall (for this, configure
the
Users within an intranet tend to omit the local domain name from their
WWW requests, thus requesting "http://somehost/" instead of
http://somehost.example.com/
. Some commercial proxy servers
let them get away with this and simply serve the request, implying a
configured local domain. When the
For circumstances where
These are the force-proxy-request-1.0
and
proxy-nokeepalive
notes.
Some request methods such as POST include a request body.
The HTTP protocol requires that requests which include a body
either use chunked transfer encoding or send a
Content-Length
request header. When passing these
requests on to the origin server, Content-Length
. But
if the body is large and the original request used chunked
encoding, then chunked encoding may also be used in the upstream
request. You can control this selection using environment variables. Setting
proxy-sendcl
ensures maximum compatibility with
upstream servers by always sending the
Content-Length
, while setting
proxy-sendchunked
minimizes resource usage by using
chunked encoding.
Under some circumstances, the server must spool request bodies to disk to satisfy the requested handling of request bodies. For example, this spooling will occur if the original body was sent with chunked encoding (and is large), but the administrator has asked for backend requests to be sent with Content-Length or as HTTP/1.0. This spooling can also occur if the request body already has a Content-Length header, but the server is configured to filter incoming request bodies.
When acting in a reverse-proxy mode (using the
X-Forwarded-For
X-Forwarded-Host
Host
HTTP request header.X-Forwarded-Server
Be careful when using these headers on the origin server, since
they will contain more than one (comma-separated) value if the
original request already contained one of these headers. For
example, you can use %{X-Forwarded-For}i
in the log
format string of the origin server to log the original clients IP
address, but you may get more than one address if the request
passes through several proxies.
See also the
Directives placed in
For example, the following will allow only hosts in
yournetwork.example.com
to access content via your proxy
server:
The following example will process all files in the foo
directory of example.com
through the INCLUDES
filter when they are sent through the proxy server:
The
IsError
Ignore
StartBody
The
When enabled, this option will pass the Host: line from the incoming
request to the proxied host, instead of the hostname specified in the
This option should normally be turned Off
. It is mostly
useful in special configurations like proxied mass name-based virtual
hosting, where the original Host header needs to be evaluated by the
backend server.
This allows or prevents Apache httpd from functioning as a forward proxy
server. (Setting ProxyRequests to Off
does not disable use of
the
In a typical reverse proxy or gateway configuration, this
option should be set to
Off
.
In order to get the functionality of proxying HTTP or FTP sites, you
need also
Do not enable proxying with
This defines remote proxies to this proxy. match is either the
name of a URL-scheme that the remote server supports, or a partial URL
for which the remote server should be used, or *
to indicate
the server should be contacted for all requests. remote-server is
a partial URL for the remote server. Syntax:
scheme is effectively the protocol that should be used to
communicate with the remote server; only http
and https
are supported by this module. When using https
, the requests
are forwarded through the remote proxy using the HTTP CONNECT method.
In the last example, the proxy will forward FTP requests, encapsulated as yet another HTTP proxy request, to another proxy which can handle them.
This option also supports reverse proxy configuration - a backend webserver can be embedded within a virtualhost URL space even if that server is hidden by another forward proxy.
The
This directive allows for growth potential in the number of Balancers available for a virtualhost in addition to the number pre-configured. It only take effect if there is at least 1 pre-configured Balancer.
This directive adds a member to a load balancing group. It could be used
within a <Proxy balancer://...>
container
directive, and can take any of the key value pair parameters available to
One additional parameter is available only to
The balancerurl is only needed when not in <Proxy balancer://...>
container directive. It corresponds to the url of a balancer defined in
This directive is used as an alternate method of setting any of the
parameters available to Proxy balancers and workers normally done via the
<Proxy balancer url|worker url>
container directive, the url argument is not required. As a side
effect the respective balancer or worker gets created. This can be useful
when doing reverse proxying via a
Keep in mind that the same parameter key can have a different meaning depending whether it is applied to a balancer or a worker as shown by the two examples above regarding timeout.
This directive allows remote servers to be mapped into the space of the local server; the local server does not act as a proxy in the conventional sense, but appears to be a mirror of the remote server. The local server is often called a reverse proxy or gateway. The path is the name of a local virtual path; url is a partial URL for the remote server and cannot include a query string.
Suppose the local server has address http://example.com/
;
then
will cause a local request for
http://example.com/mirror/foo/bar
to be internally converted
into a proxy request to http://backend.example.com/bar
.
The following alternative syntax is possible, however carries a performance penalty when present in large numbers:
If the first argument ends with a trailing /, the second argument should also end with a trailing / and vice versa. Otherwise the resulting requests to the backend may miss some needed slashes and do not deliver the expected results.
The !
directive is useful in situations where you don't want
to reverse-proxy a subdirectory, e.g.
will proxy all requests to /mirror/foo
to
backend.example.com
except requests made to
/mirror/foo/i
.
The configured
For the same reasons exclusions must come before the
general
In Apache HTTP Server 2.1 and later, mod_proxy supports pooled
connections to a backend server. Connections created on demand
can be retained in a pool for future use. Limits on the pool size
and other settings can be coded on
the key=value
parameters, described in the table
below.
By default, mod_proxy will allow and retain the maximum number of
connections that could be used simultaneously by that web server child
process. Use the max
parameter to reduce the number from
the default. Use the ttl
parameter to set an optional
time to live; connections which have been unused for at least
ttl
seconds will be closed. ttl
can be used
to avoid using a connection which is subject to closing because of the
backend server's keep-alive timeout.
The pool of connections is maintained per web server child
process, and max
and other settings are not coordinated
among all child processes, except when only one child process is allowed
by configuration or MPM design.
BalancerMember parameters |
---|
Parameter | Default | Description | ||||||
---|---|---|---|---|---|---|---|---|
min | 0 | Minimum number of connection pool entries, unrelated to the actual number of connections. This only needs to be modified from the default for special circumstances where heap memory associated with the backend connections should be preallocated or retained. | ||||||
max | 1...n | Maximum number of connections that will be allowed to the
backend server. The default for this limit is the number of threads
per process in the active MPM. In the Prefork MPM, this is always 1,
while with other MPMs it is controlled by the
| ||||||
smax | max | Retained connection pool entries above this limit are freed
during certain operations if they have been unused for longer than
the time to live, controlled by the ttl parameter. If
the connection pool entry has an associated connection, it will be
closed. This only needs to be modified from the default for special
circumstances where connection pool entries and any associated
connections which have exceeded the time to live need to be freed or
closed more aggressively. | ||||||
acquire | - | If set this will be the maximum time to wait for a free
connection in the connection pool, in milliseconds. If there are no free
connections in the pool the Apache httpd will return SERVER_BUSY
status to the client.
| ||||||
connectiontimeout | timeout | Connect timeout in seconds. The number of seconds Apache httpd waits for the creation of a connection to the backend to complete. By adding a postfix of ms the timeout can be also set in milliseconds. | ||||||
disablereuse | Off | This parameter should be used when you want to force mod_proxy
to immediately close a connection to the backend after being used, and
thus, disable its persistent connection and pool for that backend.
This helps in various situations where a firewall between Apache
httpd and
the backend server (regardless of protocol) tends to silently
drop connections or when backends themselves may be under round-
robin DNS. To disable connection pooling reuse,
set this property value to On .
| ||||||
flushpackets | off | Determines whether the proxy module will auto-flush the output brigade after each "chunk" of data. 'off' means that it will flush only when needed, 'on' means after each chunk is sent and 'auto' means poll/wait for a period of time and flush if no input has been received for 'flushwait' milliseconds. Currently this is in effect only for AJP. | ||||||
flushwait | 10 | The time to wait for additional input, in milliseconds, before flushing the output brigade if 'flushpackets' is 'auto'. | ||||||
iobuffersize | 8192 | Adjusts the size of the internal scratchpad IO buffer. This allows you
to override the | ||||||
keepalive | Off | This parameter should be used when you have a firewall between your
Apache httpd and the backend server, who tend to drop inactive connections.
This flag will tell the Operating System to send The frequency of initial and subsequent TCP keepalive probes depends on global OS settings, and may be as high as 2 hours. To be useful, the frequency configured in the OS must be smaller than the threshold used by the firewall. | ||||||
lbset | 0 | Sets the load balancer cluster set that the worker is a member of. The load balancer will try all members of a lower numbered lbset before trying higher numbered ones. | ||||||
ping | 0 | Ping property tells the webserver to "test" the connection to
the backend before forwarding the request. For AJP, it causes
CPING
request on the ajp13 connection (implemented on Tomcat 3.3.2+, 4.1.28+
and 5.0.13+). For HTTP, it causes 100-Continue to the backend (only valid for
HTTP/1.1 - for non HTTP/1.1 backends, this property has no
effect). In both cases the parameter is the delay in seconds to wait
for the reply.
This feature has been added to avoid problems with hung and
busy backends.
This will increase the network traffic during the normal operation
which could be an issue, but it will lower the
traffic in case some of the cluster nodes are down or busy.
By adding a postfix of ms the delay can be also set in
milliseconds.
| ||||||
receivebuffersize | 0 | Adjusts the size of the explicit (TCP/IP) network buffer size for
proxied connections. This allows you to override the
| ||||||
redirect | - | Redirection Route of the worker. This value is usually set dynamically to enable safe removal of the node from the cluster. If set all requests without session id will be redirected to the BalancerMember that has route parameter equal as this value. | ||||||
retry | 60 | Connection pool worker retry timeout in seconds. If the connection pool worker to the backend server is in the error state, Apache httpd will not forward any requests to that server until the timeout expires. This enables to shut down the backend server for maintenance, and bring it back online later. A value of 0 means always retry workers in an error state with no timeout. | ||||||
route | - | Route of the worker when used inside load balancer. The route is a value appended to session id. | ||||||
status | - | Single letter value defining the initial status of
this worker.
| ||||||
timeout | Connection timeout in seconds. The number of seconds Apache httpd waits for data sent by / to the backend. | |||||||
ttl | - | Time to live for inactive connections and associated connection pool entries, in seconds. Once reaching this limit, a connection will not be used again; it will be closed at some later time. |
If the Proxy directive scheme starts with the
balancer://
(eg: balancer://cluster/
,
any path information is ignored) then a virtual worker that does not really
communicate with the backend server will be created. Instead it is responsible
for the management of several "real" workers. In that case the special set of
parameters can be add to this virtual worker. See
Balancer parameters |
---|
Parameter | Default | Description |
---|---|---|
lbmethod | byrequests | Balancer load-balance method. Select the load-balancing scheduler
method to use. Either byrequests , to perform weighted
request counting, bytraffic , to perform weighted
traffic byte count balancing, or bybusyness , to perform
pending request balancing. Default is byrequests .
|
maxattempts | One less than the number of workers, or 1 with a single worker. | Maximum number of failover attempts before giving up. |
nofailover | Off | If set to On the session will break if the worker is in
error state or disabled. Set this value to On if backend servers do not
support session replication.
|
stickysession | - | Balancer sticky session name. The value is usually set to something
like JSESSIONID or PHPSESSIONID ,
and it depends on the backend application server that support sessions.
If the backend application server uses different name for cookies
and url encoded id (like servlet containers) use | to to separate them.
The first part is for the cookie the second for the path.
|
scolonpathdelim | Off | If set to On the semi-colon character ';' will be
used as an additional sticky session path deliminator/separator. This
is mainly used to emulate mod_jk's behavior when dealing with paths such
as JSESSIONID=6736bcf34;foo=aabfa
|
timeout | 0 | Balancer timeout in seconds. If set this will be the maximum time to wait for a free worker. Default is not to wait. |
failonstatus | - | A single or comma-separated list of HTTP status codes. If set this will force the worker into error state when the backend returns any status code in the list. Worker recovery behaves the same as other worker errors. |
nonce | <auto> | The protective nonce used in the balancer-manager application page.
The default is to use an automatically determined UUID-based
nonce, to provide for further protection for the page. If set,
then the nonce is set to that value. A setting of None
disables all nonce checking.
In addition to the nonce, the |
growth | 0 | Number of additional BalancerMembers to allow to be added to this balancer in addition to those defined at configuration. |
A sample balancer setup
Setting up a hot-standby, that will only be used if no other members are available
Normally, mod_proxy will canonicalise ProxyPassed URLs. But this may be incompatible with some backends, particularly those that make use of PATH_INFO. The optional nocanon keyword suppresses this, and passes the URL path "raw" to the backend. Note that may affect the security of your backend, as it removes the normal limited protection against URL-based attacks provided by the proxy.
The optional interpolate keyword (available in
httpd 2.2.9 and later), in combination with
When used inside a
This directive is not supported in
If you require a more flexible reverse-proxy configuration, see the
[P]
flag.
This directive is equivalent to
Suppose the local server has address http://example.com/
;
then
will cause a local request for
http://example.com/foo/bar.gif
to be internally converted
into a proxy request to http://backend.example.com/foo/bar.gif
.
The URL argument must be parsable as a URL before regexp substitutions (as well as after). This limits the matches you can use. For instance, if we had used
in our previous example, it would fail with a syntax error at server startup. This is a bug (PR 46665 in the ASF bugzilla), and the workaround is to reformulate the match:
The !
directive is useful in situations where you don't want
to reverse-proxy a subdirectory.
When used inside a
If you require a more flexible reverse-proxy configuration, see the
[P]
flag.
Take care when constructing the target URL of the rule, considering the security impact from allowing the client influence over the set of URLs to which your server will act as a proxy. Ensure that the scheme and hostname part of the URL is either fixed, or does not allow the client undue influence.
This directive lets Apache httpd adjust the URL in the Location
,
Content-Location
and URI
headers on HTTP
redirect responses. This is essential when Apache httpd is used as a
reverse proxy (or gateway) to avoid by-passing the reverse proxy
because of HTTP redirects on the backend servers which stay behind
the reverse proxy.
Only the HTTP response headers specifically mentioned above will be rewritten. Apache httpd will not rewrite other response headers, nor will it rewrite URL references inside HTML pages. This means that if the proxied content contains absolute URL references, they will by-pass the proxy. A third-party module that will look inside the HTML and rewrite URL references is Nick Kew's mod_proxy_html.
path is the name of a local virtual path. url is a
partial URL for the remote server - the same way they are used for the
For example, suppose the local server has address
http://example.com/
; then
will not only cause a local request for the
http://example.com/mirror/foo/bar
to be internally converted
into a proxy request to http://backend.example.com/bar
(the functionality ProxyPass
provides here). It also takes care
of redirects the server backend.example.com
sends: when
http://backend.example.com/bar
is redirected by him to
http://backend.example.com/quux
Apache httpd adjusts this to
http://example.com/mirror/foo/quux
before forwarding the HTTP
redirect response to the client. Note that the hostname used for
constructing the URL is chosen in respect to the setting of the
Note that this RewriteRule ... [P]
) from
The optional interpolate keyword (available in
httpd 2.2.9 and later), used together with
When used inside a
This directive is not supported in
Usage is basically similar to
domain
string in Set-Cookie
headers.
Useful in conjunction with
path
string in
Set-Cookie
headers. If the beginning of the cookie path matches
internal-path, the cookie path will be replaced with
public-path.
In the example given with
will rewrite a cookie with backend path /
(or
/example
or, in fact, anything) to /mirror/foo/
.
The
Note that example
would also be sufficient to match any
of these sites.
Hosts would also be matched if referenced by IP address.
Note also that
blocks connections to all sites.
The 512
or set
to 0
to indicate that the system's default buffer size should
be used.
The 512
.
In almost every case there's no reason to change that value.
If used with AJP this directive sets the maximum AJP packet size in
bytes. If you change it from the default, you must also change the
packetSize
attribute of your AJP connector on the
Tomcat side! The attribute packetSize
is only available
in Tomcat 5.5.20+
and 6.0.2+
Normally it is not necessary to change the maximum packet size. Problems with the default value have been reported when sending certificates or certificate chains.
The Max-Forwards
header supplied with the request. This may
be set to prevent infinite proxy loops, or a DoS attack.
Note that setting Max-Forwards
if the Client didn't set it.
Earlier Apache httpd versions would always set it. A negative
This directive is only useful for Apache httpd proxy servers within
intranets. The
The host arguments to the
A Domain is a partially qualified DNS domain name, preceded by a period. It represents a list of hosts which logically belong to the same DNS domain or zone (i.e., the suffixes of the hostnames are all ending in Domain).
To distinguish Domains from Hostnames (both syntactically and semantically; a DNS domain can have a DNS A record, too!), Domains are always written with a leading period.
Domain name comparisons are done without regard to the case, and
Domains are always assumed to be anchored in the root of the
DNS tree, therefore two domains .ExAmple.com
and
.example.com.
(note the trailing period) are considered
equal. Since a domain comparison does not involve a DNS lookup, it is much
more efficient than subnet comparison.
A SubNet is a partially qualified internet address in numeric (dotted quad) form, optionally followed by a slash and the netmask, specified as the number of significant bits in the SubNet. It is used to represent a subnet of hosts which can be reached over a common network interface. In the absence of the explicit net mask it is assumed that omitted (or zero valued) trailing digits specify the mask. (In this case, the netmask can only be multiples of 8 bits wide.) Examples:
192.168
or 192.168.0.0
255.255.0.0
)192.168.112.0/21
192.168.112.0/21
with a netmask of 21
valid bits (also used in the form 255.255.248.0
)As a degenerate case, a SubNet with 32 valid bits is the equivalent to an IPAddr, while a SubNet with zero valid bits (e.g., 0.0.0.0/0) is the same as the constant _Default_, matching any IP address.
A IPAddr represents a fully qualified internet address in numeric (dotted quad) form. Usually, this address represents a host, but there need not necessarily be a DNS domain name connected with the address.
An IPAddr does not need to be resolved by the DNS system, so it can result in more effective apache performance.
A Hostname is a fully qualified DNS domain name which can be resolved to one or more IPAddrs via the DNS domain name service. It represents a logical host (in contrast to Domains, see above) and must be resolvable to at least one IPAddr (or often to a list of hosts with different IPAddrs).
In many situations, it is more effective to specify an IPAddr in place of a Hostname since a DNS lookup can be avoided. Name resolution in Apache httpd can take a remarkable deal of time when the connection to the name server uses a slow PPP link.
Hostname comparisons are done without regard to the case,
and Hostnames are always assumed to be anchored in the root
of the DNS tree, therefore two hosts WWW.ExAmple.com
and www.example.com.
(note the trailing period) are
considered equal.
This directive allows a user to specifiy a timeout on proxy requests. This is useful when you have a slow/buggy appserver which hangs, and you would rather just return a timeout and fail gracefully instead of waiting however long it takes the server to return.
This directive is only useful for Apache httpd proxy servers within
intranets. The
Via
HTTP response
header for proxied requestsThis directive controls the use of the Via:
HTTP
header by the proxy. Its intended use is to control the flow of
proxy requests along a chain of proxy servers. See RFC 2616 (HTTP/1.1), section
14.45 for an explanation of Via:
header lines.
Off
, which is the default, no special processing
is performed. If a request or reply contains a Via:
header,
it is passed through unchanged.On
, each request and reply will get a
Via:
header line added for the current host.Full
, each generated Via:
header
line will additionally have the Apache httpd server version shown as a
Via:
comment field.Block
, every proxy request will have all its
Via:
header lines removed. No new Via:
header will
be generated.This directive is useful for reverse-proxy setups, where you want to
have a common look and feel on the error pages seen by the end user.
This also allows for included files (via
This directive does not affect the processing of informational (1xx), normal success (2xx), or redirect (3xx) responses.
This directive, together with the interpolate argument to
varname
for the string ${varname}
in configuration directives (if the interpolate option is set).
Keep this turned off (for server performance) unless you need it!
This directive determines whether or not proxy
loadbalancer status data is displayed via the
Full is synonymous with On
This directive determines whether or not proxy related information should be passed to the backend server through X-Forwarded-For, X-Forwarded-Host and X-Forwarded-Server HTTP headers.
This option is of use only for HTTP proxying, as handled by
This directive allows to set a specific local address to bind to when connecting to a backend server.