driver/xf86-video-intel - Intel video cards driver

Age	Commit message (Collapse)	Author
2011-07-28	render: Refactor to use newly shared pipeline setup code in i965_3d.c.	Kenneth Graunke
	Slightly generalize the shared SF and CC code to accomodate both. Signed-off-by: Kenneth Graunke <kenneth@whitecape.org> Acked-by: Eric Anholt <eric@anholt.net> Acked-by: Chris Wilson <chris@chris-wilson.co.uk>
2011-07-28	Xv: Refactor out pipeline setup functions for future reuse in render.	Kenneth Graunke
	While we're at it, make the functions simply take an intel_screen_private pointer directly instead of having to fetch it from ScrnInfoPtr. Also coalesce some gen6/gen7 functions that were 98% identical. Signed-off-by: Kenneth Graunke <kenneth@whitecape.org> Acked-by: Eric Anholt <eric@anholt.net> Acked-by: Chris Wilson <chris@chris-wilson.co.uk>
2011-07-28	render: Update 3DPRIMITIVE for Ivybridge.	Kenneth Graunke
	Signed-off-by: Kenneth Graunke <kenneth@whitecape.org> Acked-by: Eric Anholt <eric@anholt.net> Acked-by: Chris Wilson <chris@chris-wilson.co.uk>
2011-07-28	render: Set Address Modify Enable in 3DSTATE_VERTEX_BUFFERS on Gen7.	Kenneth Graunke
	Signed-off-by: Kenneth Graunke <kenneth@whitecape.org> Acked-by: Eric Anholt <eric@anholt.net> Acked-by: Chris Wilson <chris@chris-wilson.co.uk>
2011-07-28	render: Update SAMPLER_STATE for Ivybridge.	Kenneth Graunke
	Signed-off-by: Kenneth Graunke <kenneth@whitecape.org> Acked-by: Eric Anholt <eric@anholt.net> Acked-by: Chris Wilson <chris@chris-wilson.co.uk>
2011-07-28	render: Update SURFACE_STATE for Ivybridge.	Kenneth Graunke
	Signed-off-by: Kenneth Graunke <kenneth@whitecape.org> Acked-by: Eric Anholt <eric@anholt.net> Acked-by: Chris Wilson <chris@chris-wilson.co.uk>
2011-07-28	render: New Ivybridge assembly programs for render acceleration.	Kenneth Graunke
	These are exactly the same as the ones for Sandybridge, but with message registers translated (hopefully) in the same way as Haihao's new programs (m1 == g65). Signed-off-by: Kenneth Graunke <kenneth@whitecape.org> Acked-by: Eric Anholt <eric@anholt.net> Acked-by: Chris Wilson <chris@chris-wilson.co.uk>
2011-07-26	sna: Revert enabling scan-line wait on SNB	Chris Wilson
	Hanging the machine does indeed prevent video tearing. Just not quite what the user expected... Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=39497 Signed-off-by: Chris Wilson <chris@chris-wilson.co.uk>
2011-07-20	sna: Enable gen6 scan-line waiting	Chris Wilson
	The code was ready and waiting, just forgot to turn it on. Signed-off-by: Chris Wilson <chris@chris-wilson.co.uk>
2011-07-19	dri: Do not tile stencil buffer	Chad Versace
	Until now, the stencil buffer was allocated as a Y tiled buffer, because in several locations the PRM states that it is. However, it is actually W tiled. From the PRM, 2011 Sandy Bridge, Volume 1, Part 2, Section 4.5.2.1 W-Major Format: W-Major Tile Format is used for separate stencil. The GTT is incapable of W fencing, so we allocate the stencil buffer with I915_TILING_NONE and decode the tile's layout in software. This commit mutually depends on the mesa commit: intel: Fix stencil buffer to be W tiled Author: Chad Versace <chad@chad-versace.us> Date: Mon Jul 18 00:37:45 2011 -0700 Signed-off-by: Chad Versace <chad@chad-versace.us> Reviewed-by: Ian Romanick <ian.romanick@intel.com> Acked-by: Kenneth Graunke <kenneth@whitecape.org>
2011-07-13	Disable adding normal RTF modes for an eDP	Chris Wilson
	This is causing a hard hang with 2.6.39+, we don't know why so play safe and disable for the time being. References: https://bugs.freedesktop.org/show_bug.cgi?id=38012 Signed-off-by: Chris Wilson <chris@chris-wilson.co.uk>
2011-07-13	sna: Fast-path single span boxes	Chris Wilson
	These are very common when compositing unclipped trapezoids, and the majority of the overhead is in handling the arbitrary number of boxes and misses out on the constant folding the compiler can do if it is known we have just one box. Signed-off-by: Chris Wilson <chris@chris-wilson.co.uk>
2011-07-13	sna/damage: Avoid testing against a completey damaged region	Chris Wilson
	Signed-off-by: Chris Wilson <chris@chris-wilson.co.uk>
2011-07-13	sna/gen3: Tune emit_spans_primitive_constant	Chris Wilson
	Signed-off-by: Chris Wilson <chris@chris-wilson.co.uk>
2011-07-13	sna/glyphs: Discard GLYPH_PICTURE hint if the glyph doesn't fit into the cache	Chris Wilson
	If the glyph is too big to fit into the cache, than ideally we do want to keep an associated GPU bo around for future use. As it is too large to fit into the cache, it of reasonable size and there is little wastage in allocating indiviual GPU bo for each oversized glyph. Signed-off-by: Chris Wilson <chris@chris-wilson.co.uk>
2011-07-13	sna: Add some extra debugging to the texture upload fallback paths	Chris Wilson
	Signed-off-by: Chris Wilson <chris@chris-wilson.co.uk>
2011-07-13	sna/dri: Fix a couple of typos	Chris Wilson
	Somehow these were lost in the rebasing. Signed-off-by: Chris Wilson <chris@chris-wilson.co.uk>
2011-07-13	sna/dri: Remove the unused id/type members for Resource tracking	Chris Wilson
	...and reduce it to a simple list. Signed-off-by: Chris Wilson <chris@chris-wilson.co.uk>
2011-07-12	dri: Always initialise resource members of DRI2FrameEvent	Chris Wilson
	As we now attempt to always decouple the lists upon freeing the frame event, we need to initialise them along all code paths. Signed-off-by: Chris Wilson <chris@chris-wilson.co.uk>
2011-07-11	sna/dri: Add some simple debugging	Chris Wilson
	Signed-off-by: Chris Wilson <chris@chris-wilson.co.uk>
2011-07-11	sna/dri: Refactor common code for assigning a pixmap to the DRI2 buffer	Chris Wilson
	Signed-off-by: Chris Wilson <chris@chris-wilson.co.uk>
2011-07-11	dri: Enable triple-bufferred pageflips	Chris Wilson
	By popular demand. Triple-buffering trade-offs output latency versus jitter. By having a pre-rendered frame ready to swap in following a pageflip, we avoid the scenario where the latency between receiving the flip complete signal from the kernel, waking up the vsynced application, it render the new frame and then for the server to process the swap request is greater than the frame interval, causing us to miss the vblank. The result is that application can become frame-locked to 30fps. Instead, we report to the application that the first frame swap is immediately completed, supply a new back buffer (or else the rendering would be blocked on waiting for the front-buffer to be swapped away from the scanout) and let them proceed to render the second frame. The second frame is added to the swap queue, and the client throttled to vrefresh. (If the client missed the vblank, the swap queue is empty and the client is immediately woken again, whilst the pageflip is pending.) Note, for practical reasons this only applies to page-flipping, for example, calls to glXSwapBuffer() on fullscreen applications. Signed-off-by: Chris Wilson <chris@chris-wilson.co.uk>
2011-07-11	dri: Prevent abuse of the Resource database	Chris Wilson
	The Resource database is only designed to store a single value for a particular type associated with an XID. Due to the asynchronous nature of the vblank/flip requests, we would often associate multiple frame events with a particular drawable/client. Upon freeing the resource, we would not necessarily decouple the right value, leaving a stale pointer behind. Later when the client disappeared, we would write through that stale pointer upsetting valgrind and causing memory corruption. MDK. Instead, we need to implement an extra layer for tracking multiple frames within a single Resource. Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=37700 Signed-off-by: Chris Wilson <chris@chris-wilson.co.uk>
2011-07-09	dri: Remove the shadow copy of attachment	Chris Wilson
	Signed-off-by: Chris Wilson <chris@chris-wilson.co.uk>
2011-07-09	sna: Clamp results for computing BoxRec coords from xRectangle	Chris Wilson
	As the width/height in the rectangle is specified as uint16_t, the result may be larger than is storagable in the int16_t of the box. Of course it would take a really inane client to do attempt to draw something much larger than the largest possible surface... Is it strange that first example I've found to do so is a Java application? Reported-by: Nicolas Kalkhof <nkalkhof@web.de> Signed-off-by: Chris Wilson <chris@chris-wilson.co.uk>
2011-07-09	sna/accel: Fix fallback for depth=1 copy	Chris Wilson
	A little carelessness with passing down the offsets caused us to incorrectly copy depth=1 bitmaps, as exemplified by gkrellm. Reported-by: Nicolas Kalkhof <nkalkhof@web.de> Signed-off-by: Chris Wilson <chris@chris-wilson.co.uk>
2011-07-08	sna: A buffer only needs a flush if it remains dirty at the end of the batch	Chris Wilson
	Signed-off-by: Chris Wilson <chris@chris-wilson.co.uk>
2011-07-08	sna/gen3: So we also need to ensure stippling is cleared...	Chris Wilson
	My theory that we used nothing that invoked polygon stippling proved baseless. Fixes regression from 3b5971bd2359383cb8326702d80e03bc15d34c69 Signed-off-by: Chris Wilson <chris@chris-wilson.co.uk>
2011-07-08	sna/gen3: Restore disabling the use of stencil/fog in the invariant	Chris Wilson
	One cleanup too far causing spurious results after rebooting. We also need to ensure that the writemask is fully enabled (ie not disabled) as well. Signed-off-by: Chris Wilson <chris@chris-wilson.co.uk>
2011-07-08	sna/dri: Enable chaining of page-flips	Chris Wilson
	Trade off extra frames of latency for extra frames of anti-jitter buffering and loss of completion information; compiz users rejoice. Signed-off-by: Chris Wilson <chris@chris-wilson.co.uk>
2011-07-07	sna/dri: Remove redundant NULL check in reference	Chris Wilson
	The buffer has already been dereferenced by this point... Signed-off-by: Chris Wilson <chris@chris-wilson.co.uk>
2011-07-07	sna: Take advantage of the needs_flush tracking on the front buffer	Chris Wilson
	Signed-off-by: Chris Wilson <chris@chris-wilson.co.uk>
2011-07-06	sna: Check against an execbuffer reference before discarding partials	Chris Wilson
	Signed-off-by: Chris Wilson <chris@chris-wilson.co.uk>
2011-07-06	sna: Compute aligned tiled heights for gen2 correctly	Chris Wilson
	We were underestimating the height of X-tiled surfaces (and less harmfully overestimating the height of Y-tiled surfaces.) Signed-off-by: Chris Wilson <chris@chris-wilson.co.uk>
2011-07-05	sna: Reset unused partial buffers	Chris Wilson
	Whilst searching for available space on the active partial buffer list, if we discover an unreferenced one, reset its used counter to zero. Signed-off-by: Chris Wilson <chris@chris-wilson.co.uk>
2011-07-05	sna: Discard unused partial buffers	Chris Wilson
	If we allocate a partial buffer and then fallback for the operation, the buffer would remain on the partial list waiting for another user. Discard any unused partials at the next batch submission or expiration point. Signed-off-by: Chris Wilson <chris@chris-wilson.co.uk>
2011-07-05	sna/gen2: Restore invariant ENABLES	Chris Wilson
	One deletion too many, unnoticed until the next reboot. Besides the failure to disable logic op and enable colour buffer blending which causes a hang if you subsequently try to enable both, you also need to request texture caching... Signed-off-by: Chris Wilson <chris@chris-wilson.co.uk>
2011-07-05	sna/gen[23]: Do as the comments suggest and prefer the BLT	Chris Wilson
	Signed-off-by: Chris Wilson <chris@chris-wilson.co.uk>
2011-07-05	sna: Reduce tiling if pitch is less than a tile_width/height only on pre-G33	Chris Wilson
	(Note this only applies to 2D pixmaps.) The rationale, borne out by experimentation with cairo-perf-trace, is that on the pre-G33 devices we always need a fence region region for tiled surfaces, i.e. at least .5/1MiB in size, and that combined with the smaller GTT on those devices, we loose the benefit of tiling to the excessive GTT thrashing. Signed-off-by: Chris Wilson <chris@chris-wilson.co.uk>
2011-07-05	sna: Remove unused aperture_size member	Chris Wilson
	Signed-off-by: Chris Wilson <chris@chris-wilson.co.uk>
2011-07-04	sna: Clamp object size to the min of 1/4 of the whole GTT or 1/2 the mappable	Chris Wilson
	... for those pesky early devices whose GTT was no larger than the AGP aperture. Signed-off-by: Chris Wilson <chris@chris-wilson.co.uk>
2011-07-04	sna: Update flush/retirement lists after a implicit flush for mmap	Chris Wilson
	Signed-off-by: Chris Wilson <chris@chris-wilson.co.uk>
2011-07-04	sna: Enable relaxed-fencing for gen2 devices	Chris Wilson
	(Just as dependent upon non-buggy kernels as gen3...) Signed-off-by: Chris Wilson <chris@chris-wilson.co.uk>
2011-07-04	sna: Fix gen2 tiled surface sizes	Chris Wilson
	Actually use the gen2 path for gen2 devices! Signed-off-by: Chris Wilson <chris@chris-wilson.co.uk>
2011-07-04	sna/gen2: fix batch buffer acounting	Chris Wilson
	Signed-off-by: Chris Wilson <chris@chris-wilson.co.uk>
2011-07-04	sna: Manually set to the GTT domain for mmap	Chris Wilson
	...since the kernel no longer does strict coherency. Signed-off-by: Chris Wilson <chris@chris-wilson.co.uk>
2011-07-04	sna: Trim number of downsample passes	Chris Wilson
	If we can fit the entire width or the entire height into the pipeline when downsampling, do so. Signed-off-by: Chris Wilson <chris@chris-wilson.co.uk>
2011-07-04	sna: Don't change tiling modes on replace	Chris Wilson
	This was trying to workaround a kernel bug, and instead causes a performance cliff for textures that need to be tiled. Signed-off-by: Chris Wilson <chris@chris-wilson.co.uk>
2011-07-04	sna: Force tiled modes for large pitches	Chris Wilson
	If the surface is so big that the 2x2 texel sampling will cause a TLB miss everytime, i.e. the row pitch exceeeds 4096, then we need to encourage tiling to prevent attrocious performance. For example, try downscaling a 2560x1600 background image on a gen3 device using I915_TILING_NONE... Using slideshow-demo /usr/share/backgrounds/cosmos/whirlpool.jpg, on a PineView netbook, fps goes from under 4 to over 40. Signed-off-by: Chris Wilson <chris@chris-wilson.co.uk>
2011-07-04	sna: Finer-grained debugging for trapezoids	Chris Wilson
	Signed-off-by: Chris Wilson <chris@chris-wilson.co.uk>