GNU bug report logs - #5118
DocView: copy/search text

Please note: This is a static page, with minimal formatting, updated once a day.

Package: emacs; Severity: wishlist; Reported by: Juri Linkov <juri@HIDDEN>; dated Fri, 4 Dec 2009 00:35:05 UTC; Maintainer for emacs is bug-gnu-emacs@HIDDEN.

Message received at 5118@HIDDEN:

Received: (at 5118) by; 5 Dec 2009 23:24:28 +0000
From juri@HIDDEN Sat Dec  5 15:24:28 2009
From: Juri Linkov <juri@HIDDEN>
To: Stefan Monnier <monnier@HIDDEN>
Cc: 5118 <at>, Kevin Rodgers <kevin.d.rodgers@HIDDEN>,
        Tassilo Horn <tassilo@HIDDEN>
Subject: Re: bug#5118: DocView: copy/search text
Organization: JURTA
References: <87hbs7bvbw.fsf@HIDDEN> <hfd7de$kh5$2@HIDDEN>
Date: Sun, 06 Dec 2009 01:21:00 +0200
In-Reply-To: <jwvpr6tm8jw.fsf-monnier+emacsbugreports@HIDDEN> (Stefan
	Monnier's message of "Sat, 05 Dec 2009 16:12:42 -0500")
Message-ID: <87ein96nmz.fsf@HIDDEN>
User-Agent: Gnus/5.13 (Gnus v5.13) Emacs/23.1.50 (x86_64-pc-linux-gnu)
MIME-Version: 1.0
Content-Type: text/plain; charset=us-ascii

>>>> Please add two features that would be easy to implement:
>>>> 1. Copying the text of the current page to the kill ring.
>>> I don't know anything about DocView, but shouldn't that be possible
>>> in any major mode via C-x C-p (mark-page) then M-w (kill-ring-save)?
>> `C-x C-p' and `M-w' will copy binary data, whereas `pdftotext' converts
>> it to plain text in a hidden buffer that can be displayed with `C-c C-t'
>> where you can do `C-x C-p' and `M-w'.  But the problem is that it's
>> difficult to find the current page in the text representation to be able
>> to copy it.
> doc-view-search-internal seems to just count ^Ls, so assuming
> doc-view-search-internal works (which it appears to do), it shouldn't be
> that hard.

I meant that it's difficult for the user to locate, in the text buffer
(with the output from `pdftotext'), the page that corresponds to the one
currently displayed in the image buffer.

I don't mind having a command that would switch from the image buffer
to the text buffer and narrow it to the current page.  That would make it
easy to copy any text from the textual representation of the current page.
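
The page lookup that such a command needs can be sketched outside Emacs.
This is a minimal Python illustration, not DocView code.  It assumes the
text came from `pdftotext' with its default behaviour of ending each page
with a form feed (^L); `page_text' and `sample' are hypothetical names
used only here.

```python
FORM_FEED = "\f"  # assumption: each page in the pdftotext output ends with ^L


def page_text(full_text, page):
    """Return the text of the 1-based PAGE from form-feed-delimited text."""
    pages = full_text.split(FORM_FEED)
    # A trailing form feed leaves one empty chunk at the end; drop it.
    if pages and pages[-1] == "":
        pages.pop()
    if not 1 <= page <= len(pages):
        raise ValueError("page %d out of range 1..%d" % (page, len(pages)))
    return pages[page - 1]


# Hypothetical three-page document.
sample = "first page\n\fsecond page\n\fthird page\n\f"
print(page_text(sample, 2))
```

Narrowing the text buffer to the current page amounts to the same lookup:
find the chunk between the (N-1)th and Nth form feed.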

Juri Linkov

Acknowledgement sent to Juri Linkov <juri@HIDDEN>:
Extra info received and forwarded to list. Copy sent to Emacs Bugs <bug-gnu-emacs@HIDDEN>, owner@HIDDEN. Full text available.
Information forwarded to bug-submit-list@HIDDEN, Emacs Bugs <bug-gnu-emacs@HIDDEN>, owner@HIDDEN:
bug#5118; Package emacs,doc-view. Full text available.

Message received at 5118@HIDDEN:

Received: (at 5118) by; 5 Dec 2009 21:12:49 +0000
From monnier@HIDDEN Sat Dec  5 13:12:49 2009
From: Stefan Monnier <monnier@HIDDEN>
To: Juri Linkov <juri@HIDDEN>
Cc: 5118 <at>, Kevin Rodgers <kevin.d.rodgers@HIDDEN>,
        Tassilo Horn <tassilo@HIDDEN>
Subject: Re: bug#5118: DocView: copy/search text
Message-ID: <jwvpr6tm8jw.fsf-monnier+emacsbugreports@HIDDEN>
References: <87hbs7bvbw.fsf@HIDDEN> <hfd7de$kh5$2@HIDDEN>
Date: Sat, 05 Dec 2009 16:12:42 -0500
In-Reply-To: <87pr6te9tt.fsf@HIDDEN> (Juri Linkov's message of "Sat,
	05 Dec 2009 21:47:50 +0200")
User-Agent: Gnus/5.13 (Gnus v5.13) Emacs/23.1.50 (gnu/linux)
MIME-Version: 1.0
Content-Type: text/plain; charset=us-ascii

>>> Please add two features that would be easy to implement:
>>> 1. Copying the text of the current page to the kill ring.
>> I don't know anything about DocView, but shouldn't that be possible
>> in any major mode via C-x C-p (mark-page) then M-w (kill-ring-save)?

> `C-x C-p' and `M-w' will copy binary data, whereas `pdftotext' converts
> it to plain text in a hidden buffer that can be displayed with `C-c C-t'
> where you can do `C-x C-p' and `M-w'.  But the problem is that it's
> difficult to find the current page in the text representation to be able
> to copy it.

doc-view-search-internal seems to just count ^Ls, so assuming
doc-view-search-internal works (which it appears to do), it shouldn't be
that hard.
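
The counting idea can be shown with a short sketch.  This is a Python
illustration under the assumption that pages are separated by form feeds
in the `pdftotext' output; it is not the code of
doc-view-search-internal, and `pages_with_matches' is a hypothetical name.

```python
import re


def pages_with_matches(full_text, pattern):
    """Return the 1-based page numbers that contain a match, in order."""
    pages = []
    for m in re.finditer(pattern, full_text):
        # The page number is the count of form feeds before the match, plus 1.
        page = full_text.count("\f", 0, m.start()) + 1
        if not pages or pages[-1] != page:
            pages.append(page)
    return pages


# Hypothetical three-page document.
sample = "alpha\n\fbeta alpha\n\fgamma\n\f"
print(pages_with_matches(sample, "alpha"))
```

The same count, applied to the page shown in the image buffer, gives the
position of that page in the text buffer.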


Acknowledgement sent to Stefan Monnier <monnier@HIDDEN>:
Extra info received and forwarded to list. Copy sent to Emacs Bugs <bug-gnu-emacs@HIDDEN>, owner@HIDDEN. Full text available.
Information forwarded to bug-submit-list@HIDDEN, Emacs Bugs <bug-gnu-emacs@HIDDEN>, owner@HIDDEN:
bug#5118; Package emacs,doc-view. Full text available.

Message received at 5118@HIDDEN:

Received: (at 5118) by; 5 Dec 2009 19:52:41 +0000
From juri@HIDDEN Sat Dec  5 11:52:41 2009
From: Juri Linkov <juri@HIDDEN>
To: Kevin Rodgers <kevin.d.rodgers@HIDDEN>
Cc: 5118 <at>, Tassilo Horn <tassilo@HIDDEN>
Subject: Re: bug#5118: DocView: copy/search text
Organization: JURTA
References: <87hbs7bvbw.fsf@HIDDEN> <hfd7de$kh5$2@HIDDEN>
Date: Sat, 05 Dec 2009 21:47:50 +0200
In-Reply-To: <hfd7de$kh5$2@HIDDEN> (Kevin Rodgers's message of "Sat, 05
	Dec 2009 01:59:26 -0700")
Message-ID: <87pr6te9tt.fsf@HIDDEN>
User-Agent: Gnus/5.13 (Gnus v5.13) Emacs/23.1.50 (x86_64-pc-linux-gnu)
MIME-Version: 1.0
Content-Type: text/plain; charset=us-ascii

>> Please add two features that would be easy to implement:
>> 1. Copying the text of the current page to the kill ring.
> I don't know anything about DocView, but shouldn't that be possible
> in any major mode via C-x C-p (mark-page) then M-w (kill-ring-save)?

`C-x C-p' and `M-w' will copy binary data, whereas `pdftotext' converts
it to plain text in a hidden buffer that can be displayed with `C-c C-t'
where you can do `C-x C-p' and `M-w'.  But the problem is that it's
difficult to find the current page in the text representation to be able
to copy it.

Juri Linkov

Acknowledgement sent to Juri Linkov <juri@HIDDEN>:
Extra info received and forwarded to list. Copy sent to Emacs Bugs <bug-gnu-emacs@HIDDEN>, owner@HIDDEN. Full text available.
Information forwarded to bug-submit-list@HIDDEN, Emacs Bugs <bug-gnu-emacs@HIDDEN>, owner@HIDDEN:
bug#5118; Package emacs,doc-view. Full text available.

Message received at submit@HIDDEN:

Received: (at submit) by; 5 Dec 2009 09:00:04 +0000
From geb-bug-gnu-emacs@HIDDEN Sat Dec  5 01:00:03 2009
To: bug-gnu-emacs@HIDDEN
From: Kevin Rodgers <kevin.d.rodgers@HIDDEN>
Subject: Re: bug#5118: DocView: copy/search text
Date: Sat, 05 Dec 2009 01:59:26 -0700
Lines: 19
Message-ID: <hfd7de$kh5$2@HIDDEN>
References: <87hbs7bvbw.fsf@HIDDEN>
Mime-Version: 1.0
Content-Type: text/plain; charset=ISO-8859-1; format=flowed
Content-Transfer-Encoding: 7bit
X-Complaints-To: usenet@HIDDEN
User-Agent: Thunderbird (Macintosh/20090812)
In-Reply-To: <87hbs7bvbw.fsf@HIDDEN>
Sender: news <news@HIDDEN>
X-detected-operating-system: by GNU/Linux 2.6 (newer, 3)

Juri Linkov wrote:
> Package: emacs,doc-view
> Severity: wishlist
> Please add two features that would be easy to implement:
> 1. Copying the text of the current page to the kill ring.

I don't know anything about DocView, but shouldn't that be possible
in any major mode via C-x C-p (mark-page) then M-w (kill-ring-save)?

> PDF viewers allow selecting a rectangular area to copy it to the
> clipboard.  If this is not possible in DocView, then at least copying
> the text of the whole page would be a satisfactory compensation.
> Info and Dired bind similar commands to `w'.

Kevin Rodgers
Denver, Colorado, USA

Acknowledgement sent to Kevin Rodgers <kevin.d.rodgers@HIDDEN>:
Extra info received and forwarded to list. Copy sent to Emacs Bugs <bug-gnu-emacs@HIDDEN>, owner@HIDDEN. Full text available.
Information forwarded to bug-submit-list@HIDDEN, Emacs Bugs <bug-gnu-emacs@HIDDEN>, owner@HIDDEN:
bug#5118; Package emacs,doc-view. Full text available.

Message received at submit@HIDDEN:

Received: (at submit) by; 4 Dec 2009 00:27:55 +0000
From juri@HIDDEN Thu Dec  3 16:27:55 2009
From: Juri Linkov <juri@HIDDEN>
To: bug-gnu-emacs@HIDDEN
Subject: DocView: copy/search text
Organization: JURTA
X-Debbugs-CC: Tassilo Horn <tassilo@HIDDEN>
Date: Fri, 04 Dec 2009 02:03:19 +0200
Message-ID: <87hbs7bvbw.fsf@HIDDEN>
User-Agent: Gnus/5.13 (Gnus v5.13) Emacs/23.1.50 (x86_64-pc-linux-gnu)
MIME-Version: 1.0
Content-Type: text/plain; charset=us-ascii
X-detected-operating-system: by GNU/Linux 2.6, seldom 2.4 (older, 4)

Package: emacs,doc-view
Severity: wishlist

Please add two features that would be easy to implement:

1. Copying the text of the current page to the kill ring.

PDF viewers allow selecting a rectangular area to copy it to the
clipboard.  If this is not possible in DocView, then at least copying
the text of the whole page would be a satisfactory compensation.
Info and Dired bind similar commands to `w'.

2. Search is currently useless in DocView because it doesn't highlight
the current match.

It would be more useful to implement an Occur-like UI.  For instance,
typing `doc-view-occur' would display an *Occur* buffer like:

Page  1:   text text match text text
Page  1:   text text text match text
Page  1:   text text text text match
Page  2:   text match text text text
Page 42:   match text text text text

that allows navigating to the corresponding page.
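
How such a listing could be derived from the text representation is
sketched below.  This is a Python illustration only, assuming the
`pdftotext' output separates pages with form feeds; `occur_lines' is a
hypothetical name, and no `doc-view-occur' command is implied to exist.

```python
import re


def occur_lines(full_text, pattern):
    """Return one 'Page N:   line' entry for every line matching PATTERN."""
    entries = []
    # Pages are numbered from 1 in the order they appear in the text.
    for page_no, page in enumerate(full_text.split("\f"), start=1):
        for line in page.splitlines():
            if re.search(pattern, line):
                entries.append("Page %2d:   %s" % (page_no, line.strip()))
    return entries


# Hypothetical three-page document.
sample = "text match text\nno hit here\n\fother text\n\fmatch again\n\f"
for entry in occur_lines(sample, "match"):
    print(entry)
```

Each entry keeps its page number, which is what a real command would need
in order to jump to that page in the image buffer.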

Juri Linkov

Acknowledgement sent to Juri Linkov <juri@HIDDEN>:
New bug report received and forwarded. Copy sent to tassilo@HIDDEN, Emacs Bugs <bug-gnu-emacs@HIDDEN>, owner@HIDDEN. Full text available.
Report forwarded to bug-submit-list@HIDDEN, tassilo@HIDDEN, Emacs Bugs <bug-gnu-emacs@HIDDEN>, owner@HIDDEN:
bug#5118; Package emacs,doc-view. Full text available.
Last modified: Mon, 25 Nov 2019 12:00:02 UTC

GNU bug tracking system
Copyright (C) 1999 Darren O. Benham, 1997 nCipher Corporation Ltd, 1994-97 Ian Jackson.