Location>code7788 >text

C# to write a PDF batch merge tool to simplify daily work

Popularity:332 ℃/2024-09-23 12:05:32

I. Preamble

As the project requires the preparation of a large number of materials, as well as a variety of signature forms, paperwork, etc., and finally to PDF as the material delivered as a document format, the process of document changes or additions, so the handling of PDF documents has become part of the daily work.
Online there are a variety of PDF processing tools, always feel the use of the hand does not follow. Finally review their needs are summarized as follows:
1. You can easily and quickly merge multiple PDFs.
2. You can select the specified page number from the source PDF to merge.
3. You can extract specific page numbers from a single PDF (split PDF).
4. Grouping of multiple PDFs, combined as the final PDF navigation bookmarks, can be quickly located.
5. Unified after the synthesis of PDF page size, such as the unified A4 format.
6. The operation is as easy as possible, support for file drag and drop, do not need fancy things.

II. Final effect

First, let's look at the final product:

①. You can batch add multiple PDFs to the list box, or you can Data Manager to batch drag files in to realize the addition.
②. [Optional] Define grouping headers to group files, also as bookmarks for the merged PDF.
③. Batch merge PDFs from the list into one file. Convert to Split function if only PDFs are available and page number range is defined.
④. Display the total number of PDF pages, if you only need to extract part of the content, you can define the page number range.
⑤. You can change the size of the merged PDF pages to be unified in A4, B4 or A5 format.

III. Functional realization

Searching found that github has an open sourcePdfBinder1.2(/schourode/pdfbinder)It is closer to the desired effect, and based on the principle of saving as much as you can, minimizing cost, and improving energy efficiency, you can directly use it as a basis for expansion and development of your own desired functions.

1. Add files

This is relatively simple, click the button to pop up the selection dialog box, the selected files will be added to the ListBox one by one.

private void addFileButton_Click(object sender, EventArgs e)
{

    if (() == )
    {
        foreach (string file in )
        {
            AddInputFile(file);
        }
        UpdateUI();
    }
}

The AddInputFile function is written separately in order to reuse it in drag-and-drop events.

public void AddInputFile(string file)
{
    int Pages = 0;
    switch ((file, out Pages))
    {
        case :
            (((""), file), (""), , );
            break;
        case :
            (((""), file), (""), , );
            break;
        case :
            (new PdfInfo() { Fullname = file, Filename = (file), Ranges = "", TotalPages = Pages });
            break;
    }
}

Here on the validity of the PDF file to check, and added to the ListBox is the PdfInfo object, which also records the total number of pages, the extracted page range and other information.
File drag and drop implementation:

private void FileListBox_DragEnter(object sender, DragEventArgs e)
{
     = (, false) ?  : ;
}
private void FileListBox_DragDrop(object sender, DragEventArgs e)
{
    var fileNames = (string[])();
    (fileNames);

    foreach (var file in fileNames)
    {
        AddInputFile(file);
    }

    UpdateUI();
}

2. Document grouping (bookmarks)

using BookmarkName = ;
private void addBookmarkButton_Click(object sender, EventArgs e)
{
    //No file is added, it is not processed.
    if ( < 0) return;

    // If the selected bookmark (group name), read the name for modification
    BookmarkName bookmark = "";
    if ( is BookmarkName)
        bookmark = (BookmarkName);
    bookmark = (BookmarkName); else
    {
        //If a file is selected, extract the filename as the default value.
        bookmark = ((PdfInfo)).Filename; if (("."))).
        if (("."))
            bookmark = (0, (".")) ;)
    }

    // If the input is valid, add the bookmark (group name)
    BookmarkName newName = ((""), (""), bookmark) ;
    if (newName ! = "")
    {
        if ( is BookmarkName )
            [] = newName; else
        if ( is BookmarkName ) [] = newName; else
        {
            (, newName); }
            BookmarkCounter++;
        }
    }
}

3. Define page ranges

No page range is defined to indicate the entire PDF for merging. If the page range is defined, only the corresponding pages will be extracted for merging.
Page ranges are formatted to match the page number definitions of common print functions, e.g., 1,2,3,6-9.
This action is implemented in the right-click pop-up menu.

private void mnuSetPageRange_Click(object sender, EventArgs e)
{
    PdfInfo item = ((PdfInfo));
    string range = ((""), (""), );
    //No need to deal with unchanged content
    if (range != )
    {
        if (range == "")
        {
            ((PdfInfo)[]).Ranges = "";
            return;
        }

        //Handling of commas and spaces
        string[] arr = (",", ",").Replace(" ", "").Split(',');
        range = "";
        for (int i = 0; i < ; i++)
        {
            //Determining Validity with Regular Expressions
            if ("" == arr[i]) continue;
            if ((arr[i], @"^\d+$") || (arr[i], @"^\d+-\d+$"))
                range += ("" == range ? "" : ",") + arr[i];
            else
            {
                ((""));
                return;
            }
        }
        //Input valid,update
        ((PdfInfo)[]).Ranges = range;
        UpdateUI();
    }
}

4.Customized display

In order to display the bookmarks, the total number of pages and extract the page range in the ListBox, you need to take over the draw event of the ListBox.

private void FileListBox_DrawItem(object sender, DrawItemEventArgs e)
{
    ...
    StringFormat Formater = new StringFormat();
     = ;
     = ;
     = ;
     = ;

    //Drawing bookmarks(cluster name)
    if ([] is BookmarkName)
    {
        //Picture Bookmarks(cluster name)icon (computing)
        (, , + (( - ) /2));
        //Picture Bookmarks(cluster name)
        ((BookmarkName)[], ,
            , new Rectangle( + , , - RIGHT_MARGIN, ), Formater);
        return;
    }

    //draftPDFfilename
    PdfInfo item = (PdfInfo)[];
    ( ? : , ,
        , new Rectangle( + (BookmarkCounter > 0 ? (int)( * 1.5) : 0), , - RIGHT_MARGIN, ), Formater);

    //draft页码
     = ;
    (( == "" ? "" : + " | ")
        + (>1 ? ("Pages"): ("Page"), )
        , , , , Formater);
}

5. Define page size

The default is the original size (no adjustment), which can be selected as A4, A5, B4 as needed.

private void OnPageSizeChanged(object sender, EventArgs e)
{
     = ((ToolStripMenuItem)sender).Tag;
    mnuPageSize_Original.Checked = sender == mnuPageSize_Original;
    mnuPageSize_A4.Checked = sender == mnuPageSize_A4;
    mnuPageSize_A5.Checked = sender == mnuPageSize_A5;
    mnuPageSize_B4.Checked = sender == mnuPageSize_B4;
    if (mnuPageSize_Original.Checked)
         = ("");
    else
         = ("") + ":" + ((ToolStripMenuItem)sender).Text;
}

Batch merge

This one is a bit longer, so if you're interested you can go to/kacarton/PDFBinder2Download the source code and see for yourself, the following excerpts are from the core.

private void combineButton_Click(object sender, EventArgs e)
{
    if (() == )
    {
        using (var combiner = new Combiner(, ()))
        {
             = true;
             = false;

            for (int i = 0; i < ; i++)
            {
                if ([i] is BookmarkName)
                    ((string)[i]);
                else
                    (((PdfInfo)[i]).Fullname, ((PdfInfo)[i]).Ranges);
                //refresh schedule
                 = (int)(((i + 1) / (double)) * 100);
            }

             = true;
             = false;
        }

        ();
    }
}

class Combiner : IDisposable
{
    public void AddFile(string fileName, string range)
    {
        var reader = new PdfReader(fileName);
        ....
        _document.NewPage();
                
        //Add Bookmark
        if (!())
        {
            Chapter _chapter = new Chapter("", 1);
            _chapter.BookmarkTitle = ;
            _chapter.BookmarkOpen = true;
            _document.Add(_chapter);
             = null;
        }

        if (_newPageSize == )
        {
            var page = _pdfCopy.GetImportedPage(reader, i);
            _pdfCopy.AddPage(page);
        }
        else
        {
            var page = _writer.GetImportedPage(reader, i);
            _document.Add((page));
        }

        ();
    }
}

7. Other

UI synchronization, file removal, move up, move down, sorting, multi-language support are relatively simple so I won't expand on them.

Fourth, the code open source

The source code has been posted on github, url:PDFBinder2 /kacarton/PDFBinder2Welcome to the exchange.