# Quick and Easy Way o convert ASP.NET Web Page to PDF

ASP.NET is an open-source server-side web application framework designed for web development to produce dynamic web pages developed by Microsoft to allow programmers to build dynamic web sites, applications and services.

iTextSharp (now iText) is a library for creating and manipulating PDF files in Java and .NET. iText was written by Bruno Lowagie. The source code was initially distributed as open source under the Mozilla Public License.

In this post, I will show how to convert web page to pdf using

<%@ Page Language="C#" AutoEventWireup="true" CodeFile="Pdf.aspx.cs" Inherits="Pdf" %>

<!DOCTYPE html PUBLIC "-//W3C//DTD XHTML 1.0 Transitional//EN" "http://www.w3.org/TR/xhtml1/DTD/xhtml1-transitional.dtd">
<html xmlns="http://www.w3.org/1999/xhtml">
<title></title>
<body>
<form id="form1" runat="server">
<asp:PlaceHolder ID="PlaceholderPdf" runat="server"></asp:PlaceHolder>
<div>
<table border="1">
<tr>
<td colspan="2">
aspdotnetcodebook
</td>
</tr>
<tr>
<td>
cell1
</td>
<td>
cell2
</td>
</tr>
<tr>
<td colspan="2">
<asp:Label ID="lblLabel" runat="server" Text="Label Test"></asp:Label>
</td>
</tr>
</table>
</div>
</form>
</body>
</html>

using System;
using System.Collections.Generic;
using System.Linq;
using System.Web;
using System.Web.UI;
using System.Web.UI.WebControls;
using System.IO;
using System.Text.RegularExpressions;
using iTextSharp.text;
using iTextSharp.text.pdf;
using iTextSharp.text.html;
using iTextSharp.text.xml;
using System.Xml;
using iTextSharp.text.html.simpleparser;
public partial class Pdf : System.Web.UI.Page
{
protected override void Render(HtmlTextWriter writer)
{
MemoryStream mem = new MemoryStream();
StreamWriter twr = new StreamWriter(mem);
HtmlTextWriter myWriter = new HtmlTextWriter(twr);
base.Render(myWriter);
myWriter.Flush();
myWriter.Dispose();
strmRdr.BaseStream.Position = 0;
strmRdr.Dispose();
mem.Dispose();
writer.Write(pageContent);
CreatePDFDocument(pageContent);

}
public  void CreatePDFDocument(string strHtml)
{

string strFileName = HttpContext.Current.Server.MapPath("test.pdf");
// step 1: creation of a document-object
Document document = new Document();
// step 2:
// we create a writer that listens to the document
PdfWriter.GetInstance(document, new FileStream(strFileName, FileMode.Create));
HTMLWorker obj = new HTMLWorker(document);
document.Open();
obj.Parse(se);
document.Close();
ShowPdf(strFileName);

}
public void ShowPdf(string strFileName)
{
Response.ClearContent();
Response.ContentType = "application/pdf";
Response.WriteFile(strFileName);
Response.Flush();
Response.Clear();
}
}


1. Very good job....It helped me a lot..

2. To simply and fast create pdf file use .rdlc file in VStudio. There is a WYSWIG that help you do it :)

3. Hi I am getting following error
Any Ideas

Description: An unhandled exception occurred during the execution of the current web request. Please review the stack trace for more information about the error and where it originated in the code.

Source Error:

Line 52: HTMLWorker obj = new HTMLWorker(document);
Line 53: document.Open();
Line 54: obj.Parse(se);
Line 55: document.Close();
Line 56: ShowPdf(strFileName);

4. This piece of code is not working verry well tabels are always 100% width and images are not in position.

somebody any ideas?

5. Even I am getting a message and a piece of code is not working. It sstopped at obj.Parse(se);
and says Could not find a part of the path 'C:\Images\logo.JPG'.

6. Even I am getting a message and a piece of code is not working. It sstopped at obj.Parse(se);
and says Could not find a part of the path 'C:\Images\logo.JPG'.

7. HI
Please debug and send me exact error message

8. HI
Please debug and send me exact error message

9. I am rajesh m Somvanshi
I Use iTextSharp.dll for Convert ASPX page in PDF Formate That is Done Successfully But My CSS is Not working in PDF File

10. Hi Rajesh,
If you want to maintain css then you may create table structure manually.For more details check out this link
http://forums.asp.net/t/1433490.aspx

11. Hi all
If page contain any image control or file then How would we render it to PDF?

12. Hi GyanPrakash,
Have u tried my code?

13. yes i have tried , it works only simple pages
if page contain any images ,it couldnt work.

14. Ok I will look into this and get back to u soon.

15. hi Santhosh,

Iam getting the following error while runing this sample:
The above error raised at "CreatePDFDocument()" method at line "obj.Parse(se); "

Plz help me as soon as posible. its very urgent for me. Any will be highly appriciatable.

Thanks,
kumar

16. Hi Kumar
From your error message it is clear that the asp.net user does not have access write to c drive.
Do one thing create a temp folder in c drive and write the newly created pdf to that folder.

string strFileName = @"C:\temp\test.pdf"

17. Hi Santosh
Could you plese suggest me what will be the variable to be used in render method.

18. I use online service http://www.web2pdfconvert.com/pdf-api.aspx
Works great, I just pass URL of web page which I want to convert using REST and get PDF as stream back. But this solution only works if web pages are accessible from online.

doesnt work with images .
it gives error in
obj.Parse(se);
so if there is solution then please tell me .

20. Hi,
is very good,
but one problem itextsharp no found with style,

for example

width:500px

doesnt work with images .
it gives error in
obj.Parse(se);
so if there is solution then please tell me .

22. If u want image u need to write server path of image in img tag like,

src="http://localhost:1060/Project/images/image1.jpg" alt="ImageName"

cheers!!!

23. i m getting error on this line
obj.Parse(se);

error is
1.The best overloaded method match for 'iTextSharp.text.html.simpleparser.HTMLWorker.Parse(System.IO.StreamReader)' has some invalid arguments E:\vikas\reporteg1\WebApplication7\Default.aspx.cs 232 14 WebApplication7
2.Argument '1': cannot convert from 'System.IO.StringReader' to 'System.IO.StreamReader' E:\vikas\reporteg1\WebApplication7\Default.aspx.cs 232 24 WebApplication7

can u just tell me what is wrong

24. Hi,

I m Vinay. I am very much thank you for your Code.

25. Basic html for styling like bgcolor, width ,border=0.5 ,align , font color , size

For image

iTextSharp.text.Image gif = iTextSharp.text.Image.GetInstance((Server.MapPath("/") + "123451.jpg"));

26. hi i m using ur code in placeholder1 which contain a table.i got this error please help me

System.InvalidCastException: Unable to cast object of type 'iTextSharp.text.html.simpleparser.CellWrapper' to type 'iTextSharp.text.Rectangle'.

27. Hi i am using ur code in placeholder which contain 5 or 6 big images i got error

28. Hi,
I'm using the above code.There is image in my page.While executing,it's giving error as "Input string is not in correct format" on line

obj.Parse(se);

can anybody help me to resolve this?

29. hi..if there are merged cells in grid view then merging disappears in pdf

30. i got a itext error ... would u pleas how to solves this name spaces error ..

32. hi,i have uploaded the word document using the code provided by your blog

public partial class Default5 : System.Web.UI.Page
{
protected void Page_Load(object sender, EventArgs e)
{

}
protected void btnRead_Click(object sender, EventArgs e)
{

ApplicationClass wordApp = new ApplicationClass();

object file = filePath;

object nullobj = System.Reflection.Missing.Value;

Document doc = wordApp.Documents.Open(ref file,
ref nullobj,
ref nullobj,
ref nullobj,
ref nullobj,
ref nullobj,
ref nullobj,
ref nullobj,
ref nullobj,
ref nullobj,
ref nullobj,
ref nullobj,
ref nullobj,
ref nullobj,
ref nullobj,
ref nullobj);

Document doc1 = wordApp.ActiveDocument;

string m_Content = doc1.Content.Text;

TextBox1.Text = m_Content;

doc.Close(ref nullobj, ref nullobj, ref nullobj);
}
}

it is working fine,i have a requirement that is,after uploading the word document if i make any modifications in that it should be saved,how can do this?

33. hi , can you let know off how to extract a specific named div . I want to extract export only a part of the page . Thanks for the help

34. Hi, thanks for the code. Can you tell me how you are calling the "Render" method, though? Thanks.

35. convert aspx to pdf. using css design not appear in pdf

36. can u please tell me that what's the use of Render method here...?

htmlparser.Parse(sr);

Response.ContentType = "application/pdf";
Response.Cache.SetCacheability(HttpCacheability.NoCache);
ExportText = FormatText(ExportText);
Document pdfDoc = new Document(PageSize.A4, 20, 20, 50, 40);
HTMLWorker htmlparser = new HTMLWorker(pdfDoc);
PdfWriter.GetInstance(pdfDoc, Response.OutputStream);
pdfDoc.Open();
htmlparser.Parse(sr);
pdfDoc.Close();
Response.Write(pdfDoc);
Response.End();

38. Good blog.....It help me out thanks again:)

39. Thank you for this post, this was very helpful to me in my project. Do you have code about how to disallow anyone from copy image in asp.net?

40. Input string was not in a correct format.

Description: An unhandled exception occurred during the execution of the current web request. Please review the stack trace for more information about the error and where it originated in the code.

Exception Details: System.FormatException: Input string was not in a correct format.

Source Error:

Line 71: iTextSharp.text.html.simpleparser.StyleSheet styles = new iTextSharp.text.html.simpleparser.StyleSheet();
Line 72: iTextSharp.text.html.simpleparser.HTMLWorker hw = new iTextSharp.text.html.simpleparser.HTMLWorker(document);
Line 74: document.Close();
Line 75: ShowPdf("Chap0101.pdf");

Source File: e:\Vijith\SaveData\Default2.aspx.cs Line: 73

Stack Trace:

[FormatException: Input string was not in a correct format.]
System.Number.ParseSingle(String value, NumberStyles options, NumberFormatInfo numfmt) +9422867
System.Single.Parse(String s, IFormatProvider provider) +25
iTextSharp.text.html.simpleparser.CellWrapper..ctor(String tag, ChainedProperties chain) +245
iTextSharp.text.html.simpleparser.HTMLWorker.CreateCell(String tag) +56
iTextSharp.text.html.simpleparser.HTMLTagProcessor_TD.StartElement(HTMLWorker worker, String tag, IDictionary2 attrs) +135
iTextSharp.text.html.simpleparser.HTMLWorker.StartElement(String tag, IDictionary2 attrs) +127
iTextSharp.text.xml.simpleparser.SimpleXMLParser.ProcessTag(Boolean start) +75
iTextSharp.text.xml.simpleparser.SimpleXMLParser.Parse(ISimpleXMLDocHandler doc, ISimpleXMLDocHandlerComment comment, TextReader r, Boolean html) +83
Default2.HTMLToPdf(String HTML, String FilePath) in e:\Vijith\SaveData\Default2.aspx.cs:73
Default2.Button1_Click1(Object sender, EventArgs e) in e:\Vijith\SaveData\Default2.aspx.cs:36
System.Web.UI.WebControls.Button.OnClick(EventArgs e) +118
System.Web.UI.WebControls.Button.RaisePostBackEvent(String eventArgument) +112
System.Web.UI.WebControls.Button.System.Web.UI.IPostBackEventHandler.RaisePostBackEvent(String eventArgument) +10
System.Web.UI.Page.RaisePostBackEvent(IPostBackEventHandler sourceControl, String eventArgument) +13
System.Web.UI.Page.RaisePostBackEvent(NameValueCollection postData) +36
System.Web.UI.Page.ProcessRequestMain(Boolean includeStagesBeforeAsyncPoint, Boolean includeStagesAfterAsyncPoint) +5563

41. htmlparser.Parse(sr);

Response.ContentType = "application/pdf";
Response.Cache.SetCacheability(HttpCacheability.NoCache);
ExportText = FormatText(ExportText);
Document pdfDoc = new Document(PageSize.A4, 20, 20, 50, 40);
HTMLWorker htmlparser = new HTMLWorker(pdfDoc);
PdfWriter.GetInstance(pdfDoc, Response.OutputStream);
pdfDoc.Open();
htmlparser.Parse(sr);
pdfDoc.Close();
Response.Write(pdfDoc);
Response.End();
i m getting html parser error,plz suggets

42. plz suggest thnx
i am using itextsharp for converting my asp page to pdf...............i am facing css issue as my page is in div structure..............also getting error also but getting error:
Protected Sub btnExport_Click(sender As Object, e As EventArgs)
Response.ContentType = "application/pdf"
Response.Cache.SetCacheability(HttpCacheability.NoCache)
Dim sw As New StringWriter()
Dim hw As New HtmlTextWriter(sw)
Me.Page.RenderControl(hw)

Dim pdfDoc As New Document(PageSize.A4, 10.0F, 10.0F, 100.0F, 0.0F)
Dim htmlparser As New HTMLWorker(pdfDoc)
PdfWriter.GetInstance(pdfDoc, Response.OutputStream)
pdfDoc.Open()
htmlparser.Parse(sr)
pdfDoc.Close()
Response.Write(pdfDoc)
Response.[End]()
End Sub

http://www.mikesdotnetting.com/Article/87/iTextSharp-Working-with-images

using itextsharp dll for converting
htmlparser.Parse(sr)

is not defining proper path for the images:

Exception Details: System.IO.DirectoryNotFoundException: Could not find a part of the path 'C:\Images\Layout\an-Logo.png'.

43. Wow muchisimas gracias me sirvio de mucho, funciona perfectamente y es justo lo que estaba buscando. En verdad muchas gracias.

44. Thanks for this code May I knwo cam this work for grid view.In my aspx i have label and gridview alos and check box for label control the data is showing properly but for others it won't show any thing please help me n thans in advance I am tarun Nigam my mail id is nigam_tarun2003@yahoo.co.in

45. Hi,
Iam converting aspx page to pdf by using this..while converting iam getting error "Input string is not in correct format "

Can anyone plz help me

46. I need that my chart must export the CSV file or Excel file so for that i have to write the Function in ASP.NET MVC and I am the Beginner in MVC.

47. Hi Thanks for the code .Actually it works fine but in

obj.Parse(se);

it gives error as illegal character .i thought HTML Worker would not able to parse some of the character in string reader.
Any Idea.

48. i have link in gridview whern i click on pdf_click button then error arise in vb.net coade
Unable to cast object of type 'iTextSharp.text.html.simpleparser.CellWrapper' to type 'iTextSharp.text.Paragraph'.
how can i do

49. Hi all,will you provide solution to me "If page contain any image control or file then How would we render it to PDF? "

50. The process cannot access the file 'H:\AESMSFINAL\AESMS 4-08-2013\test.pdf' because it is being used by another process

1. Sam, try closing the document instance if there is an exception

51. Hi...
My Code

Response.ContentType = "application/pdf";
Response.Cache.SetCacheability(HttpCacheability.NoCache);
StringWriter sw = new StringWriter();
HtmlTextWriter hw = new HtmlTextWriter(sw);
this.Page.RenderControl(hw);
Document pdfDoc = new Document(PageSize.A4, 10f, 10f, 100f, 0.0f);
HTMLWorker htmlparser = new HTMLWorker(pdfDoc);
PdfWriter.GetInstance(pdfDoc, Response.OutputStream);
pdfDoc.Open();
htmlparser.Parse(sr);
pdfDoc.Close();
Response.Write(pdfDoc);
Response.End();
but error Unable to cast object of type 'iTextSharp.text.html.simpleparser.IncTable' to type 'iTextSharp.text.ITextElementArray'.
pls help

52. hi,i want to know how can i convert chart data control to pdf in the above code.

53. I want to convert my chart control with all the controls to pdf.I have done the following updation in thw above code.its givving an error.i am not able to resolve it can anybody help me
protected override void Render(HtmlTextWriter writer)
{

MemoryStream mem = new MemoryStream();
Chart1.SaveImage(mem, ChartImageFormat.Png);
iTextSharp.text.Image img = iTextSharp.text.Image.GetInstance(mem.GetBuffer());
img.ScalePercent(75f);
StreamWriter twr = new StreamWriter(mem);
HtmlTextWriter myWriter = new HtmlTextWriter(twr);
base.Render(myWriter);
myWriter.Flush();
myWriter.Dispose();
strmRdr.BaseStream.Position = 0;
strmRdr.Dispose();
mem.Dispose();
writer.Write(pageContent);
CreatePDFDocument(pageContent);

}
public void CreatePDFDocument(string strHtml)
{
string strFileName = HttpContext.Current.Server.MapPath("test.pdf");
Document document = new Document();
// step 2:
// we create a writer that listens to the document
PdfWriter.GetInstance(document, new FileStream(strFileName, FileMode.Create));
HTMLWorker obj = new HTMLWorker(document);
document.Open();
obj.Parse(se);
document.Close();
ShowPdf(strFileName);

}
public void ShowPdf(string strFileName)
{
Response.ClearContent();
Response.ContentType = "application/pdf";
Response.WriteFile(strFileName);
Response.Flush();
Response.Clear();
}

54. This comment has been removed by the author.

55. Very Simple Article . Thanks

56. Thanks a lot for posting this post, Your post has always been an informative source for me.
Timing Belt

57. The author used iTextSharp to convert web pages into PDF files. It is really great. I often browse the pages of related topics and found the SDK of convert pdf to text c#. It also includes the function of html to pdf. It is also very good. Thanks again. The sharing of bloggers.

58. From html to pdf, the methods are similar, but when you use it in practice, there are a lot of bugs. This article is great and explains a lot of difficulties.