Jump to content


 


Register a free account to unlock additional features at BleepingComputer.com
Welcome to BleepingComputer, a free community where people like yourself come together to discuss and learn how to use their computers. Using the site is easy and fun. As a guest, you can browse and view the various discussions in the forums, but can not create a new topic or reply to an existing one unless you are logged in. Other benefits of registering an account are subscribing to topics and forums, creating a blog, and having no ads shown anywhere on the site.


Click here to Register a free account now! or read our Welcome Guide to learn how to use this site.

Photo

Opening Corrupted Word file in Notepad


  • Please log in to reply
6 replies to this topic

#1 52corvair

52corvair

  • Members
  • 3 posts
  • OFFLINE
  •  
  • Local time:10:31 PM

Posted 03 November 2014 - 02:03 PM

Hi all, a new member here.

 

I was wondering if anybody could assist with a question. Windows 7, Office 2010 user here. I received a corrupted WORD file sent by someone as part of a work assignment that couldn't be opened. 

 

And then, the supposedly same WORD file sent thereafter (hours) which opens properly in Word 2010. The file size is different (107Kb for the corrupted one vs 118 Kbs for the uncorrupted one). 

 

When I open the corrupted file in Notepad, there is a: 

 

"Document Microsoft Word    MSWordDoc   Word.Document.8 ô9²q"

 

at the end. That is the only 'searchable' text. From my understanding, Word 8 is a 1997-2003 Word version. 

 

 

When I open the Uncorrupted file in Notepad, there is alot more searchable text for "Word", as well as "excel" (but I cannot find any reference to the the Word version). 

 

ie. 

"word/stylesWithEffects.xmlPK"

"word/embeddings/Microsoft_Excel_Worksheet2.xlsxPK"

 

The paragraph format of the jumbled text paragraphs are notably different too. 

 

Questions: 

1) Is there any way that the corrupted Word file 'magically' messed up that bad? 

 

2) Should the 2 files in Notepad open the same way? (type of text, shape of paragraph, size of document). Or does unintended corruption result in such a stark difference in Notepad? 

 

3) Any way to tell if it was intentionally done?  

 

Thank you in advance. 


Edited by hamluis, 04 November 2014 - 04:23 PM.
Moved from Win 7 to Business Apps - Hamluis.


BC AdBot (Login to Remove)

 


#2 JohnC_21

JohnC_21

  • Members
  • 24,426 posts
  • OFFLINE
  •  
  • Gender:Male
  • Local time:11:31 PM

Posted 03 November 2014 - 02:51 PM

Hello, and Welcome

 

If the file was transferred via email or the internet there is always a chance it could get corrupted in transit.

 

Corruption can result in a difference in Notepad results.

 

No, you can't tell if this was intentionally done.

 

Try opening the file with another word processor. You can try the portable version of Abiword or the portable version of Libre Office Write. You just double click the downloaded file from portable apps and extract to a folder on your desktop. Then double click the program exe file. I would start with Abiword as it's only about 8MB.

 

Edit: Have you tried the Open and Repair option?


Edited by JohnC_21, 03 November 2014 - 02:52 PM.


#3 52corvair

52corvair
  • Topic Starter

  • Members
  • 3 posts
  • OFFLINE
  •  
  • Local time:10:31 PM

Posted 03 November 2014 - 03:42 PM

Hello, and Welcome

 

If the file was transferred via email or the internet there is always a chance it could get corrupted in transit.

 

Corruption can result in a difference in Notepad results.

 

No, you can't tell if this was intentionally done.

 

Try opening the file with another word processor. You can try the portable version of Abiword or the portable version of Libre Office Write. You just double click the downloaded file from portable apps and extract to a folder on your desktop. Then double click the program exe file. I would start with Abiword as it's only about 8MB.

 

Edit: Have you tried the Open and Repair option?

 

Hi JohnC_21, 

 

I did try the open and repair function. As well as the other 'Word' recovery functions. I've tried various text recovery extractors as well. The reason we think it may be intentional is the file was sent 3 minutes before the deadline. Right away, we asked to resend and then we didn't hear word from that individual for over 6 hours, only to receive a beautifully done final document (this is for a case that is sent to potential hires) with nothing but a basic "I don't know what happened.. here's another copy".  

 

Would the "Microsoft.document.8" text mean anything? My understanding, files can be purposefully corrupted through various 'edits' through different programs (ie notepad), the purpose of which is a way to say you got something in on time and sent it in, but to buy you more time (a somewhat common practice in Universities nowadays).  

 

I will try your recommendations. 



#4 JohnC_21

JohnC_21

  • Members
  • 24,426 posts
  • OFFLINE
  •  
  • Gender:Male
  • Local time:11:31 PM

Posted 03 November 2014 - 04:47 PM

If possible right click the corrupted file and select Properties. It will show a Created Date and a Modified date. If the file was Created in one Windows version and Saved again in another at a later date, the Creation Date should be identical if we are talking about the same file. Looking under the Details tab can give you some extra information such as what computer and the owner.

 

Edit: Can't help on the Microsoft.document.8. Never saw that before. Maybe somebody else on the forum can give some input on it.

 

Edit Edit. Here is some info I found on Microsoft.document.8.

 

Later readable sections of the file also reveal the file version--which is not necessarily the same thing as the Word version since some Word versions can write binary files of earlier versions and the same file version is used for both Word 97 and Word 2000. The file versions are:

  • Word.Document.6 (Word 6 format)
  • Word.Document.7 (Word 95 format)
  • Word.Document.8 (Word 97 or later format)

 


Edited by JohnC_21, 03 November 2014 - 05:20 PM.


#5 52corvair

52corvair
  • Topic Starter

  • Members
  • 3 posts
  • OFFLINE
  •  
  • Local time:10:31 PM

Posted 03 November 2014 - 11:00 PM

 

The Corrupt property details is completely empty in the corrupt file. Whereas the final working copy has that information. Total edit time is 1 minute. I tried changing the extension to "Zip" as well to extract it, and it says "Folder is empty".


Edited by hamluis, 04 November 2014 - 04:22 PM.
Removed unnecessary quotebox - Hamluis.


#6 JohnC_21

JohnC_21

  • Members
  • 24,426 posts
  • OFFLINE
  •  
  • Gender:Male
  • Local time:11:31 PM

Posted 03 November 2014 - 11:26 PM

Sorry, I don't have any other ideas.

 

What is the Creation date on the good file? That should tell you when the file was first created. If it was a re-submission of the original, I would believe the Creation data would be sometime before the required hand-in time. If the Creation date of the good file was after the submission time then that could be a problem.



#7 keithy_sunny

keithy_sunny

  • Members
  • 1 posts
  • OFFLINE
  •  
  • Local time:07:31 PM

Posted 04 November 2014 - 09:45 AM

When a Word document file is corrupted, then you can try several methods to recover it:

1. First of all, you can try the recovery function integrated with Microsoft Word, as follows:

1) On the File menu, click Open.
2) In the Look in list, click the drive, folder, or Internet location that contains the file that you want to open.
3) In the folder list, locate and open the folder that contains the file.
4) Select the file that you want to recover.
5) Click the arrow next to the Open button, and then click Open and Repair.

You may find more information about this at:

http://office.microsoft.com/en-us/word-help/recover-the-text-from-a-damaged-document-HP005189610.aspx (for Word 2003)
http://support.microsoft.com/kb/893672/en-us (for Word 2007/2010/2013)

2. If you have multiple corrupt Word documents, then you can use the VBA macro provided in article http://support.microsoft.com/kb/893672/en-us so that all the files will be opened in "Open and Repair" option automatically.

3. There are also free tools from third-parties that can open and read Microsoft Word documents, for example,

3.1 OpenOffice at http://www.openoffice.org. This is a very famous open source project that is designed to support Office file formats, including Word documents. The software can run under Windows.

3.2 LibreOffice at http://www.LibreOffice.org. Another free office suite.

3.3 AbiWord at http://www.abisource.com. This is a cross-platform tool that works under Unix and Windows.

3.4 Google Drive at https://drive.google.com/ also support to load Word document files.
 
Sometimes when Word fails to open your document, these tools may be able to open it successfully. If that is the case, then after the document is opened, you can just save it as a new document which will be error-free.

4. For docx files, they are actually a group of files compressed in Zip file format. Therefore, sometimes, if the corruption is only caused by the Zip file, then you can use Zip repair tools such as WinRAR at http://www.rarlab.com to repair the file, as follows:

4.1 Assuming the corrupt document is a.docx, then you need to rename it to a.zip
4.2 Start WinRAR, go to "Tools > Repair Archive" to repair a.zip and generated a fixed file a_fixed.zip.
4.3 Rename a_fixed.zip back to a_fixed.doc
4.4 Using Word to open a_fixed.doc.
 
There may still be some warnings when opening the fixed file in Word, just let ignore it and Word will try to open and repair the fixed file. If the file can be opened successfully, then you can just save the contents into another error-free file.

5. If all above methods does not work, then you may try third-party tools such as DataNumen Word Repair at

http://www.datanumen.com/word-repair/

I have used it to repair some word documents successfully. It provides a free demo version so that you can try to see if the data you want can be recovered or not.

Good luck!


Edited by hamluis, 04 November 2014 - 04:23 PM.





0 user(s) are reading this topic

0 members, 0 guests, 0 anonymous users